New offer - be the first one to apply!

July 2, 2026

Principal Engineer, CPU Architecture & Performance Research

Senior • On-site

240,000 - 249,996 USD/yr

San Jose, CA

Advancing the World's Technology Together

Our technology solutions power the tools you use every day—including smartphones, electric vehicles, hyperscale data centers, IoT devices, and more. This opportunity offers the chance to contribute to next-generation CPU technologies and performance optimization.

What You'll Do

Architecture Research Lab is seeking a Principal CPU Architecture & Performance Engineer to lead the definition, analysis, and optimization of next-generation CPU microarchitectures (RISC-V core). This role focuses on end-to-end performance, including architectural trade-offs, workload characterization, micro-architectural modeling, simulation, and silicon bring-up correlation.

You will work closely with architecture, design, compiler, and system teams to drive performance and efficiency across a broad set of real-world workloads.

Location: Daily onsite presence at the San Jose office in alignment with the Flexible Work policy.

Responsibilities

  • Define and evaluate CPU micro-architectural features for future cores (frontend, execution engine, memory hierarchy, interconnect)
  • Lead performance analysis using simulators and RTL
  • Help develop and validate performance models (cycle-accurate, trace-driven, statistical)
  • Characterize workloads (SPEC, server, client, AI/ML, cloud, internal traces) and translate findings into architectural requirements
  • Identify performance bottlenecks and propose data-driven optimizations
  • Drive architecture-to-implementation alignment with design team
  • Collaborate with compiler, OS, and system architects on cross-stack performance issues
  • Mentor senior and staff engineers and provide technical leadership across projects
  • Contribute to patents and publications

What You Bring

  • Master’s degree with 18+ years of experience in Computer Engineering, Computer Science, or related field, or PhD with 15+ years preferred
  • 10+ years of experience in CPU microarchitecture and/or performance engineering
  • Experience with RISC-V, ARM, or X86 architectures
  • Strong understanding of out-of-order execution, branch prediction, pipelines, speculation, cache coherence, memory systems, prefetching, and NUMA effects
  • Hands-on experience with architectural simulators such as gem5
  • Strong programming skills in C/C++ and Python
  • Familiarity with compiler optimizations and hardware/software co-design
  • Familiarity with SIMD, vectors, and VME for AI inference workloads
  • Experience analyzing large performance datasets and traces
  • Proven ability through tapeouts, patents, or publications to influence architecture and microarchitectural decisions through quantitative analysis

Preferred Qualifications

  • Background in power/performance/area (PPA) trade-off analysis
  • Experience with SIMD and vector technologies
  • Prior technical leadership at Senior Staff or Principal level

Benefits

  • Competitive compensation and incentive opportunities
  • Medical, dental, vision, and 401(k)
  • Charitable giving match and community involvement opportunities
  • 4+ weeks of paid time off plus holidays and sick leave
  • Family support benefits including fertility, adoption, and medical travel support
  • Emotional wellness support including therapy sessions and wellness apps
  • Onsite café, gym, and virtual fitness classes
  • Flexible work environment

Base Pay Range: $219,000—$351,000 USD

Similar jobs you might like

Technology

Samsung

Principal Engineer, CPU Architecture & Performance Research

Senior

On-site

San Jose, CA

20,000 - 20,833 USD/yr

🏢 Summary: Principal-level role leading research, modeling, and optimization of next-generation RISC-V CPU microarchitectures with end-to-end performance focus from architectural exploration to silicon correlation. The position drives performance analysis, workload characterization, and cross-stack optimization in collaboration with architecture, design, compiler, and system teams. It includes technical leadership and influence on future core definitions through quantitative analysis. 🗂️ Requirements: Master’s or PhD in Computer Engineering, Computer Science, or related field, 15+ years of experience with PhD or 18+ years with Master’s, 10+ years of experience in CPU microarchitecture or performance engineering, Experience with RISC-V, ARM, or X86 architectures, Strong knowledge of out-of-order execution, branch prediction, pipelines, speculation, Strong knowledge of cache coherence, memory systems, prefetching, NUMA, Experience with architectural simulators, Proficiency in C/C++ and Python, Experience with workload characterization and performance analysis, Experience influencing architectural decisions through quantitative analysis 📃 Skills: RISC-V, ARM, X86, gem5, C, C++, Python, SIMD, VME, SPEC, NUMA, RTL 🏢 Description: Please Note: To provide the best candidate experience amidst our high application volumes, each candidate is limited to 10 applications across all open jobs within a 6-month period. Advancing the World's Technology Together Our technology solutions power the tools you use every day--including smartphones, electric vehicles, hyperscale data centers, IoT devices, and so much more. Here, you'll have an opportunity to be part of a global leader whose innovative designs are pushing the boundaries of what's possible and powering the future. We believe innovation and growth are driven by an inclusive culture and a diverse workforce. We're dedicated to empowering people to be their true selves. Together, we're building a better tomorrow for our employees, customers, partners, and communities. Principal, CPU Architecture & Performance Research Engineer What You'll Do Architecture Research Lab is seeking Principal CPU Architecture & Performance Engineer to lead the definition, analysis, and optimization of next-generation CPU microarchitectures (RISC-V core). This role is focused on end-to-end performance: from architectural trade-offs and workload characterization to micro-architectural modeling, simulation, and silicon bring-up correlation. You will work closely with architecture, design, compiler, and system teams to drive performance and efficiency across a broad set of real-world workloads Location: Daily onsite presence at our San Jose office in alignment with our Flexible Work policy Job ID: 42918 Define and evaluate CPU micro-architectural features for future cores (frontend, execution engine, memory hierarchy, interconnect). Lead performance analysis using simulators, RTL. Help develop and validate performance models (cycle-accurate, trace-driven, statistical). Characterize workloads (SPEC, server, client, AI/ML, cloud, internal traces) and translate findings into architectural requirements. Identify performance bottlenecks and propose data-driven optimizations. Drive architecture-to-implementation alignment with design team. Collaborate with compiler, OS, and system architects on cross-stack performance issues. Mentor senior and staff engineers; provide technical leadership across projects. Work leading to patents and publication What You Bring Master's with 18+ years of experience in Computer Engineering, Computer Science, or related field. or PhD with 15+ years of experience preferred. 10+ years of experience in CPU microarchitecture and/or performance engineering. Experience with RISC-V, ARM or X86 architectures. Strong understanding of: Out-of-order execution, branch prediction, pipelines, and speculation Cache coherence, memory systems, prefetching, and NUMA effects Hands-on experience with architectural simulators (like gem5). Strong programming skills in C/C++ and Python. Familiarity with compiler optimizations and hardware/software co-design. Familiarity with SIMD / Vectors / VME for AI inference workloads. Experience analyzing large performance datasets and traces. Proven ability (Tapeout / Patents /Publications) to influence architecture / micro architectural decisions through quantitative analysis. Preferred Qualifications Background in power/performance/area (PPA) trade-off analysis. Experience with SIMD/Vectors. Experience with compiler optimizations and hardware/software co-design. Prior technical leadership at Senior Staff or Principal level. You're inclusive, adapting your style to the situation and diverse global norms of our people. You approach challenges with curiosity and resilience, seeking data to help build understanding. You're collaborative, building relationships, humbly offering support and openly welcoming approaches. Innovative and creative, you proactively explore new ideas and adapt quickly to change. #LI-SF1 What We OfferThe pay range below is for all roles at this level across all US locations and functions. Paywithin this range varies by work locationand may also depend on job-related knowledge, skills,and experience. We also offer incentive opportunities that reward employees based on individual and company performance. This is in addition to our diverse package of benefits centered around the wellbeing of our employees and their loved ones. In addition to the usual Medical/Dental/Vision/401k, our inclusive rewards plan empowers our people to care for their whole selves. An investment in your future is an investment in ours. Give Back With a charitable giving match and frequent opportunities to get involved, we take an active role in supporting the community.Enjoy Time Away You'll start with 4+ weeks of paid time off a year, plus holidays and sick leave, to rest and recharge.Care for Family Whatever family means to you, we want to support you along the way—including a stipend for fertility care or adoption, medical travel support, and virtual vet care for your fur babies.Prioritize Emotional Wellness With on-demand apps and free confidential therapy sessions, you'll have support no matter where you are.Stay Fit Eating well and being active are important parts of a healthy life. Our onsite Café and gym, plus virtual classes, make it easier.Embrace Flexibility Benefits are best when you have the space to use them. That's why we facilitate a flexible environment so you can find the right balance for you.Base Pay Range$219,000—$351,000 USDEqual Opportunity Employment Policy Samsung Semiconductor takes pride in being an equal opportunity workplace dedicated to fostering an environment where all individuals feel valued and empowered to excel, regardless of race, religion, color, age, disability, sex, gender identity, sexual orientation, ancestry, genetic information, marital status, national origin, political affiliation, or veteran status. When selecting team members, we prioritize talent and qualities such as humility, kindness, and dedication. We extend comprehensive accommodations throughout our recruiting processes for candidates with disabilities, long-term conditions, neurodivergent individuals, or those requiring pregnancy-related support. All candidates scheduled for an interview will receive guidance on requesting accommodations. Recruiting Agency Policy We do not accept unsolicited resumes. Only authorized recruitment agencies that have a current and valid agreement with Samsung Semiconductor, Inc. are permitted to submit resumes for any job openings. Applicant AI Use Policy At Samsung Semiconductor, we support innovation and technology. However, to ensure a fair and authentic assessment, we prohibit the use of generative AI tools to misrepresent a candidate's true skills and qualifications. Permitted uses are limited to basic preparation, grammar, and research, but all submitted content and interview responses must reflect the candidate's genuine abilities and experience. Violation of this policy may result in immediate disqualification from the hiring process. Applicant Privacy Policyhttps://semiconductor.samsung.com/about-us/careers/us/privacy/

Technology

New offer

Samsung

Senior, CPU Architecture & Performance Research Engineer

Senior

On-site

San Jose, CA

🏢 Summary: Senior CPU Architecture & Performance Engineer role focused on performance analysis and microarchitectural optimization of current and next-generation RISC-V CPU cores. The position involves architectural simulation, bottleneck analysis, workload characterization, and automation of performance studies in collaboration with architecture and design teams. The role is onsite in San Jose and includes opportunities for patents and publications. 🗂️ Requirements: Master's or PhD in Computer Engineering, Computer Science, or related field, 2+ years of experience in CPU microarchitecture or performance engineering, Strong understanding of out-of-order execution, branch prediction, pipelines, and speculation, Strong understanding of cache coherence, memory systems, prefetching, and NUMA effects, Hands-on experience with architectural simulators, Strong programming skills in C/C++, Strong programming skills in Python, Experience analyzing large performance datasets and traces, Familiarity with compiler optimizations, Familiarity with hardware/software co-design 📃 Skills: RISC-V, ARM, X86, gem5, C, C++, Python, NUMA, SIMD, VME 🏢 Description: Please Note: To provide the best candidate experience amidst our high application volumes, each candidate is limited to 10 applications across all open jobs within a 6-month period. Advancing the World's Technology Together Our technology solutions power the tools you use every day--including smartphones, electric vehicles, hyperscale data centers, IoT devices, and so much more. Here, you'll have an opportunity to be part of a global leader whose innovative designs are pushing the boundaries of what's possible and powering the future. We believe innovation and growth are driven by an inclusive culture and a diverse workforce. We're dedicated to empowering people to be their true selves. Together, we're building a better tomorrow for our employees, customers, partners, and communities. What You'll Do Architecture Research Lab is seeking Senior CPU Architecture & Performance Engineer to contribute to performance analysis and microarchitectural optimization of current and next-generation CPU (RISC-V) cores. This role emphasizes hands-on analysis, modeling, and close collaboration with architects and design teams to improve performance across real-world workloads. You will focus on executing well-defined performance studies, identifying bottlenecks, and helping translate architectural ideas into measurable performance gains. - Perform performance analysis for CPU microarchitectural features under guidance from Staff and Principal engineers. - Run architectural simulations and analyze results (IPC, CPI stacks, latency and throughput metrics). - Identify performance bottlenecks in frontend, execution engine, or memory subsystems. - Assist in evaluating design trade-offs using trace-driven or cycle-accurate models. - Characterize workloads and benchmarks (SPEC, server, client, internal traces). - Develop scripts and tools to automate performance analysis. - Work leading to patents and publication. What You Bring - Master's, or PhD in Computer Engineering, Computer Science, or related field. - 2+ years of experience in CPU microarchitecture and/or performance engineering. - Strong understanding of: - Out-of-order execution, branch prediction, pipelines, and speculation - Cache coherence, memory systems, prefetching, and NUMA effects - Hands-on experience with architectural simulators (like gem5). - Strong programming skills in C/C++ and Python. - Experience analyzing large performance datasets and traces. - Familiarity with compiler optimizations and hardware/software co-design. Preferred Qualifications - Experience with RISC-V, ARM or X86 architectures. - Background in power/performance/area (PPA) trade-off analysis. - Familiarity with SIMD / Vectors / VME for AI inference workloads. - Prior tapeout / patent / publication experience. - You're inclusive, adapting your style to the situation and diverse global norms of our people. - You approach challenges with curiosity and resilience, seeking data to help build understanding. - You're collaborative, building relationships, humbly offering support and openly welcoming approaches. - Innovative and creative, you proactively explore new ideas and adapt quickly to change. What We Offer The pay range below is for all roles at this level across all US locations and functions. Pay within this range varies by work location and may also depend on job-related knowledge, skills, and experience. We also offer incentive opportunities that reward employees based on individual and company performance. This is in addition to our diverse package of benefits centered around the wellbeing of our employees and their loved ones. - Medical/Dental/Vision/401k benefits - Charitable giving match and community involvement opportunities - 4+ weeks of paid time off plus holidays and sick leave - Fertility care or adoption stipend, medical travel support, and virtual vet care - On-demand apps and free confidential therapy sessions - Onsite café and gym plus virtual fitness classes - Flexible work environment - Base Pay Range: $138,000—$206,000 USD Equal Opportunity Employment Policy Samsung Semiconductor takes pride in being an equal opportunity workplace dedicated to fostering an environment where all individuals feel valued and empowered to excel. Our Commitment to Innovation and Fairness AI tools may be used in the recruitment process as support tools, but all hiring decisions are made by human recruiting teams and hiring managers. Recruiting Agency Policy We do not accept unsolicited resumes. Only authorized recruitment agencies with a valid agreement may submit resumes for job openings. Applicant AI Use Policy Generative AI tools may not be used to misrepresent a candidate's skills or qualifications during the hiring process. Trade Secret Notice By submitting an application, candidates agree not to disclose confidential or proprietary information belonging to current or former employers or other entities.

Technology

New offer

Samsung

Staff, CPU Architecture & Performance Research Engineer

Senior

On-site

San Jose, CA

🏢 Summary: Staff-level CPU Architecture & Performance Research Engineer role focused on analyzing and optimizing RISC-V CPU microarchitecture performance using architectural models, simulators, and workload analysis. The position involves collaboration with architecture, compiler, and system teams to identify bottlenecks, evaluate trade-offs, and improve next-generation CPU core performance. Candidates will work on performance-critical domains, contribute architectural recommendations, and support patent and publication work. 🗂️ Requirements: Master’s degree in Computer Engineering, Computer Science, or related field with 8+ years of experience or PhD with 5+ years of experience, 5+ years of CPU microarchitecture or performance engineering experience, Experience with RISC-V, ARM, or X86 architectures, Knowledge of out-of-order execution, branch prediction, pipelines, and speculation, Knowledge of cache coherence, memory systems, prefetching, and NUMA effects, Hands-on experience with architectural simulators, Strong C/C++ programming skills, Strong Python programming skills, Experience analyzing large performance datasets and traces, Familiarity with compiler optimizations and hardware/software co-design, Prior tapeout experience 📃 Skills: RISC-V, ARM, X86, gem5, C, C++, Python, NUMA, SIMD, VME 🏢 Description: Advancing the World's Technology Together Our technology solutions power the tools you use every day—including smartphones, electric vehicles, hyperscale data centers, IoT devices, and more. This opportunity focuses on next-generation CPU architecture and performance optimization. What You'll Do Architecture Research Lab is looking for a Staff CPU Architecture & Performance Engineer to drive detailed performance analysis and architectural optimization for current and next-generation CPU (RISC-V) cores. This role focuses on deep ownership of performance-critical micro-architectural domains, workload analysis, and data-driven recommendations that influence core architecture decisions. You will work closely with architects, design, compiler, and system teams to evaluate trade-offs, identify bottlenecks, and improve performance across real-world workloads. Responsibilities - Own performance analysis for one or more CPU microarchitectural domains (e.g., frontend, execution engine, memory subsystem) - Build, extend, and validate architectural performance models and simulators - Perform CPI/IPC breakdowns and root-cause performance bottlenecks - Evaluate microarchitectural features and optimizations using trace-driven, analytical, and cycle-accurate models - Characterize workloads and benchmarks (SPEC, server, client, AI/ML, internal traces) - Translate performance data into clear architectural recommendations - Contribute to work leading to patents and publications What You Bring - Master’s degree in Computer Engineering, Computer Science, or related field with 8+ years of experience, or PhD with 5+ years of experience - 5+ years of experience in CPU microarchitecture and/or performance engineering - Experience with RISC-V, ARM, or X86 architectures - Strong understanding of: - Out-of-order execution, branch prediction, pipelines, and speculation - Cache coherence, memory systems, prefetching, and NUMA effects - Hands-on experience with architectural simulators such as gem5 - Strong programming skills in C/C++ and Python - Experience analyzing large performance datasets and traces - Familiarity with compiler optimizations and hardware/software co-design - Proven prior tapeout experience Preferred Qualifications - Background in power/performance/area (PPA) trade-off analysis - Experience or familiarity with SIMD, vectors, or VME for AI inference workloads - Prior leadership experience guiding junior engineers - Prior patent or publication experience Benefits - Competitive compensation with incentive opportunities - Medical, Dental, Vision, and 401(k) - Charitable giving match and community involvement opportunities - 4+ weeks of paid time off, holidays, and sick leave - Fertility care, adoption support, medical travel support, and virtual vet care - Emotional wellness support including confidential therapy sessions - Onsite café, gym, and virtual fitness classes - Flexible work environment and wellness-focused benefits Base Pay Range: $163,000—$253,000 USD

Technology

Samsung

Staff, CPU Architecture & Performance Research Engineer

Senior

On-site

San Jose, CA

🏢 Summary: Staff CPU Architecture & Performance Research Engineer role focused on deep microarchitectural performance analysis and optimization of current and next-generation RISC-V CPU cores. The position involves building performance models, analyzing real-world workloads, and driving data-driven architectural improvements in collaboration with cross-functional hardware and software teams. The work directly influences core design decisions and may lead to patents and publications. 🗂️ Requirements: Master’s or PhD in Computer Engineering, Computer Science or related field, 5+ years of experience in CPU microarchitecture or performance engineering, Experience with RISC-V, ARM or x86 architectures, Strong knowledge of out-of-order execution, branch prediction, pipelines and speculation, Strong knowledge of cache coherence, memory systems, prefetching and NUMA, Hands-on experience with architectural simulators, Strong programming skills in C/C++ and Python, Experience analyzing large performance datasets and traces, Prior tapeout experience 📃 Skills: RISC-V, ARM, x86, gem5, C, C++, Python, SPEC, NUMA, SIMD 🏢 Description: Please Note: To provide the best candidate experience amidst our high application volumes, each candidate is limited to 10 applications across all open jobs within a 6-month period. Advancing the World's Technology Together Our technology solutions power the tools you use every day--including smartphones, electric vehicles, hyperscale data centers, IoT devices, and so much more. Here, you'll have an opportunity to be part of a global leader whose innovative designs are pushing the boundaries of what's possible and powering the future. We believe innovation and growth are driven by an inclusive culture and a diverse workforce. We're dedicated to empowering people to be their true selves. Together, we're building a better tomorrow for our employees, customers, partners, and communities.Staff, CPU Architecture & Performance Research Engineer What You'll Do Architecture Research Lab is looking for Staff CPU Architecture & Performance Engineer to drive detailed performance analysis and architectural optimization for current and next-generation CPU (RISC-V) cores. This role focuses on deep ownership of performance-critical micro-architectural domains, workload analysis, and data-driven recommendations that influence core architecture decisions. You will work closely with architects, design, compiler, and system teams to evaluate trade-offs, identify bottlenecks, and improve performance across real-world workloads Location: Daily onsite presence at our San Jose office in alignment with our Flexible Work policy Job ID: 42850 Own performance analysis for one or more CPU microarchitectural domains (e.g., frontend, execution engine, memory subsystem). Build, extend, and validate architectural performance models and simulators. Perform CPI/IPC breakdowns and root-cause performance bottlenecks. Evaluate microarchitectural features and optimizations using trace-driven, analytical, and cycle-accurate models. Characterize workloads and benchmarks (SPEC, server, client, AI/ML, internal traces). Translate performance data into clear architectural recommendations. Work leading to patents and publication. What You Bring Master's in Computer Engineering, Computer Science or related filed with 8+ years of experience or PhD in Computer Engineering, Computer Science, or related field with 5+ years of experience. 5+ years of experience in CPU microarchitecture and/or performance engineering. Experience with RISC-V, ARM or X86 architectures. Strong understanding of: Out-of-order execution, branch prediction, pipelines, and speculation Cache coherence, memory systems, prefetching, and NUMA effects Hands-on experience with architectural simulators (like gem5). Strong programming skills in C/C++ and Python. Experience analyzing large performance datasets and traces. Familiarity with compiler optimizations and hardware/software co-design.. Proven ability - prior tapeout experience. Preferred Qualifications Background in power/performance/area (PPA) trade-off analysis. Experience / Familiarity with SIMD / Vectors / VME for AI inference workloads.. Prior leadership guiding junior engineers. Prior patent / publication experience. You're inclusive, adapting your style to the situation and diverse global norms of our people. You approach challenges with curiosity and resilience, seeking data to help build understanding. You're collaborative, building relationships, humbly offering support and openly welcoming approaches. Innovative and creative, you proactively explore new ideas and adapt quickly to change. #LI-SF1What We OfferThe pay range below is for all roles at this level across all US locations and functions. Paywithin this range varies by work locationand may also depend on job-related knowledge, skills,and experience. We also offer incentive opportunities that reward employees based on individual and company performance. This is in addition to our diverse package of benefits centered around the wellbeing of our employees and their loved ones. In addition to the usual Medical/Dental/Vision/401k, our inclusive rewards plan empowers our people to care for their whole selves. An investment in your future is an investment in ours. Give Back With a charitable giving match and frequent opportunities to get involved, we take an active role in supporting the community.Enjoy Time Away You'll start with 4+ weeks of paid time off a year, plus holidays and sick leave, to rest and recharge.Care for Family Whatever family means to you, we want to support you along the way—including a stipend for fertility care or adoption, medical travel support, and virtual vet care for your fur babies.Prioritize Emotional Wellness With on-demand apps and free confidential therapy sessions, you'll have support no matter where you are.Stay Fit Eating well and being active are important parts of a healthy life. Our onsite Café and gym, plus virtual classes, make it easier.Embrace Flexibility Benefits are best when you have the space to use them. That's why we facilitate a flexible environment so you can find the right balance for you.Base Pay Range$163,000—$253,000 USDEqual Opportunity Employment Policy Samsung Semiconductor takes pride in being an equal opportunity workplace dedicated to fostering an environment where all individuals feel valued and empowered to excel, regardless of race, religion, color, age, disability, sex, gender identity, sexual orientation, ancestry, genetic information, marital status, national origin, political affiliation, or veteran status. When selecting team members, we prioritize talent and qualities such as humility, kindness, and dedication. We extend comprehensive accommodations throughout our recruiting processes for candidates with disabilities, long-term conditions, neurodivergent individuals, or those requiring pregnancy-related support. All candidates scheduled for an interview will receive guidance on requesting accommodations. Recruiting Agency Policy We do not accept unsolicited resumes. Only authorized recruitment agencies that have a current and valid agreement with Samsung Semiconductor, Inc. are permitted to submit resumes for any job openings. Applicant AI Use Policy At Samsung Semiconductor, we support innovation and technology. However, to ensure a fair and authentic assessment, we prohibit the use of generative AI tools to misrepresent a candidate's true skills and qualifications. Permitted uses are limited to basic preparation, grammar, and research, but all submitted content and interview responses must reflect the candidate's genuine abilities and experience. Violation of this policy may result in immediate disqualification from the hiring process. Applicant Privacy Policyhttps://semiconductor.samsung.com/about-us/careers/us/privacy/

Technology

Samsung

Senior Director, Architecture Research Lab

Senior

On-site

San Jose, CA

🏢 Summary: Lead research and architecture design for next-generation AI systems, focusing on rack-scale co-design of AI workloads, memory, interconnects, and high-performance RISC-V CPUs. The role drives system-level modeling, simulation, and micro-architecture innovation to eliminate memory and bandwidth bottlenecks in large-scale AI platforms. It combines deep technical leadership with hands-on architecture research and cross-functional collaboration. 🗂️ Requirements: Ph.D. in Computer Science, Electrical Engineering, or related field, 10+ years in system-level architecture research or large-scale computing platform design, Expertise in AI workload-focused system architecture design, Strong experience in performance modeling and event-driven simulation, Deep knowledge of RISC-V, ARM, or x86 CPU architectures, Experience designing out-of-order CPU micro-architectures, Proficiency in transaction-level modeling (TLM), Experience with large-scale design-space exploration and PPA analysis, Hands-on programming experience in Python and C++, Ability to lead technical research teams and define architecture roadmaps 📃 Skills: RISC-V, ARM, x86, TLM, Python, C++, PPA, Simulation, Modeling, Microarchitecture 🏢 Description: Please Note: To provide the best candidate experience amidst our high application volumes, each candidate is limited to 10 applications across all open jobs within a 6-month period. Advancing the World's Technology Together Our technology solutions power the tools you use every day--including smartphones, electric vehicles, hyperscale data centers, IoT devices, and so much more. Here, you'll have an opportunity to be part of a global leader whose innovative designs are pushing the boundaries of what's possible and powering the future. We believe innovation and growth are driven by an inclusive culture and a diverse workforce. We're dedicated to empowering people to be their true selves. Together, we're building a better tomorrow for our employees, customers, partners, and communities.What You'll Do Lead cutting‑edge research on next‑generation AI system architectures. The role is responsible for end‑to‑end co‑design of AI workloads, system‑level modeling, hardware platforms, and high‑performance processors that leverage Samsung's advanced memory technologies to eliminate capacity, bandwidth, and large‑scale communication bottlenecks. Location: Daily onsite presence at our San Jose Headquarters in alignment with our Flexible Work policy AI System Architecture Leadership Define system‑level architectures that solve memory‑capacity, bandwidth, and interconnect challenges for large AI workloads (e.g., large language models, recommendation systems). Build and maintain analytical and event‑driven simulation frameworks for compute‑memory‑network performance at rack scale. Conduct design‑space exploration and quantitative trade‑off studies (performance, power, cost) to guide architecture decisions. Partner with SAIT HQ teams to align modeling insights with real‑world AI system implementations. RISC‑V CPU Architecture Leadership Architect high‑performance, out‑of‑order RISC‑V CPU cores that serve as host processors for AI computing systems. Drive IPC‑focused feature path‑finding; lead micro‑architecture research through performance‑model simulation and workload analysis. Produce detailed micro‑architecture specifications and guide cache/memory hierarchy design for optimal AI workload execution. Research & Innovation Management Lead a multidisciplinary research team, set technical roadmaps, and ensure delivery of high‑impact publications. Present architectural insights and strategic recommendations to senior leadership and external partners. What You Bring Education: Ph.D. in Computer Science, Electrical Engineering, or a related field. Experience: 10+years in system‑level architecture research or large‑scale computing platform design, with a strong focus on AI workloads. Technical Expertise: Performance modeling, event‑driven simulation, and quantitative analysis of compute‑memory‑interconnect systems. Deep knowledge of modern CPU/accelerator architectures (RISC‑V, ARM, x86) and heterogeneous integration. Proven ability to design rack‑scale AI system architectures that address memory, bandwidth, and interconnect constraints. Proficiency with transaction‑level modeling (TLM) and event‑driven simulation for compute‑memory‑network co‑design. Experience in large‑scale design‑space exploration and PPA (performance, power, area/cost) trade‑off analysis. Expertise in architecture and micro‑architecture design of out‑of‑order CPUs. Hands‑on experience with simulation tools and programming languages such as Python and C++. Communication Skills: Excellent written and verbal communication; proven ability to deliver technical presentations to senior stakeholders. Preferred Experience Proven track record of publishing high‑impact research papers. Experience influencing product roadmaps through architectural recommendations. Strong collaborative history with cross‑functional teams (hardware, software, memory technology). You're inclusive, adapting your style to the situation and diverse global norms of our people. An avid learner, you approach challenges with curiosity and resilience, seeking data to help build understanding. You're collaborative, building relationships, humbly offering support and openly welcoming approaches. Innovative and creative, you proactively explore new ideas and adapt quickly to change #LI-SF1 What We OfferThe pay range below is for all roles at this level across all US locations and functions. Paywithin this range varies by work locationand may also depend on job-related knowledge, skills,and experience. We also offer incentive opportunities that reward employees based on individual and company performance. This is in addition to our diverse package of benefits centered around the wellbeing of our employees and their loved ones. In addition to the usual Medical/Dental/Vision/401k, our inclusive rewards plan empowers our people to care for their whole selves. An investment in your future is an investment in ours. Give Back With a charitable giving match and frequent opportunities to get involved, we take an active role in supporting the community.Enjoy Time Away You'll start with 4+ weeks of paid time off a year, plus holidays and sick leave, to rest and recharge.Care for Family Whatever family means to you, we want to support you along the way—including a stipend for fertility care or adoption, medical travel support, and virtual vet care for your fur babies.Prioritize Emotional Wellness With on-demand apps and free confidential therapy sessions, you'll have support no matter where you are.Stay Fit Eating well and being active are important parts of a healthy life. Our onsite Café and gym, plus virtual classes, make it easier.Embrace Flexibility Benefits are best when you have the space to use them. That's why we facilitate a flexible environment so you can find the right balance for you.Base Pay Range$246,000—$430,000 USDEqual Opportunity Employment Policy Samsung Semiconductor takes pride in being an equal opportunity workplace dedicated to fostering an environment where all individuals feel valued and empowered to excel, regardless of race, religion, color, age, disability, sex, gender identity, sexual orientation, ancestry, genetic information, marital status, national origin, political affiliation, or veteran status. When selecting team members, we prioritize talent and qualities such as humility, kindness, and dedication. We extend comprehensive accommodations throughout our recruiting processes for candidates with disabilities, long-term conditions, neurodivergent individuals, or those requiring pregnancy-related support. All candidates scheduled for an interview will receive guidance on requesting accommodations. Our Commitment to Innovation and Fairness At Samsung Semiconductor, we use Artificial Intelligence (AI) tools in the recruitment process to enhance efficiency. However, AI is used as a support tool, not a final decision-maker. All hiring decisions are made by our human recruiting team and hiring managers to ensure every candidate is evaluated fairly and holistically. Recruiting Agency Policy We do not accept unsolicited resumes. Only authorized recruitment agencies that have a current and valid agreement with Samsung Semiconductor, Inc. are permitted to submit resumes for any job openings. Applicant AI Use Policy At Samsung Semiconductor, we support innovation and technology. However, to ensure a fair and authentic assessment, we ask that candidates rely on their own knowledge and skills throughout the process. AI tools may be used for basic preparation, grammar, and research, but should not be used to generate or assist with submitted content or live interview responses. If we determine that AI is being used outside these guidelines, we reserve the right to pause or end the interview, and your candidacy may be disqualified. Trade Secret Notice By submitting an application, you agree not to disclose to Samsung—or encourage Samsung to use—any confidential or proprietary information (including trade secrets) belonging to a current or former employer or other entity. Applicant Privacy Policyhttps://semiconductor.samsung.com/about-us/careers/us/privacy/

Technology

New offer

Samsung

Principal Engineer, AI System Architect (Hardware)

Senior

On-site

San Jose, CA

240,000 - 249,996 USD/yr

🏢 Summary: Principal AI System Architect role focused on researching and designing next-generation AI hardware systems for large-scale workloads. The position involves leading architecture strategy, developing system-level performance models, and evaluating compute, memory, and interconnect architectures for AI scalability and efficiency. The role requires deep expertise in AI system architecture, simulation modeling, and large-scale AI workloads. 🗂️ Requirements: Ph.D. in Computer Science, Electrical Engineering, or related field preferred, 15+ years of experience in system architecture for large-scale computing platforms, Hands-on experience developing analytical and event-driven simulation models, Deep understanding of AI hardware architectures, Knowledge of LLMs, DLRMs, and large-scale AI training and inference systems, Ability to translate workload characteristics into architectural design decisions, Proficiency in Python, Proficiency in C++, Proficiency in PyTorch, Strong technical communication and presentation skills 📃 Skills: Python, C++, PyTorch, AI, LLM, DLRM, Modeling, Simulation, Architecture, Hardware, Networking, Memory, Interconnect 🏢 Description: Please Note: To provide the best candidate experience amidst our high application volumes, each candidate is limited to 10 applications across all open jobs within a 6-month period. Advancing the World's Technology Together Our technology solutions power the tools you use every day--including smartphones, electric vehicles, hyperscale data centers, IoT devices, and so much more. Here, you'll have an opportunity to be part of a global leader whose innovative designs are pushing the boundaries of what's possible and powering the future. We believe innovation and growth are driven by an inclusive culture and a diverse workforce. We're dedicated to empowering people to be their true selves. Together, we're building a better tomorrow for our employees, customers, partners, and communities. What You'll Do The Architecture Research Lab (ARL) focuses on addressing fundamental system-level bottlenecks in modern AI, particularly in memory capacity/bandwidth and system-scale communication. By leveraging world-class memory technologies, ARL explores and defines next-generation AI system architectures that deliver step-function improvements in performance, efficiency, and scalability. We are seeking a Principal AI System Architect who will play a Technical Lead role in bridging AI workloads, system architecture, and hardware design. In this role, you will develop system-level performance models, drive architecture-level design decisions, and propose forward-looking AI system architectures. Location: Daily onsite presence at the San Jose office in alignment with the Flexible Work policy. Responsibilities: - Technically lead the architecture team with strong direction, shaping system-architecture strategy and advancing key innovations - Conduct system-level architectural research for next-generation AI systems spanning compute, memory, and interconnect/network subsystems - Develop and maintain analytical and simulation-based system modeling frameworks to evaluate AI workloads and identify performance, scalability, and efficiency bottlenecks - Analyze representative and emerging AI workloads including LLMs, DLRMs, and future AI models - Drive architecture-level design decisions through quantitative modeling, design-space exploration, and performance/power projections - Perform comparative studies of alternative system architectures and report performance and efficiency metrics - Collaborate with hardware architecture, memory, interconnect, and system engineering teams - Communicate architectural insights and recommendations through technical presentations and documentation - Occasional domestic and international travel under 10% What You Bring - Ph.D. in Computer Science, Electrical Engineering, or a related field preferred - 15+ years of experience in system architecture for large-scale computing platforms with a focus on AI workloads - Hands-on experience developing analytical and event-driven simulation models - Deep understanding of AI system hardware architectures including compute, memory hierarchies, and high-performance interconnects - Strong knowledge of modern and emerging AI workloads including LLMs, DLRMs, and large-scale training and inference systems - Ability to translate workload characteristics and modeling results into architectural design decisions - Proficiency in Python, C++, and PyTorch - Excellent written, verbal, and presentation communication skills - Collaborative mindset and resilience in solving complex system-level challenges What We Offer - Competitive compensation with incentive opportunities - Medical, Dental, Vision, and 401(k) - Charitable giving match and community involvement opportunities - 4+ weeks of paid time off plus holidays and sick leave - Family support benefits including fertility and adoption support - Emotional wellness resources and therapy sessions - Onsite café, gym, and virtual fitness classes - Flexible work environment Base Pay Range $219,000—$351,000 USD Equal Opportunity Employment Policy Samsung Semiconductor is an equal opportunity workplace dedicated to fostering an environment where all individuals feel valued and empowered to excel. Our Commitment to Innovation and Fairness AI tools may be used in the recruitment process as support tools, but all hiring decisions are made by human recruiting teams and hiring managers. Applicant AI Use Policy Candidates may use AI tools for basic preparation, grammar, and research, but not for submitted content or live interview responses. Trade Secret Notice By submitting an application, candidates agree not to disclose confidential or proprietary information belonging to current or former employers or other entities.

Technology

New offer

Samsung

Principal Engineer, AI System Architect (Hardware)

Senior

On-site

San Jose, CA

240,000 - 249,996 USD/yr

🏢 Summary: Principal Engineer role focused on AI system architecture research and technical leadership for next-generation AI hardware platforms. The position involves developing system-level performance models, evaluating AI workloads, and driving architecture decisions across compute, memory, and interconnect systems. Daily onsite work in San Jose with collaboration across hardware and system engineering teams. 🗂️ Requirements: PhD in Computer Science, Electrical Engineering, or related field, 15+ years of experience in AI system architecture for large-scale computing platforms, Hands-on experience with analytical and event-driven simulation models, Deep understanding of AI hardware architectures, Knowledge of LLMs, DLRMs, and large-scale AI training and inference systems, Ability to translate workload modeling into architectural decisions, Proficiency in Python, Proficiency in C++, Proficiency in PyTorch, Strong technical communication and presentation skills 📃 Skills: Python, C++, PyTorch, AI, LLMs, DLRMs, Modeling, Simulation, Architecture, Hardware, Memory, Interconnect, Networking 🏢 Description: Please Note: To provide the best candidate experience amidst our high application volumes, each candidate is limited to 10 applications across all open jobs within a 6-month period. Advancing the World's Technology Together Our technology solutions power the tools you use every day—including smartphones, electric vehicles, hyperscale data centers, IoT devices, and more. Here, you'll have an opportunity to be part of a global leader whose innovative designs are pushing the boundaries of what's possible and powering the future. We believe innovation and growth are driven by an inclusive culture and a diverse workforce. We're dedicated to empowering people to be their true selves. Together, we're building a better tomorrow for our employees, customers, partners, and communities. The Architecture Research Lab (ARL) focuses on addressing fundamental system-level bottlenecks in modern AI, particularly in memory capacity/bandwidth and system-scale communication. By leveraging Samsung's world-class memory technologies, ARL explores and defines next-generation AI system architectures that deliver step-function improvements in performance, efficiency, and scalability. We are seeking a Principal AI System Architect who will play a Technical Lead role in bridging AI workloads, system architecture, and hardware design. In this role, you will develop system-level performance models, drive architecture-level design decisions, and propose forward-looking AI system architectures that shape long-term AI platform strategy. Location: Daily onsite presence at the San Jose office in alignment with the Flexible Work policy. What You'll Do - Technically lead the architecture team with strong direction, shaping system-architecture strategy and advancing key innovations - Conduct system-level architectural research for next-generation AI systems spanning compute, memory, and interconnect/network subsystems - Develop and maintain analytical and simulation-based system modeling frameworks to evaluate AI workloads and identify performance, scalability, and efficiency bottlenecks at rack- and system-scale - Analyze representative and emerging system-level architectural research including LLMs, DLRMs, and future AI models to derive architecture requirements and trade-offs across compute, memory, networking, and power - Drive architecture-level design decisions through quantitative modeling, design-space exploration, and performance/power projections - Perform comparative studies of alternative system architectures and report performance and performance-per-watt metrics - Collaborate with cross-functional teams in hardware architecture, memory, interconnect, and system engineering - Communicate architectural insights and recommendations through technical presentations and documentation - Occasional domestic and international travel under 10% What You Bring - PhD in Computer Science, Electrical Engineering, or a related field preferred - 15+ years of experience in system architecture for large-scale computing platforms with a strong focus on AI workloads - Proven hands-on experience developing analytical and event-driven simulation models for system-level performance evaluation - Deep understanding of AI system hardware architectures including compute, memory hierarchies, and high-performance interconnects - Strong knowledge of modern and emerging AI workloads including LLMs, DLRMs, and large-scale training and inference systems - Demonstrated ability to translate workload characteristics and modeling results into actionable architectural design decisions - Proficiency in Python, C++, and PyTorch for modeling, analysis, and experimentation - Excellent written, verbal, and presentation communication skills - Collaborative mindset, intellectual curiosity, and resilience in tackling complex system-level challenges What We Offer The pay range varies by work location and depends on job-related knowledge, skills, and experience. Incentive opportunities are offered in addition to benefits. Benefits include: - Medical, Dental, Vision, and 401k plans - Charitable giving match and community involvement opportunities - 4+ weeks of paid time off plus holidays and sick leave - Fertility care or adoption stipend, medical travel support, and virtual vet care - On-demand wellness apps and confidential therapy sessions - Onsite café and gym plus virtual fitness classes - Flexible work environment Base Pay Range $219,000—$351,000 USD Equal Opportunity Employment Policy Samsung Semiconductor is committed to fostering an inclusive workplace where all individuals feel valued and empowered to excel. Our Commitment to Innovation and Fairness AI tools may be used in the recruitment process as support tools, but all hiring decisions are made by human recruiters and hiring managers. Applicant AI Use Policy Candidates may use AI tools for preparation, grammar, and research, but not for generating submitted content or live interview responses. Trade Secret Notice By submitting an application, candidates agree not to disclose confidential or proprietary information belonging to current or former employers. Applicant Privacy Policy https://semiconductor.samsung.com/about-us/careers/us/privacy/

Technology

New offer

Samsung

Principal Engineer, AI Serving Framework Architect (Software)

Senior

On-site

San Jose, CA

240,000 - 249,996 USD/yr

🏢 Summary: Principal AI Serving Framework Architect role focused on designing and optimizing large-scale AI inference systems, memory-centric architectures, and AI serving frameworks for multi-rack environments. The position involves leading research initiatives, developing performance optimization strategies, and contributing to AI infrastructure design using technologies such as vLLM, PyTorch, Python, and C++. 🗂️ Requirements: PhD in Computer Science or related field, 10+ years of experience in AI Serving Frameworks for large-scale computing, Experience leading LLM inference software stack projects at multi-rack scale, Experience delivering AI inference services for 100,000+ users, Expertise in AI inference software stacks for heterogeneous devices, Deep understanding of inference engines such as vLLM, Experience in AI inference system profiling and optimization, Knowledge of reasoning models, multimodal AI, AI agents, and world models, Strong understanding of compute, memory, and networking bottlenecks in AI systems, Proficiency in PyTorch, Proficiency in Python, Proficiency in C++, Excellent verbal and written communication skills 📃 Skills: PyTorch, Python, C++, vLLM, LLM, RAG, SSD, KVCache, AI, Inference 🏢 Description: Please Note: To provide the best candidate experience amidst our high application volumes, each candidate is limited to 10 applications across all open jobs within a 6-month period. Advancing the World's Technology Together Our technology solutions power the tools you use every day—including smartphones, electric vehicles, hyperscale data centers, IoT devices, and more. Here, you'll have an opportunity to be part of a global leader whose innovative designs are pushing the boundaries of what's possible and powering the future. We believe innovation and growth are driven by an inclusive culture and a diverse workforce. We're dedicated to empowering people to be their true selves. Together, we're building a better tomorrow for our employees, customers, partners, and communities. Job Title: Principal engineer, AI Serving Framework Architect (Software) The Architecture Research Lab (ARL) focuses on addressing fundamental system-level bottlenecks in modern AI, particularly in memory capacity/bandwidth and system-scale communication. By leveraging world-class memory technologies, the lab explores and defines next-generation AI system architectures that deliver improvements in performance, efficiency, and scalability. We are seeking a Principal AI System Architect who will play a key role in bridging AI workloads, system architecture, and hardware design. In this role, you will develop system-level performance models, drive architecture-level design decisions, and propose forward-looking AI system architectures. Location: Daily onsite presence at the San Jose office in alignment with the Flexible Work policy. What You'll Do - Lead research teams and propose technical direction - Research dynamic scheduling methodologies for maximizing AI inference performance in multi-rack scale memory-centric systems - Investigate methods to accelerate search operations in RAG vector databases and AI agent knowledge graphs using compute-capable memory - Study strategies for optimally placing KVCache and vector databases in hierarchical memory to minimize SSD access and reduce IO stalls - Propose software designs for implementing optimization algorithms on open-source platforms such as vLLM What You Bring - PhD in Computer Science or related field with 10+ years of experience in AI Serving Frameworks for large-scale computing - Experience leading projects to build and optimize LLM inference software stacks on multi-rack scale systems serving over 100,000 users - Extensive experience designing AI inference software stacks for heterogeneous devices - In-depth understanding of inference engines such as vLLM - Proficiency in AI inference system profiling and optimization - Knowledge of future AI workloads including reasoning models, multimodal solutions, AI agents, and world models - Strong understanding of compute, memory, and networking bottlenecks in AI systems - Required skills: PyTorch, Python, and C++ - Collaborative mindset and strong communication skills - Native or fluent Korean is preferred What We Offer - Competitive compensation with incentive opportunities - Medical, Dental, Vision, and 401(k) - Charitable giving match and community involvement opportunities - 4+ weeks of paid time off, holidays, and sick leave - Family support benefits including fertility, adoption, and medical travel assistance - Emotional wellness support and confidential therapy sessions - Onsite café, gym, and virtual fitness classes - Flexible work environment Base Pay Range $219,000—$351,000 USD Equal Opportunity Employment Policy Samsung Semiconductor is committed to fostering an inclusive workplace where all individuals are valued and empowered to excel. Our Commitment to Innovation and Fairness AI tools may be used as support tools during recruitment, but all hiring decisions are made by human recruiters and hiring managers. Applicant AI Use Policy Candidates may use AI tools for preparation and research but not for generating submitted content or live interview responses. Trade Secret Notice By submitting an application, candidates agree not to disclose confidential or proprietary information belonging to current or former employers.

Technology

Samsung

Principal Engineer, AI System Architect (Hardware)

Senior

On-site

San Jose, CA

240,000 - 249,996 USD/yr

🏢 Summary: Principal Engineer role focused on leading AI system architecture research and driving system-level design decisions for next-generation AI platforms. The position centers on modeling, evaluating, and optimizing compute, memory, and interconnect subsystems to improve performance, scalability, and efficiency of large-scale AI workloads. The role bridges AI workloads, hardware architecture, and quantitative system modeling to shape long-term AI infrastructure strategy. 🗂️ Requirements: PhD in Computer Science, Electrical Engineering, or related field, 15+ years of experience in system architecture for large-scale computing platforms, Hands-on experience with analytical and event-driven system-level performance modeling, Deep understanding of AI hardware architectures including compute, memory hierarchies, and interconnects, Strong knowledge of modern AI workloads such as LLMs and DLRMs, Experience translating workload analysis into architectural design decisions, Proficiency in Python, C++, and PyTorch, Ability to work onsite in San Jose 📃 Skills: Python, C++, PyTorch, AI, LLMs, DLRMs, Modeling, Simulation, Architecture, Hardware, Compute, Memory, Interconnects, Networking, Performance, Scalability, Power 🏢 Description: Please Note: To provide the best candidate experience amidst our high application volumes, each candidate is limited to 10 applications across all open jobs within a 6-month period. Advancing the World's Technology Together Our technology solutions power the tools you use every day--including smartphones, electric vehicles, hyperscale data centers, IoT devices, and so much more. Here, you'll have an opportunity to be part of a global leader whose innovative designs are pushing the boundaries of what's possible and powering the future. We believe innovation and growth are driven by an inclusive culture and a diverse workforce. We're dedicated to empowering people to be their true selves. Together, we're building a better tomorrow for our employees, customers, partners, and communities.Job Title: Principal Engineer, AI System Architect (Hardware) The Architecture Research Lab (ARL) focuses on addressing fundamental system-level bottlenecks in modern AI, particularly in memory capacity/bandwidth and system-scale communication. By leveraging Samsung's world-class memory technologies, ARL explores and defines next-generation AI system architectures that deliver step-function improvements in performance, efficiency, and scalability. We are seeking a Principal AI System Architect who will play a Technical Lead role in bridging AI workloads, system architecture, and hardware design. In this role, you will develop system-level performance models, drive architecture-level design decisions, and propose forward-looking AI system architectures that shape Samsung's long-term AI platform strategy. Location: Daily onsite presence at our San Jose office in alignment with our Flexible Work policy Job ID: 42852 What You'll Do Technically Lead the architecture team with strong direction, shaping system‑architecture strategy and advancing key innovations Conduct system-level architectural research for next-generation AI systems, spanning compute, memory, and interconnect/network subsystems. Develop and maintain analytical and simulation-based system modeling frameworks to evaluate AI workloads and identify performance, scalability, and efficiency bottlenecks at rack- and system-scale. Analyze representative and emerging system-level architectural research (e.g., LLMs, DLRMs, and future AI models) to derive architecture requirements and trade-offs across compute, memory, networking, and power. Drive architecture-level design decisions through quantitative modeling, design-space exploration, and performance/power projections. Perform comparative studies of alternative system architectures, reporting performance and performance-per-watt metrics to guide strategic technology choices. Collaborate closely with cross-functional teams in hardware architecture, memory, interconnect, and system engineering to align modeling insights with implementation realities. Communicate architectural insights and recommendations through clear technical presentations and documentation. Occasional domestic and international travel (<10%). What You Bring Ph.D. in Computer Science, Electrical Engineering, or a related field preferred, with 15+ years of experience in system architecture for large-scale computing platforms, with a strong focus on AI workloads Proven hands-on experience developing analytical and event-driven simulation models for system-level performance evaluation. Deep understanding of AI system hardware architectures, including compute, memory hierarchies, and high-performance interconnects. Strong knowledge of modern and emerging AI workloads, including LLMs, DLRMs, and large-scale training and inference systems. Demonstrated ability to translate workload characteristics and modeling results into actionable architectural design decisions. Proficiency in Python, C++, and PyTorch for modeling, analysis, and experimentation. Excellent written, verbal, and presentation communication skills, with the ability to influence technical direction across teams. A collaborative mindset, intellectual curiosity, and resilience in tackling complex, open-ended system-level challenges. You're inclusive, adapting your style to the situation and diverse global norms of our people. You approach challenges with curiosity and resilience, seeking data to help build understanding. You're collaborative, building relationships, humbly offering support and openly welcoming approaches. Innovative and creative, you proactively explore new ideas and adapt quickly to change. #LI-SF1What We OfferThe pay range below is for all roles at this level across all US locations and functions. Paywithin this range varies by work locationand may also depend on job-related knowledge, skills,and experience. We also offer incentive opportunities that reward employees based on individual and company performance. This is in addition to our diverse package of benefits centered around the wellbeing of our employees and their loved ones. In addition to the usual Medical/Dental/Vision/401k, our inclusive rewards plan empowers our people to care for their whole selves. An investment in your future is an investment in ours. Give Back With a charitable giving match and frequent opportunities to get involved, we take an active role in supporting the community.Enjoy Time Away You'll start with 4+ weeks of paid time off a year, plus holidays and sick leave, to rest and recharge.Care for Family Whatever family means to you, we want to support you along the way—including a stipend for fertility care or adoption, medical travel support, and virtual vet care for your fur babies.Prioritize Emotional Wellness With on-demand apps and free confidential therapy sessions, you'll have support no matter where you are.Stay Fit Eating well and being active are important parts of a healthy life. Our onsite Café and gym, plus virtual classes, make it easier.Embrace Flexibility Benefits are best when you have the space to use them. That's why we facilitate a flexible environment so you can find the right balance for you.Base Pay Range$219,000—$351,000 USDEqual Opportunity Employment Policy Samsung Semiconductor takes pride in being an equal opportunity workplace dedicated to fostering an environment where all individuals feel valued and empowered to excel, regardless of race, religion, color, age, disability, sex, gender identity, sexual orientation, ancestry, genetic information, marital status, national origin, political affiliation, or veteran status. When selecting team members, we prioritize talent and qualities such as humility, kindness, and dedication. We extend comprehensive accommodations throughout our recruiting processes for candidates with disabilities, long-term conditions, neurodivergent individuals, or those requiring pregnancy-related support. All candidates scheduled for an interview will receive guidance on requesting accommodations. Recruiting Agency Policy We do not accept unsolicited resumes. Only authorized recruitment agencies that have a current and valid agreement with Samsung Semiconductor, Inc. are permitted to submit resumes for any job openings. Applicant AI Use Policy At Samsung Semiconductor, we support innovation and technology. However, to ensure a fair and authentic assessment, we prohibit the use of generative AI tools to misrepresent a candidate's true skills and qualifications. Permitted uses are limited to basic preparation, grammar, and research, but all submitted content and interview responses must reflect the candidate's genuine abilities and experience. Violation of this policy may result in immediate disqualification from the hiring process. Applicant Privacy Policyhttps://semiconductor.samsung.com/about-us/careers/us/privacy/

Technology

Samsung

Principal engineer, AI Serving Framework Architect (Software)

Senior

On-site

San Jose, CA

240,000 - 249,996 USD/yr

🏢 Summary: Principal AI Serving Framework Architect role focused on designing and optimizing large-scale AI inference systems for multi-rack, memory-centric architectures. The position drives system-level performance modeling, dynamic scheduling, and software design for next-generation AI platforms, including LLM inference stacks and heterogeneous computing environments. The role combines technical leadership with hands-on architecture work to advance scalable, high-performance AI serving frameworks. 🗂️ Requirements: PhD in Computer Science or related field, 10+ years experience in AI serving frameworks for large-scale systems, Proven experience building and optimizing LLM inference software stack for multi-rack systems, Experience delivering AI inference services at large user scale, Expertise in designing inference stacks for heterogeneous devices, Deep understanding of vLLM or similar inference engines, Experience in AI inference system profiling and optimization, Strong knowledge of compute, memory, and networking bottlenecks in AI systems, Proficiency in dynamic scheduling for AI workloads, Experience implementing optimization algorithms on open-source platforms, Proficiency in PyTorch, Proficiency in Python, Proficiency in C++ 📃 Skills: PyTorch, Python, C++, vLLM, LLM, RAG, KVCache, Profiling, Optimization, Networking, Memory, SSD, VectorDB, Scheduling 🏢 Description: Please Note: To provide the best candidate experience amidst our high application volumes, each candidate is limited to 10 applications across all open jobs within a 6-month period. Advancing the World's Technology Together Our technology solutions power the tools you use every day--including smartphones, electric vehicles, hyperscale data centers, IoT devices, and so much more. Here, you'll have an opportunity to be part of a global leader whose innovative designs are pushing the boundaries of what's possible and powering the future. We believe innovation and growth are driven by an inclusive culture and a diverse workforce. We're dedicated to empowering people to be their true selves. Together, we're building a better tomorrow for our employees, customers, partners, and communities.Job Title: Principal engineer, AI Serving Framework Architect (Software) What You'll Do The Architecture Research Lab (ARL) focuses on addressing fundamental system-level bottlenecks in modern AI, particularly in memory capacity/bandwidth and system-scale communication. By leveraging Samsung's world-class memory technologies, ARL explores and defines next-generation AI system architectures that deliver step-function improvements in performance, efficiency, and scalability. We are seeking a Principal AI System Architect who will play a key role in bridging AI workloads, system architecture, and hardware design. In this role, you will develop system-level performance models, drive architecture-level design decisions, and propose forward-looking AI system architectures that shape Samsung's long-term AI platform strategy. Location: Daily onsite presence at our San Jose office in alignment with our Flexible Work policy Job ID: 42853 As a Tech Lead, leading research teams in Korea and proposing technical direction Research on dynamic scheduling methodologies for maximizing AI inference performance in multi-rack scale memory-centric systems, comprised of heterogeneous compute-capable memory and hierarchical memory Investigating methods to accelerate search operations in RAG's vector DB and AI Agent's knowledge-graph by leveraging compute-capable memory Studying strategies for optimally placing KVCache and a vector DB in hierarchical memory to minimize frequent SSD accesses and reduce IO stalls Proposing SW design for implementing the derived optimization algorithms on open-source platforms such as vLLM What You Bring PhD in Computer Science or a related field with 10+ years of experience in AI Serving Framework for large-scale computing, with focusing on the AI workloads. Led a project to build and optimize a Large Language Model (LLM) Inference Software Stack on a multi-rack scale system to deliver AI Inference services to over 100,000 users. Extensive experience in designing AI Inference Software Stacks for heterogeneous devices.In-depth understanding of the internal architecture and operation mechanisms of inference engines such as vLLM. Proficiency in AI Inference System Profiling and optimization. Knowledge and practical experience with future AI workloads, including reasoning models, multi-modal solutions, AI agents, and world models. Strong understanding of compute, memory, and networking bottlenecks in AI systems. Required skillsets: PyTorch, Python, and C++ A collaborative mindset, curiosity, and resilience in solving complex challenges. Excellent verbal, presentation, and written communication skills. (Nice to have) Native or fluent Korean speakers are preferred. You're inclusive, adapting your style to the situation and diverse global norms of our people. You approach challenges with curiosity and resilience, seeking data to help build. Understanding. You're collaborative, building relationships, humbly offering support and openly welcoming approaches. Innovative and creative, you proactively explore new ideas and adapt quickly to change #LI-SF1What We OfferThe pay range below is for all roles at this level across all US locations and functions. Paywithin this range varies by work locationand may also depend on job-related knowledge, skills,and experience. We also offer incentive opportunities that reward employees based on individual and company performance. This is in addition to our diverse package of benefits centered around the wellbeing of our employees and their loved ones. In addition to the usual Medical/Dental/Vision/401k, our inclusive rewards plan empowers our people to care for their whole selves. An investment in your future is an investment in ours. Give Back With a charitable giving match and frequent opportunities to get involved, we take an active role in supporting the community.Enjoy Time Away You'll start with 4+ weeks of paid time off a year, plus holidays and sick leave, to rest and recharge.Care for Family Whatever family means to you, we want to support you along the way—including a stipend for fertility care or adoption, medical travel support, and virtual vet care for your fur babies.Prioritize Emotional Wellness With on-demand apps and free confidential therapy sessions, you'll have support no matter where you are.Stay Fit Eating well and being active are important parts of a healthy life. Our onsite Café and gym, plus virtual classes, make it easier.Embrace Flexibility Benefits are best when you have the space to use them. That's why we facilitate a flexible environment so you can find the right balance for you.Base Pay Range$219,000—$351,000 USDEqual Opportunity Employment Policy Samsung Semiconductor takes pride in being an equal opportunity workplace dedicated to fostering an environment where all individuals feel valued and empowered to excel, regardless of race, religion, color, age, disability, sex, gender identity, sexual orientation, ancestry, genetic information, marital status, national origin, political affiliation, or veteran status. When selecting team members, we prioritize talent and qualities such as humility, kindness, and dedication. We extend comprehensive accommodations throughout our recruiting processes for candidates with disabilities, long-term conditions, neurodivergent individuals, or those requiring pregnancy-related support. All candidates scheduled for an interview will receive guidance on requesting accommodations. Recruiting Agency Policy We do not accept unsolicited resumes. Only authorized recruitment agencies that have a current and valid agreement with Samsung Semiconductor, Inc. are permitted to submit resumes for any job openings. Applicant AI Use Policy At Samsung Semiconductor, we support innovation and technology. However, to ensure a fair and authentic assessment, we prohibit the use of generative AI tools to misrepresent a candidate's true skills and qualifications. Permitted uses are limited to basic preparation, grammar, and research, but all submitted content and interview responses must reflect the candidate's genuine abilities and experience. Violation of this policy may result in immediate disqualification from the hiring process. Applicant Privacy Policyhttps://semiconductor.samsung.com/about-us/careers/us/privacy/