July 2, 2026
Principal Engineer, AI System Architect (Hardware)
Senior • On-site
240,000 - 249,996 USD/yr
San Jose, CA
Please Note: To provide the best candidate experience amidst high application volumes, each candidate is limited to 10 applications across all open jobs within a 6-month period.
Advancing the World's Technology Together
Our technology solutions power the tools you use every day—including smartphones, electric vehicles, hyperscale data centers, IoT devices, and more. Here, you'll have an opportunity to help shape the future of AI systems.
We believe innovation and growth are driven by an inclusive culture and a diverse workforce.
The Architecture Research Lab (ARL) focuses on addressing fundamental system-level bottlenecks in modern AI, particularly in memory capacity/bandwidth and system-scale communication. By leveraging advanced memory technologies, ARL explores and defines next-generation AI system architectures that improve performance, efficiency, and scalability.
We are seeking a Principal AI System Architect who will play a Technical Lead role in bridging AI workloads, system architecture, and hardware design. In this role, you will develop system-level performance models, drive architecture-level design decisions, and propose forward-looking AI system architectures.
Location: Daily onsite presence at the San Jose office in alignment with the Flexible Work policy.
What You'll Do
- Technically lead the architecture team with strong direction, shaping system-architecture strategy and advancing key innovations
- Conduct system-level architectural research for next-generation AI systems spanning compute, memory, and interconnect/network subsystems
- Develop and maintain analytical and simulation-based system modeling frameworks to evaluate AI workloads and identify performance, scalability, and efficiency bottlenecks
- Analyze emerging AI models including LLMs and DLRMs to derive architecture requirements and trade-offs across compute, memory, networking, and power
- Drive architecture-level design decisions through quantitative modeling, design-space exploration, and performance/power projections
- Perform comparative studies of alternative system architectures and report performance and performance-per-watt metrics
- Collaborate with cross-functional teams in hardware architecture, memory, interconnect, and system engineering
- Communicate architectural insights and recommendations through technical presentations and documentation
- Travel occasionally, both domestically and internationally (under 10% of the time)
What You Bring
- PhD in Computer Science, Electrical Engineering, or a related field preferred
- 15+ years of experience in system architecture for large-scale computing platforms focused on AI workloads
- Hands-on experience developing analytical and event-driven simulation models
- Deep understanding of AI system hardware architectures including compute, memory hierarchies, and high-performance interconnects
- Strong knowledge of LLMs, DLRMs, and large-scale AI training and inference systems
- Ability to translate workload characteristics and modeling results into architectural design decisions
- Proficiency in Python, C++, and PyTorch
- Excellent written, verbal, and presentation communication skills
- Collaborative mindset and resilience in solving complex system-level challenges
What We Offer
The pay range varies by work location and depends on experience and skills. Incentive opportunities are offered in addition to benefits.
- Medical, Dental, Vision, and 401k plans
- Charitable giving match and community involvement opportunities
- 4+ weeks of paid time off plus holidays and sick leave
- Fertility care or adoption stipend, medical travel support, and virtual vet care
- On-demand wellness apps and confidential therapy sessions
- Onsite café and gym plus virtual fitness classes
- Flexible work environment
Base Pay Range: $219,000—$351,000 USD
Equal Opportunity Employment Policy
Samsung Semiconductor is committed to fostering an inclusive workplace where all individuals feel valued and empowered to excel.
Our Commitment to Innovation and Fairness
AI tools may be used in the recruitment process as support tools, but all hiring decisions are made by human recruiters and hiring managers.
Applicant AI Use Policy
Candidates may use AI tools for preparation, grammar, and research, but not for generating submitted content or live interview responses.
Trade Secret Notice
By submitting an application, candidates agree not to disclose confidential or proprietary information belonging to current or former employers.
Applicant Privacy Policy: https://semiconductor.samsung.com/about-us/careers/us/privacy/
Similar jobs you might like
Technology

Samsung
Principal Engineer, AI System Architect (Hardware)
Senior
On-site
San Jose, CA
240,000 - 249,996 USD/yr
🏢 Summary: Principal AI System Architect role focused on researching and designing next-generation AI hardware systems for large-scale workloads. The position involves leading architecture strategy, developing system-level performance models, and evaluating compute, memory, and interconnect architectures for AI scalability and efficiency. The role requires deep expertise in AI system architecture, simulation modeling, and large-scale AI workloads. 🗂️ Requirements: Ph.D. in Computer Science, Electrical Engineering, or related field preferred, 15+ years of experience in system architecture for large-scale computing platforms, Hands-on experience developing analytical and event-driven simulation models, Deep understanding of AI hardware architectures, Knowledge of LLMs, DLRMs, and large-scale AI training and inference systems, Ability to translate workload characteristics into architectural design decisions, Proficiency in Python, Proficiency in C++, Proficiency in PyTorch, Strong technical communication and presentation skills 📃 Skills: Python, C++, PyTorch, AI, LLM, DLRM, Modeling, Simulation, Architecture, Hardware, Networking, Memory, Interconnect 🏢 Description: Please Note: To provide the best candidate experience amidst our high application volumes, each candidate is limited to 10 applications across all open jobs within a 6-month period. Advancing the World's Technology Together Our technology solutions power the tools you use every day--including smartphones, electric vehicles, hyperscale data centers, IoT devices, and so much more. Here, you'll have an opportunity to be part of a global leader whose innovative designs are pushing the boundaries of what's possible and powering the future. We believe innovation and growth are driven by an inclusive culture and a diverse workforce. We're dedicated to empowering people to be their true selves. Together, we're building a better tomorrow for our employees, customers, partners, and communities. 
What You'll Do The Architecture Research Lab (ARL) focuses on addressing fundamental system-level bottlenecks in modern AI, particularly in memory capacity/bandwidth and system-scale communication. By leveraging world-class memory technologies, ARL explores and defines next-generation AI system architectures that deliver step-function improvements in performance, efficiency, and scalability. We are seeking a Principal AI System Architect who will play a Technical Lead role in bridging AI workloads, system architecture, and hardware design. In this role, you will develop system-level performance models, drive architecture-level design decisions, and propose forward-looking AI system architectures. Location: Daily onsite presence at the San Jose office in alignment with the Flexible Work policy. Responsibilities: - Technically lead the architecture team with strong direction, shaping system-architecture strategy and advancing key innovations - Conduct system-level architectural research for next-generation AI systems spanning compute, memory, and interconnect/network subsystems - Develop and maintain analytical and simulation-based system modeling frameworks to evaluate AI workloads and identify performance, scalability, and efficiency bottlenecks - Analyze representative and emerging AI workloads including LLMs, DLRMs, and future AI models - Drive architecture-level design decisions through quantitative modeling, design-space exploration, and performance/power projections - Perform comparative studies of alternative system architectures and report performance and efficiency metrics - Collaborate with hardware architecture, memory, interconnect, and system engineering teams - Communicate architectural insights and recommendations through technical presentations and documentation - Occasional domestic and international travel under 10% What You Bring - Ph.D. 
in Computer Science, Electrical Engineering, or a related field preferred - 15+ years of experience in system architecture for large-scale computing platforms with a focus on AI workloads - Hands-on experience developing analytical and event-driven simulation models - Deep understanding of AI system hardware architectures including compute, memory hierarchies, and high-performance interconnects - Strong knowledge of modern and emerging AI workloads including LLMs, DLRMs, and large-scale training and inference systems - Ability to translate workload characteristics and modeling results into architectural design decisions - Proficiency in Python, C++, and PyTorch - Excellent written, verbal, and presentation communication skills - Collaborative mindset and resilience in solving complex system-level challenges What We Offer - Competitive compensation with incentive opportunities - Medical, Dental, Vision, and 401(k) - Charitable giving match and community involvement opportunities - 4+ weeks of paid time off plus holidays and sick leave - Family support benefits including fertility and adoption support - Emotional wellness resources and therapy sessions - Onsite café, gym, and virtual fitness classes - Flexible work environment Base Pay Range $219,000—$351,000 USD Equal Opportunity Employment Policy Samsung Semiconductor is an equal opportunity workplace dedicated to fostering an environment where all individuals feel valued and empowered to excel. Our Commitment to Innovation and Fairness AI tools may be used in the recruitment process as support tools, but all hiring decisions are made by human recruiting teams and hiring managers. Applicant AI Use Policy Candidates may use AI tools for basic preparation, grammar, and research, but not for submitted content or live interview responses. Trade Secret Notice By submitting an application, candidates agree not to disclose confidential or proprietary information belonging to current or former employers or other entities.
Technology

Samsung
Principal Engineer, AI System Architect (Hardware)
Senior
On-site
San Jose, CA
240,000 - 249,996 USD/yr
🏢 Summary: Principal Engineer role focused on leading AI system architecture research and driving system-level design decisions for next-generation AI platforms. The position centers on modeling, evaluating, and optimizing compute, memory, and interconnect subsystems to improve performance, scalability, and efficiency of large-scale AI workloads. The role bridges AI workloads, hardware architecture, and quantitative system modeling to shape long-term AI infrastructure strategy. 🗂️ Requirements: PhD in Computer Science, Electrical Engineering, or related field, 15+ years of experience in system architecture for large-scale computing platforms, Hands-on experience with analytical and event-driven system-level performance modeling, Deep understanding of AI hardware architectures including compute, memory hierarchies, and interconnects, Strong knowledge of modern AI workloads such as LLMs and DLRMs, Experience translating workload analysis into architectural design decisions, Proficiency in Python, C++, and PyTorch, Ability to work onsite in San Jose 📃 Skills: Python, C++, PyTorch, AI, LLMs, DLRMs, Modeling, Simulation, Architecture, Hardware, Compute, Memory, Interconnects, Networking, Performance, Scalability, Power 🏢 Description: Please Note: To provide the best candidate experience amidst our high application volumes, each candidate is limited to 10 applications across all open jobs within a 6-month period. Advancing the World's Technology Together Our technology solutions power the tools you use every day--including smartphones, electric vehicles, hyperscale data centers, IoT devices, and so much more. Here, you'll have an opportunity to be part of a global leader whose innovative designs are pushing the boundaries of what's possible and powering the future. We believe innovation and growth are driven by an inclusive culture and a diverse workforce. We're dedicated to empowering people to be their true selves. 
Together, we're building a better tomorrow for our employees, customers, partners, and communities.
Job Title: Principal Engineer, AI System Architect (Hardware)
The Architecture Research Lab (ARL) focuses on addressing fundamental system-level bottlenecks in modern AI, particularly in memory capacity/bandwidth and system-scale communication. By leveraging Samsung's world-class memory technologies, ARL explores and defines next-generation AI system architectures that deliver step-function improvements in performance, efficiency, and scalability.
We are seeking a Principal AI System Architect who will play a Technical Lead role in bridging AI workloads, system architecture, and hardware design. In this role, you will develop system-level performance models, drive architecture-level design decisions, and propose forward-looking AI system architectures that shape Samsung's long-term AI platform strategy.
Location: Daily onsite presence at our San Jose office in alignment with our Flexible Work policy
Job ID: 42852
What You'll Do
- Technically lead the architecture team with strong direction, shaping system-architecture strategy and advancing key innovations.
- Conduct system-level architectural research for next-generation AI systems, spanning compute, memory, and interconnect/network subsystems.
- Develop and maintain analytical and simulation-based system modeling frameworks to evaluate AI workloads and identify performance, scalability, and efficiency bottlenecks at rack- and system-scale.
- Analyze representative and emerging AI workloads (e.g., LLMs, DLRMs, and future AI models) to derive architecture requirements and trade-offs across compute, memory, networking, and power.
- Drive architecture-level design decisions through quantitative modeling, design-space exploration, and performance/power projections.
- Perform comparative studies of alternative system architectures, reporting performance and performance-per-watt metrics to guide strategic technology choices.
- Collaborate closely with cross-functional teams in hardware architecture, memory, interconnect, and system engineering to align modeling insights with implementation realities.
- Communicate architectural insights and recommendations through clear technical presentations and documentation.
- Occasional domestic and international travel (<10%).
What You Bring
- Ph.D. in Computer Science, Electrical Engineering, or a related field preferred, with 15+ years of experience in system architecture for large-scale computing platforms and a strong focus on AI workloads.
- Proven hands-on experience developing analytical and event-driven simulation models for system-level performance evaluation.
- Deep understanding of AI system hardware architectures, including compute, memory hierarchies, and high-performance interconnects.
- Strong knowledge of modern and emerging AI workloads, including LLMs, DLRMs, and large-scale training and inference systems.
- Demonstrated ability to translate workload characteristics and modeling results into actionable architectural design decisions.
- Proficiency in Python, C++, and PyTorch for modeling, analysis, and experimentation.
- Excellent written, verbal, and presentation communication skills, with the ability to influence technical direction across teams.
- A collaborative mindset, intellectual curiosity, and resilience in tackling complex, open-ended system-level challenges.
- You're inclusive, adapting your style to the situation and diverse global norms of our people.
- You approach challenges with curiosity and resilience, seeking data to help build understanding.
- You're collaborative, building relationships, humbly offering support and openly welcoming approaches.
- Innovative and creative, you proactively explore new ideas and adapt quickly to change.
What We Offer
The pay range below is for all roles at this level across all US locations and functions. Pay within this range varies by work location and may also depend on job-related knowledge, skills, and experience. We also offer incentive opportunities that reward employees based on individual and company performance. This is in addition to our diverse package of benefits centered around the wellbeing of our employees and their loved ones.
In addition to the usual Medical/Dental/Vision/401k, our inclusive rewards plan empowers our people to care for their whole selves. An investment in your future is an investment in ours.
- Give Back: With a charitable giving match and frequent opportunities to get involved, we take an active role in supporting the community.
- Enjoy Time Away: You'll start with 4+ weeks of paid time off a year, plus holidays and sick leave, to rest and recharge.
- Care for Family: Whatever family means to you, we want to support you along the way, including a stipend for fertility care or adoption, medical travel support, and virtual vet care for your fur babies.
- Prioritize Emotional Wellness: With on-demand apps and free confidential therapy sessions, you'll have support no matter where you are.
- Stay Fit: Eating well and being active are important parts of a healthy life. Our onsite Café and gym, plus virtual classes, make it easier.
- Embrace Flexibility: Benefits are best when you have the space to use them. That's why we facilitate a flexible environment so you can find the right balance for you.
Base Pay Range: $219,000 - $351,000 USD
Equal Opportunity Employment Policy
Samsung Semiconductor takes pride in being an equal opportunity workplace dedicated to fostering an environment where all individuals feel valued and empowered to excel, regardless of race, religion, color, age, disability, sex, gender identity, sexual orientation, ancestry, genetic information, marital status, national origin, political affiliation, or veteran status. When selecting team members, we prioritize talent and qualities such as humility, kindness, and dedication. We extend comprehensive accommodations throughout our recruiting processes for candidates with disabilities, long-term conditions, neurodivergent individuals, or those requiring pregnancy-related support. All candidates scheduled for an interview will receive guidance on requesting accommodations.
Recruiting Agency Policy
We do not accept unsolicited resumes. Only authorized recruitment agencies that have a current and valid agreement with Samsung Semiconductor, Inc. are permitted to submit resumes for any job openings.
Applicant AI Use Policy
At Samsung Semiconductor, we support innovation and technology. However, to ensure a fair and authentic assessment, we prohibit the use of generative AI tools to misrepresent a candidate's true skills and qualifications. Permitted uses are limited to basic preparation, grammar, and research, but all submitted content and interview responses must reflect the candidate's genuine abilities and experience. Violation of this policy may result in immediate disqualification from the hiring process.
Applicant Privacy Policy: https://semiconductor.samsung.com/about-us/careers/us/privacy/
Technology

Samsung
Principal Engineer, AI Serving Framework Architect (Software)
Senior
On-site
San Jose, CA
240,000 - 249,996 USD/yr
🏢 Summary: Principal AI Serving Framework Architect role focused on designing and optimizing large-scale AI inference systems, memory-centric architectures, and AI serving frameworks for multi-rack environments. The position involves leading research initiatives, developing performance optimization strategies, and contributing to AI infrastructure design using technologies such as vLLM, PyTorch, Python, and C++. 🗂️ Requirements: PhD in Computer Science or related field, 10+ years of experience in AI Serving Frameworks for large-scale computing, Experience leading LLM inference software stack projects at multi-rack scale, Experience delivering AI inference services for 100,000+ users, Expertise in AI inference software stacks for heterogeneous devices, Deep understanding of inference engines such as vLLM, Experience in AI inference system profiling and optimization, Knowledge of reasoning models, multimodal AI, AI agents, and world models, Strong understanding of compute, memory, and networking bottlenecks in AI systems, Proficiency in PyTorch, Proficiency in Python, Proficiency in C++, Excellent verbal and written communication skills 📃 Skills: PyTorch, Python, C++, vLLM, LLM, RAG, SSD, KVCache, AI, Inference 🏢 Description: Please Note: To provide the best candidate experience amidst our high application volumes, each candidate is limited to 10 applications across all open jobs within a 6-month period. Advancing the World's Technology Together Our technology solutions power the tools you use every day—including smartphones, electric vehicles, hyperscale data centers, IoT devices, and more. Here, you'll have an opportunity to be part of a global leader whose innovative designs are pushing the boundaries of what's possible and powering the future. We believe innovation and growth are driven by an inclusive culture and a diverse workforce. We're dedicated to empowering people to be their true selves. 
Together, we're building a better tomorrow for our employees, customers, partners, and communities. Job Title: Principal engineer, AI Serving Framework Architect (Software) The Architecture Research Lab (ARL) focuses on addressing fundamental system-level bottlenecks in modern AI, particularly in memory capacity/bandwidth and system-scale communication. By leveraging world-class memory technologies, the lab explores and defines next-generation AI system architectures that deliver improvements in performance, efficiency, and scalability. We are seeking a Principal AI System Architect who will play a key role in bridging AI workloads, system architecture, and hardware design. In this role, you will develop system-level performance models, drive architecture-level design decisions, and propose forward-looking AI system architectures. Location: Daily onsite presence at the San Jose office in alignment with the Flexible Work policy. What You'll Do - Lead research teams and propose technical direction - Research dynamic scheduling methodologies for maximizing AI inference performance in multi-rack scale memory-centric systems - Investigate methods to accelerate search operations in RAG vector databases and AI agent knowledge graphs using compute-capable memory - Study strategies for optimally placing KVCache and vector databases in hierarchical memory to minimize SSD access and reduce IO stalls - Propose software designs for implementing optimization algorithms on open-source platforms such as vLLM What You Bring - PhD in Computer Science or related field with 10+ years of experience in AI Serving Frameworks for large-scale computing - Experience leading projects to build and optimize LLM inference software stacks on multi-rack scale systems serving over 100,000 users - Extensive experience designing AI inference software stacks for heterogeneous devices - In-depth understanding of inference engines such as vLLM - Proficiency in AI inference system profiling and 
optimization - Knowledge of future AI workloads including reasoning models, multimodal solutions, AI agents, and world models - Strong understanding of compute, memory, and networking bottlenecks in AI systems - Required skills: PyTorch, Python, and C++ - Collaborative mindset and strong communication skills - Native or fluent Korean is preferred What We Offer - Competitive compensation with incentive opportunities - Medical, Dental, Vision, and 401(k) - Charitable giving match and community involvement opportunities - 4+ weeks of paid time off, holidays, and sick leave - Family support benefits including fertility, adoption, and medical travel assistance - Emotional wellness support and confidential therapy sessions - Onsite café, gym, and virtual fitness classes - Flexible work environment Base Pay Range $219,000—$351,000 USD Equal Opportunity Employment Policy Samsung Semiconductor is committed to fostering an inclusive workplace where all individuals are valued and empowered to excel. Our Commitment to Innovation and Fairness AI tools may be used as support tools during recruitment, but all hiring decisions are made by human recruiters and hiring managers. Applicant AI Use Policy Candidates may use AI tools for preparation and research but not for generating submitted content or live interview responses. Trade Secret Notice By submitting an application, candidates agree not to disclose confidential or proprietary information belonging to current or former employers.
Technology

Samsung
Principal engineer, AI Serving Framework Architect (Software)
Senior
On-site
San Jose, CA
240,000 - 249,996 USD/yr
🏢 Summary: Principal AI Serving Framework Architect role focused on designing and optimizing large-scale AI inference systems for multi-rack, memory-centric architectures. The position drives system-level performance modeling, dynamic scheduling, and software design for next-generation AI platforms, including LLM inference stacks and heterogeneous computing environments. The role combines technical leadership with hands-on architecture work to advance scalable, high-performance AI serving frameworks. 🗂️ Requirements: PhD in Computer Science or related field, 10+ years experience in AI serving frameworks for large-scale systems, Proven experience building and optimizing LLM inference software stack for multi-rack systems, Experience delivering AI inference services at large user scale, Expertise in designing inference stacks for heterogeneous devices, Deep understanding of vLLM or similar inference engines, Experience in AI inference system profiling and optimization, Strong knowledge of compute, memory, and networking bottlenecks in AI systems, Proficiency in dynamic scheduling for AI workloads, Experience implementing optimization algorithms on open-source platforms, Proficiency in PyTorch, Proficiency in Python, Proficiency in C++ 📃 Skills: PyTorch, Python, C++, vLLM, LLM, RAG, KVCache, Profiling, Optimization, Networking, Memory, SSD, VectorDB, Scheduling 🏢 Description: Please Note: To provide the best candidate experience amidst our high application volumes, each candidate is limited to 10 applications across all open jobs within a 6-month period. Advancing the World's Technology Together Our technology solutions power the tools you use every day--including smartphones, electric vehicles, hyperscale data centers, IoT devices, and so much more. Here, you'll have an opportunity to be part of a global leader whose innovative designs are pushing the boundaries of what's possible and powering the future. 
We believe innovation and growth are driven by an inclusive culture and a diverse workforce. We're dedicated to empowering people to be their true selves. Together, we're building a better tomorrow for our employees, customers, partners, and communities.
Job Title: Principal Engineer, AI Serving Framework Architect (Software)
What You'll Do
The Architecture Research Lab (ARL) focuses on addressing fundamental system-level bottlenecks in modern AI, particularly in memory capacity/bandwidth and system-scale communication. By leveraging Samsung's world-class memory technologies, ARL explores and defines next-generation AI system architectures that deliver step-function improvements in performance, efficiency, and scalability.
We are seeking a Principal AI System Architect who will play a key role in bridging AI workloads, system architecture, and hardware design. In this role, you will develop system-level performance models, drive architecture-level design decisions, and propose forward-looking AI system architectures that shape Samsung's long-term AI platform strategy.
Location: Daily onsite presence at our San Jose office in alignment with our Flexible Work policy
Job ID: 42853
- Serve as Tech Lead, leading research teams in Korea and proposing technical direction.
- Research dynamic scheduling methodologies for maximizing AI inference performance in multi-rack-scale memory-centric systems composed of heterogeneous compute-capable memory and hierarchical memory.
- Investigate methods to accelerate search operations in RAG vector DBs and AI agent knowledge graphs by leveraging compute-capable memory.
- Study strategies for optimally placing KVCache and a vector DB in hierarchical memory to minimize frequent SSD accesses and reduce IO stalls.
- Propose SW designs for implementing the derived optimization algorithms on open-source platforms such as vLLM.
What You Bring
- PhD in Computer Science or a related field with 10+ years of experience in AI serving frameworks for large-scale computing, with a focus on AI workloads.
- Experience leading a project to build and optimize a Large Language Model (LLM) inference software stack on a multi-rack-scale system to deliver AI inference services to over 100,000 users.
- Extensive experience in designing AI inference software stacks for heterogeneous devices.
- In-depth understanding of the internal architecture and operation mechanisms of inference engines such as vLLM.
- Proficiency in AI inference system profiling and optimization.
- Knowledge and practical experience with future AI workloads, including reasoning models, multimodal solutions, AI agents, and world models.
- Strong understanding of compute, memory, and networking bottlenecks in AI systems.
- Required skill sets: PyTorch, Python, and C++.
- A collaborative mindset, curiosity, and resilience in solving complex challenges.
- Excellent verbal, presentation, and written communication skills.
- (Nice to have) Native or fluent Korean speakers are preferred.
- You're inclusive, adapting your style to the situation and diverse global norms of our people.
You approach challenges with curiosity and resilience, seeking data to help build. Understanding. You're collaborative, building relationships, humbly offering support and openly welcoming approaches. Innovative and creative, you proactively explore new ideas and adapt quickly to change #LI-SF1What We OfferThe pay range below is for all roles at this level across all US locations and functions. Paywithin this range varies by work locationand may also depend on job-related knowledge, skills,and experience. We also offer incentive opportunities that reward employees based on individual and company performance. This is in addition to our diverse package of benefits centered around the wellbeing of our employees and their loved ones. In addition to the usual Medical/Dental/Vision/401k, our inclusive rewards plan empowers our people to care for their whole selves. An investment in your future is an investment in ours. Give Back With a charitable giving match and frequent opportunities to get involved, we take an active role in supporting the community.Enjoy Time Away You'll start with 4+ weeks of paid time off a year, plus holidays and sick leave, to rest and recharge.Care for Family Whatever family means to you, we want to support you along the way—including a stipend for fertility care or adoption, medical travel support, and virtual vet care for your fur babies.Prioritize Emotional Wellness With on-demand apps and free confidential therapy sessions, you'll have support no matter where you are.Stay Fit Eating well and being active are important parts of a healthy life. Our onsite Café and gym, plus virtual classes, make it easier.Embrace Flexibility Benefits are best when you have the space to use them. 
That's why we facilitate a flexible environment so you can find the right balance for you.
Base Pay Range: $219,000–$351,000 USD
Equal Opportunity Employment Policy
Samsung Semiconductor takes pride in being an equal opportunity workplace dedicated to fostering an environment where all individuals feel valued and empowered to excel, regardless of race, religion, color, age, disability, sex, gender identity, sexual orientation, ancestry, genetic information, marital status, national origin, political affiliation, or veteran status. When selecting team members, we prioritize talent and qualities such as humility, kindness, and dedication. We extend comprehensive accommodations throughout our recruiting processes for candidates with disabilities, long-term conditions, neurodivergent individuals, or those requiring pregnancy-related support. All candidates scheduled for an interview will receive guidance on requesting accommodations.
Recruiting Agency Policy
We do not accept unsolicited resumes. Only authorized recruitment agencies that have a current and valid agreement with Samsung Semiconductor, Inc. are permitted to submit resumes for any job openings.
Applicant AI Use Policy
At Samsung Semiconductor, we support innovation and technology. However, to ensure a fair and authentic assessment, we prohibit the use of generative AI tools to misrepresent a candidate's true skills and qualifications. Permitted uses are limited to basic preparation, grammar, and research, but all submitted content and interview responses must reflect the candidate's genuine abilities and experience. Violation of this policy may result in immediate disqualification from the hiring process.
Applicant Privacy Policy: https://semiconductor.samsung.com/about-us/careers/us/privacy/
Technology

Samsung
Staff Engineer, AI System Architect (Hardware)
Senior
On-site
San Jose, CA
🏢 Summary: Senior Staff AI System Architect role focused on researching and defining next-generation AI hardware system architectures to overcome memory, bandwidth, and interconnect bottlenecks. The position involves building system-level performance models, analyzing large-scale AI workloads, and driving architecture decisions for scalable, high-performance AI platforms. The role bridges AI workloads, hardware architecture, and system modeling to shape long-term AI platform strategy. 🗂️ Requirements: Ph.D. in Computer Science, Electrical Engineering, or related field, 5+ years experience in system architecture for large-scale computing platforms, Hands-on experience with analytical and event-driven simulation models, Deep understanding of AI hardware architectures (compute, memory, interconnect), Strong knowledge of LLMs, DLRMs, and large-scale AI training and inference systems, Ability to perform system-level performance, scalability, and power modeling, Proficiency in Python, C++, and PyTorch, Experience translating workload analysis into architectural design decisions 📃 Skills: Python, C++, PyTorch, AI, LLM, DLRM, Modeling, Simulation, Compute, Memory, Interconnect, Networking, Architecture, Performance, Scalability, Power 🏢 Description: Please Note: To provide the best candidate experience amidst our high application volumes, each candidate is limited to 10 applications across all open jobs within a 6-month period. Advancing the World's Technology Together Our technology solutions power the tools you use every day--including smartphones, electric vehicles, hyperscale data centers, IoT devices, and so much more. Here, you'll have an opportunity to be part of a global leader whose innovative designs are pushing the boundaries of what's possible and powering the future. We believe innovation and growth are driven by an inclusive culture and a diverse workforce. We're dedicated to empowering people to be their true selves. 
Together, we're building a better tomorrow for our employees, customers, partners, and communities.
Job Title: Staff Engineer, AI System Architect (Hardware)
What You'll Do
The Architecture Research Lab (ARL) focuses on addressing fundamental system-level bottlenecks in modern AI, particularly in memory capacity/bandwidth and system-scale communication. By leveraging Samsung's world-class memory technologies, ARL explores and defines next-generation AI system architectures that deliver step-function improvements in performance, efficiency, and scalability. We are seeking a Senior Staff AI System Architect who will play a key role in bridging AI workloads, system architecture, and hardware design. In this role, you will develop system-level performance models, drive architecture-level design decisions, and propose forward-looking AI system architectures that shape Samsung's long-term AI platform strategy.
Location: Daily onsite presence at our San Jose office in alignment with our Flexible Work policy
Job ID: 42917
• Conduct system-level architectural research for next-generation AI systems, spanning compute, memory, and interconnect/network subsystems.
• Develop and maintain analytical and simulation-based system modeling frameworks to evaluate AI workloads and identify performance, scalability, and efficiency bottlenecks at rack- and system-scale.
• Analyze representative and emerging AI workloads (e.g., LLMs, DLRMs, and future AI models) to derive architecture requirements and trade-offs across compute, memory, networking, and power.
• Drive architecture-level design decisions through quantitative modeling, design-space exploration, and performance/power projections.
• Perform comparative studies of alternative system architectures, reporting performance and performance-per-watt metrics to guide strategic technology choices.
• Collaborate closely with cross-functional teams in hardware architecture, memory, interconnect, and system engineering to align modeling insights with implementation realities.
• Communicate architectural insights and recommendations through clear technical presentations and documentation.
• Occasional domestic and international travel (<10%).
What You Bring
• Ph.D. in Computer Science, Electrical Engineering, or a related field, with 5+ years of experience in system architecture for large-scale computing platforms, with a strong focus on AI workloads.
• Proven hands-on experience developing analytical and event-driven simulation models for system-level performance evaluation.
• Deep understanding of AI system hardware architectures, including compute, memory hierarchies, and high-performance interconnects.
• Strong knowledge of modern and emerging AI workloads, including LLMs, DLRMs, and large-scale training and inference systems.
• Demonstrated ability to translate workload characteristics and modeling results into actionable architectural design decisions.
• Proficiency in Python, C++, and PyTorch for modeling, analysis, and experimentation.
• Excellent written, verbal, and presentation communication skills, with the ability to influence technical direction across teams.
• A collaborative mindset, intellectual curiosity, and resilience in tackling complex, open-ended system-level challenges.
You're inclusive, adapting your style to the situation and diverse global norms of our people. You approach challenges with curiosity and resilience, seeking data to help build understanding. You're collaborative, building relationships, humbly offering support and openly welcoming approaches.
#LI-SF1
What We Offer
The pay range below is for all roles at this level across all US locations and functions. Pay within this range varies by work location and may also depend on job-related knowledge, skills, and experience.
We also offer incentive opportunities that reward employees based on individual and company performance. This is in addition to our diverse package of benefits centered around the wellbeing of our employees and their loved ones. In addition to the usual Medical/Dental/Vision/401k, our inclusive rewards plan empowers our people to care for their whole selves. An investment in your future is an investment in ours.
- Give Back: With a charitable giving match and frequent opportunities to get involved, we take an active role in supporting the community.
- Enjoy Time Away: You'll start with 4+ weeks of paid time off a year, plus holidays and sick leave, to rest and recharge.
- Care for Family: Whatever family means to you, we want to support you along the way, including a stipend for fertility care or adoption, medical travel support, and virtual vet care for your fur babies.
- Prioritize Emotional Wellness: With on-demand apps and free confidential therapy sessions, you'll have support no matter where you are.
- Stay Fit: Eating well and being active are important parts of a healthy life. Our onsite Café and gym, plus virtual classes, make it easier.
- Embrace Flexibility: Benefits are best when you have the space to use them. That's why we facilitate a flexible environment so you can find the right balance for you.
Base Pay Range: $163,000–$253,000 USD
Equal Opportunity Employment Policy
Samsung Semiconductor takes pride in being an equal opportunity workplace dedicated to fostering an environment where all individuals feel valued and empowered to excel, regardless of race, religion, color, age, disability, sex, gender identity, sexual orientation, ancestry, genetic information, marital status, national origin, political affiliation, or veteran status. When selecting team members, we prioritize talent and qualities such as humility, kindness, and dedication.
We extend comprehensive accommodations throughout our recruiting processes for candidates with disabilities, long-term conditions, neurodivergent individuals, or those requiring pregnancy-related support. All candidates scheduled for an interview will receive guidance on requesting accommodations.
Recruiting Agency Policy
We do not accept unsolicited resumes. Only authorized recruitment agencies that have a current and valid agreement with Samsung Semiconductor, Inc. are permitted to submit resumes for any job openings.
Applicant AI Use Policy
At Samsung Semiconductor, we support innovation and technology. However, to ensure a fair and authentic assessment, we prohibit the use of generative AI tools to misrepresent a candidate's true skills and qualifications. Permitted uses are limited to basic preparation, grammar, and research, but all submitted content and interview responses must reflect the candidate's genuine abilities and experience. Violation of this policy may result in immediate disqualification from the hiring process.
Applicant Privacy Policy: https://semiconductor.samsung.com/about-us/careers/us/privacy/
Technology
New offer

Samsung
Staff Engineer, AI System Architect (Hardware)
Senior
On-site
San Jose, CA
🏢 Summary: Senior Staff AI System Architect role focused on researching and designing next-generation AI systems for large-scale workloads. The position involves building system-level performance models, analyzing AI architectures, and driving design decisions across compute, memory, and networking subsystems. The role requires strong expertise in AI hardware architectures, simulation modeling, and AI workloads such as LLMs and DLRMs. 🗂️ Requirements: PhD in Computer Science, Electrical Engineering, or related field, 5+ years of experience in system architecture for large-scale computing platforms, Experience developing analytical and event-driven simulation models, Deep understanding of AI hardware architectures, Knowledge of LLMs, DLRMs, and large-scale AI training and inference systems, Ability to translate workload analysis into architectural decisions, Proficiency in Python, Proficiency in C++, Proficiency in PyTorch, Strong technical communication and presentation skills 📃 Skills: Python, C++, PyTorch, AI, LLM, DLRM, Simulation, Modeling, Architecture, Networking, Memory, Compute 🏢 Description: Advancing the World's Technology Together Our technology solutions power the tools you use every day—including smartphones, electric vehicles, hyperscale data centers, IoT devices, and more. The Architecture Research Lab (ARL) focuses on addressing fundamental system-level bottlenecks in modern AI, particularly in memory capacity/bandwidth and system-scale communication. By leveraging advanced memory technologies, the team explores and defines next-generation AI system architectures that deliver major improvements in performance, efficiency, and scalability. We are seeking a Senior Staff AI System Architect who will play a key role in bridging AI workloads, system architecture, and hardware design. In this role, you will develop system-level performance models, drive architecture-level design decisions, and propose forward-looking AI system architectures. 
Location: Daily onsite presence at the San Jose office in alignment with the Flexible Work policy. What You'll Do - Conduct system-level architectural research for next-generation AI systems spanning compute, memory, and interconnect/network subsystems. - Develop and maintain analytical and simulation-based system modeling frameworks to evaluate AI workloads and identify performance, scalability, and efficiency bottlenecks. - Analyze representative and emerging AI workloads including LLMs, DLRMs, and future AI models. - Drive architecture-level design decisions through quantitative modeling, design-space exploration, and performance/power projections. - Perform comparative studies of alternative system architectures and report performance-per-watt metrics. - Collaborate with cross-functional teams in hardware architecture, memory, interconnect, and system engineering. - Communicate architectural insights and recommendations through technical presentations and documentation. - Occasional domestic and international travel under 10%. What You Bring - PhD in Computer Science, Electrical Engineering, or a related field. - 5+ years of experience in system architecture for large-scale computing platforms focused on AI workloads. - Hands-on experience developing analytical and event-driven simulation models. - Deep understanding of AI system hardware architectures, including compute, memory hierarchies, and high-performance interconnects. - Strong knowledge of modern and emerging AI workloads including LLMs, DLRMs, and large-scale training and inference systems. - Ability to translate workload characteristics and modeling results into actionable architectural decisions. - Proficiency in Python, C++, and PyTorch. - Excellent written, verbal, and presentation communication skills. - Collaborative mindset and ability to tackle complex system-level challenges. What We Offer - Competitive base pay range of $163,000–$253,000 USD. 
- Incentive opportunities based on individual and company performance. - Medical, Dental, Vision, and 401(k) benefits. - 4+ weeks of paid time off plus holidays and sick leave. - Support for fertility care, adoption, medical travel, and virtual pet care. - Emotional wellness support including therapy sessions and wellness apps. - Onsite café, gym, and virtual fitness classes. - Flexible work environment and community giving opportunities. Equal Opportunity Employment Policy Samsung Semiconductor is committed to fostering an inclusive workplace and providing accommodations throughout the recruiting process for candidates with disabilities or other support needs. Our Commitment to Innovation and Fairness AI tools may be used in the recruitment process as support tools, but hiring decisions are made by human recruiting teams and hiring managers. Trade Secret Notice By submitting an application, candidates agree not to disclose confidential or proprietary information belonging to current or former employers.
Technology

Samsung
Senior Director, Architecture Research Lab
Senior
On-site
San Jose, CA
🏢 Summary: Lead research and architecture design for next-generation AI systems, focusing on rack-scale co-design of AI workloads, memory, interconnects, and high-performance RISC-V CPUs. The role drives system-level modeling, simulation, and micro-architecture innovation to eliminate memory and bandwidth bottlenecks in large-scale AI platforms. It combines deep technical leadership with hands-on architecture research and cross-functional collaboration. 🗂️ Requirements: Ph.D. in Computer Science, Electrical Engineering, or related field, 10+ years in system-level architecture research or large-scale computing platform design, Expertise in AI workload-focused system architecture design, Strong experience in performance modeling and event-driven simulation, Deep knowledge of RISC-V, ARM, or x86 CPU architectures, Experience designing out-of-order CPU micro-architectures, Proficiency in transaction-level modeling (TLM), Experience with large-scale design-space exploration and PPA analysis, Hands-on programming experience in Python and C++, Ability to lead technical research teams and define architecture roadmaps 📃 Skills: RISC-V, ARM, x86, TLM, Python, C++, PPA, Simulation, Modeling, Microarchitecture 🏢 Description: Please Note: To provide the best candidate experience amidst our high application volumes, each candidate is limited to 10 applications across all open jobs within a 6-month period. Advancing the World's Technology Together Our technology solutions power the tools you use every day--including smartphones, electric vehicles, hyperscale data centers, IoT devices, and so much more. Here, you'll have an opportunity to be part of a global leader whose innovative designs are pushing the boundaries of what's possible and powering the future. We believe innovation and growth are driven by an inclusive culture and a diverse workforce. We're dedicated to empowering people to be their true selves. 
Together, we're building a better tomorrow for our employees, customers, partners, and communities.
What You'll Do
Lead cutting-edge research on next-generation AI system architectures. The role is responsible for end-to-end co-design of AI workloads, system-level modeling, hardware platforms, and high-performance processors that leverage Samsung's advanced memory technologies to eliminate capacity, bandwidth, and large-scale communication bottlenecks.
Location: Daily onsite presence at our San Jose Headquarters in alignment with our Flexible Work policy
AI System Architecture Leadership
- Define system-level architectures that solve memory-capacity, bandwidth, and interconnect challenges for large AI workloads (e.g., large language models, recommendation systems).
- Build and maintain analytical and event-driven simulation frameworks for compute-memory-network performance at rack scale.
- Conduct design-space exploration and quantitative trade-off studies (performance, power, cost) to guide architecture decisions.
- Partner with SAIT HQ teams to align modeling insights with real-world AI system implementations.
RISC-V CPU Architecture Leadership
- Architect high-performance, out-of-order RISC-V CPU cores that serve as host processors for AI computing systems.
- Drive IPC-focused feature path-finding; lead micro-architecture research through performance-model simulation and workload analysis.
- Produce detailed micro-architecture specifications and guide cache/memory hierarchy design for optimal AI workload execution.
Research & Innovation Management
- Lead a multidisciplinary research team, set technical roadmaps, and ensure delivery of high-impact publications.
- Present architectural insights and strategic recommendations to senior leadership and external partners.
What You Bring
Education: Ph.D. in Computer Science, Electrical Engineering, or a related field.
Experience: 10+ years in system-level architecture research or large-scale computing platform design, with a strong focus on AI workloads.
Technical Expertise:
- Performance modeling, event-driven simulation, and quantitative analysis of compute-memory-interconnect systems.
- Deep knowledge of modern CPU/accelerator architectures (RISC-V, ARM, x86) and heterogeneous integration.
- Proven ability to design rack-scale AI system architectures that address memory, bandwidth, and interconnect constraints.
- Proficiency with transaction-level modeling (TLM) and event-driven simulation for compute-memory-network co-design.
- Experience in large-scale design-space exploration and PPA (performance, power, area/cost) trade-off analysis.
- Expertise in architecture and micro-architecture design of out-of-order CPUs.
- Hands-on experience with simulation tools and programming languages such as Python and C++.
Communication Skills: Excellent written and verbal communication; proven ability to deliver technical presentations to senior stakeholders.
Preferred Experience
- Proven track record of publishing high-impact research papers.
- Experience influencing product roadmaps through architectural recommendations.
- Strong collaborative history with cross-functional teams (hardware, software, memory technology).
You're inclusive, adapting your style to the situation and diverse global norms of our people. An avid learner, you approach challenges with curiosity and resilience, seeking data to help build understanding. You're collaborative, building relationships, humbly offering support and openly welcoming approaches. Innovative and creative, you proactively explore new ideas and adapt quickly to change.
#LI-SF1
What We Offer
The pay range below is for all roles at this level across all US locations and functions. Pay within this range varies by work location and may also depend on job-related knowledge, skills, and experience.
We also offer incentive opportunities that reward employees based on individual and company performance. This is in addition to our diverse package of benefits centered around the wellbeing of our employees and their loved ones. In addition to the usual Medical/Dental/Vision/401k, our inclusive rewards plan empowers our people to care for their whole selves. An investment in your future is an investment in ours.
- Give Back: With a charitable giving match and frequent opportunities to get involved, we take an active role in supporting the community.
- Enjoy Time Away: You'll start with 4+ weeks of paid time off a year, plus holidays and sick leave, to rest and recharge.
- Care for Family: Whatever family means to you, we want to support you along the way, including a stipend for fertility care or adoption, medical travel support, and virtual vet care for your fur babies.
- Prioritize Emotional Wellness: With on-demand apps and free confidential therapy sessions, you'll have support no matter where you are.
- Stay Fit: Eating well and being active are important parts of a healthy life. Our onsite Café and gym, plus virtual classes, make it easier.
- Embrace Flexibility: Benefits are best when you have the space to use them. That's why we facilitate a flexible environment so you can find the right balance for you.
Base Pay Range: $246,000–$430,000 USD
Equal Opportunity Employment Policy
Samsung Semiconductor takes pride in being an equal opportunity workplace dedicated to fostering an environment where all individuals feel valued and empowered to excel, regardless of race, religion, color, age, disability, sex, gender identity, sexual orientation, ancestry, genetic information, marital status, national origin, political affiliation, or veteran status. When selecting team members, we prioritize talent and qualities such as humility, kindness, and dedication.
We extend comprehensive accommodations throughout our recruiting processes for candidates with disabilities, long-term conditions, neurodivergent individuals, or those requiring pregnancy-related support. All candidates scheduled for an interview will receive guidance on requesting accommodations.
Our Commitment to Innovation and Fairness
At Samsung Semiconductor, we use Artificial Intelligence (AI) tools in the recruitment process to enhance efficiency. However, AI is used as a support tool, not a final decision-maker. All hiring decisions are made by our human recruiting team and hiring managers to ensure every candidate is evaluated fairly and holistically.
Recruiting Agency Policy
We do not accept unsolicited resumes. Only authorized recruitment agencies that have a current and valid agreement with Samsung Semiconductor, Inc. are permitted to submit resumes for any job openings.
Applicant AI Use Policy
At Samsung Semiconductor, we support innovation and technology. However, to ensure a fair and authentic assessment, we ask that candidates rely on their own knowledge and skills throughout the process. AI tools may be used for basic preparation, grammar, and research, but should not be used to generate or assist with submitted content or live interview responses. If we determine that AI is being used outside these guidelines, we reserve the right to pause or end the interview, and your candidacy may be disqualified.
Trade Secret Notice
By submitting an application, you agree not to disclose to Samsung, or encourage Samsung to use, any confidential or proprietary information (including trade secrets) belonging to a current or former employer or other entity.
Applicant Privacy Policy: https://semiconductor.samsung.com/about-us/careers/us/privacy/
Technology
New offer

Samsung
Senior Staff Engineer, AI Software
Senior
On-site
San Jose, CA
🏢 Summary: Senior AI infrastructure role focused on co-designing hardware and software solutions for AI/ML inference workloads, optimizing LLM and agentic AI performance, and addressing memory bottlenecks in scalable computing platforms. The position involves collaboration across hardware and software teams, development of high-performance inference solutions, and technical leadership in AI system architecture. 🗂️ Requirements: Bachelor's degree with 15+ years, Master's degree with 13+ years, or PhD with 10+ years of industry experience, Experience developing high-performance AI framework software for GPUs or accelerators, Understanding of AI infrastructure and full AI software stack, Knowledge of LLM architectures and transformer-based models, Understanding of agentic AI architectures and workflows, Hands-on experience with PyTorch, Experience with vLLM for model inference and serving, Knowledge of memory wall challenges and AI system performance, Understanding of HBM and memory-centric compute architectures, Experience working in Linux environments, Proficiency with GitHub and Jira 📃 Skills: Python, PyTorch, vLLM, LLM, HBM, Linux, GitHub, Jira, GPU, Transformers, AI, ML 🏢 Description: Advancing the World's Technology Together The AGI (Artificial General Intelligence) Computing Lab is dedicated to solving the complex system-level challenges posed by the growing demands of future AI/ML workloads. The team designs and develops scalable platforms that can effectively handle computational and memory requirements while minimizing energy consumption and maximizing performance. The role involves close collaboration with hardware and software engineers to address AI/ML workload challenges and explore new computing abstractions that balance hardware and software components. Location: Daily onsite presence at the San Jose, CA office in alignment with the Flexible Work policy. 
What You'll Do - Lead the co-design of software and hardware solutions that optimize AI model inference performance, with a focus on overcoming memory bottlenecks. - Analyze and optimize LLM and agentic AI workloads across the full software stack, identifying opportunities for hardware-aware acceleration. - Profile and characterize model execution to expose memory wall limitations and guide architectural decisions for HBM and memory-centric compute. - Collaborate with hardware teams to influence memory architecture, acceleration strategies, and compute placement based on real workload behavior. - Develop, optimize, and benchmark inference and serving solutions using frameworks such as PyTorch and vLLM. - Define best practices and provide technical mentorship across software–hardware co-design efforts. What You Bring - Bachelor's with 15+ years, or Master's with 13+ years, or PhD with 10+ years of industry experience. - Strong experience developing high-performance AI framework software for GPUs or other accelerators. - Strong, end-to-end understanding of the AI infrastructure and AI software stack, from model definition through deployment and serving. - Solid understanding of LLM architectures and workflows, including modern transformer-based designs. - Solid understanding of agentic AI architecture and workflows. - Hands-on expertise with the PyTorch framework. - Practical experience with vLLM for high-throughput model inference and serving. - Solid understanding of the memory wall problem and its impact on AI system performance. - Strong knowledge of memory architecture, including High Bandwidth Memory (HBM), and familiarity with memory-centric acceleration and compute approaches. - Proficiency working in a Linux development environment. - Solid command of development tooling, including agentic coding, GitHub, and Jira. What We Offer - Competitive base pay range of $189,000–$301,000 USD.
- Incentive opportunities based on individual and company performance. - Medical, Dental, Vision, and 401(k) benefits. - 4+ weeks of paid time off, holidays, and sick leave. - Family support benefits including fertility care, adoption support, medical travel support, and virtual vet care. - Emotional wellness support with confidential therapy sessions and wellness apps. - Onsite café, gym, and virtual fitness classes. - Flexible work environment and charitable giving opportunities.
Technology
New offer

Samsung
Senior Performance Engineer
Senior
On-site
San Jose, CA
🏢 Summary: Senior LLM Systems Performance Engineer role focused on building and analyzing large-scale AI environments, optimizing LLM workloads, and driving hardware–software co-design for next-generation AI platforms. The position involves performance characterization across compute, memory, networking, and accelerator systems using modern AI frameworks and NVIDIA GPU platforms. Daily onsite presence in San Jose, CA is required. 🗂️ Requirements: MS or PhD in Computer Science, Computer Engineering, Electrical Engineering, or related field, BS with 5+ years of experience in performance engineering, AI systems, distributed systems, or HPC, Strong understanding of LLM inference and training systems, Strong understanding of NVIDIA GPU architecture and performance characteristics, Hands-on experience profiling and optimizing AI workloads on NVIDIA GPU platforms, Experience analyzing large-scale distributed AI workloads, Proficiency in Python, Proficiency in C++, Experience with modern AI frameworks or serving systems, Strong analytical and problem-solving skills, Ability to work onsite in San Jose, CA 📃 Skills: Python, C++, PyTorch, vLLM, SGLang, TensorRT-LLM, DeepSpeed, Ray, Megatron-LM, NVIDIA, Nsight, GPU, LLM, HPC 🏢 Description: Please Note: To provide the best candidate experience amidst our high application volumes, each candidate is limited to 10 applications across all open jobs within a 6-month period. Advancing the World's Technology Together Our technology solutions power the tools you use every day—including smartphones, electric vehicles, hyperscale data centers, IoT devices, and more. Here, you'll have an opportunity to be part of a global leader whose innovative designs are pushing the boundaries of what's possible and powering the future. We believe innovation and growth are driven by an inclusive culture and a diverse workforce. We're dedicated to empowering people to be their true selves. 
The AGI (Artificial General Intelligence) Computing Lab is dedicated to solving the complex system-level challenges posed by the growing demands of future AI/ML workloads. The team designs and develops scalable platforms that can effectively handle the computational and memory requirements of these workloads while minimizing energy consumption and maximizing performance. The lab collaborates closely with hardware and software engineers to identify and address the unique challenges posed by AI/ML workloads and to explore new computing abstractions that can provide a better balance between the hardware and software components of systems.
This role is offered by the STG group within the AGI Lab as part of DSRA. The team works at the intersection of large language models, accelerator hardware, and high-performance software. The mission is to design, prototype, and optimize next-generation AI systems through tight hardware–software co-design.
We are seeking a Senior LLM Systems Performance Engineer to build representative AI environments, characterize emerging workloads, and drive performance analysis for next-generation AI platforms. In this role, you will set up and operate realistic LLM serving and agentic AI environments, collect workload traces and performance data, and develop methodologies to characterize workload behavior.
Location: Daily onsite presence at the San Jose, CA office in alignment with the Flexible Work policy.
What You'll Do
- Build and operate representative AI environments, including agentic workflows, distributed inference systems, disaggregated serving architectures, and MoE deployments.
- Collect workload traces, telemetry, and performance data from real-world AI applications.
- Characterize workload behavior, develop representative benchmarks, and identify performance bottlenecks across compute, memory, communication, and scheduling resources.
- Evaluate AI systems across the full hardware and software stack.
- Analyze the impact of runtime, memory hierarchy, interconnect, and accelerator architecture on application performance.
- Collaborate with hardware and software teams to drive performance analysis, architecture exploration, and hardware–software co-design for next-generation AI platforms.
What You Bring
- MS or PhD in Computer Science, Computer Engineering, Electrical Engineering, or a related field.
- BS with 5+ years of experience in performance engineering, AI systems, distributed systems, high-performance computing, or related areas.
- Strong understanding of LLM inference and training systems.
- Strong understanding of NVIDIA GPU architecture and performance characteristics.
- Hands-on experience profiling and optimizing AI workloads on NVIDIA GPU platforms using Nsight Systems, Nsight Compute, and related frameworks.
- Experience analyzing performance of large-scale distributed AI workloads.
- Proficiency in Python and C++.
- Experience with AI frameworks or serving systems such as PyTorch, vLLM, SGLang, TensorRT-LLM, DeepSpeed, Ray, or Megatron-LM.
- Strong analytical and problem-solving skills.
What We Offer
- Competitive base pay range: $138,000 - $206,000 USD.
- Incentive opportunities based on individual and company performance.
- Medical, Dental, Vision, and 401(k) benefits.
- 4+ weeks of paid time off plus holidays and sick leave.
- Family support benefits including fertility care, adoption assistance, and medical travel support.
- Emotional wellness support including confidential therapy sessions.
- Onsite café and gym with additional virtual wellness classes.
- Flexible work environment.
Equal Opportunity Employment Policy
Samsung Semiconductor is committed to fostering an inclusive and equal opportunity workplace.
Our Commitment to Innovation and Fairness
AI tools may be used to support the recruitment process, but hiring decisions are made by human recruiters and hiring managers.
Applicant AI Use Policy
Generative AI tools may not be used to misrepresent candidate qualifications during the application or interview process.
Trade Secret Notice
By submitting an application, candidates agree not to disclose confidential or proprietary information belonging to current or former employers.
Technology
New offer

Samsung
Principal Engineer, CPU Architecture & Performance Research
Senior
On-site
San Jose, CA
240,000 - 249,996 USD/yr
🏢 Summary: Principal-level role focused on researching and optimizing next-generation CPU microarchitectures, including RISC-V cores, through performance analysis, simulation, and workload characterization. The position involves cross-functional collaboration with architecture, compiler, OS, and design teams to improve system performance and efficiency across real-world workloads. Candidates will lead technical initiatives, mentor engineers, and contribute to patents and publications.
🗂️ Requirements:
- Master’s degree with 18+ years in Computer Engineering, Computer Science, or related field, or PhD with 15+ years
- 10+ years of CPU microarchitecture or performance engineering experience
- Experience with RISC-V, ARM, or X86 architectures
- Knowledge of out-of-order execution, branch prediction, pipelines, and speculation
- Knowledge of cache coherence, memory systems, prefetching, and NUMA
- Experience with architectural simulators
- Programming skills in C/C++ and Python
- Experience with compiler optimizations and hardware/software co-design
- Experience analyzing large performance datasets and traces
- Ability to influence architecture decisions through quantitative analysis
📃 Skills: RISC-V, ARM, X86, gem5, C, C++, Python, SIMD, VME, NUMA, RTL, SPEC
🏢 Description:
Advancing the World's Technology Together
Our technology solutions power the tools you use every day, including smartphones, electric vehicles, hyperscale data centers, IoT devices, and more. This opportunity offers the chance to contribute to next-generation CPU technologies and performance optimization.
What You'll Do
Architecture Research Lab is seeking a Principal CPU Architecture & Performance Engineer to lead the definition, analysis, and optimization of next-generation CPU microarchitectures (RISC-V core). This role focuses on end-to-end performance, including architectural trade-offs, workload characterization, micro-architectural modeling, simulation, and silicon bring-up correlation. You will work closely with architecture, design, compiler, and system teams to drive performance and efficiency across a broad set of real-world workloads.
Location: Daily onsite presence at the San Jose office in alignment with the Flexible Work policy.
Responsibilities
- Define and evaluate CPU micro-architectural features for future cores (frontend, execution engine, memory hierarchy, interconnect)
- Lead performance analysis using simulators and RTL
- Help develop and validate performance models (cycle-accurate, trace-driven, statistical)
- Characterize workloads (SPEC, server, client, AI/ML, cloud, internal traces) and translate findings into architectural requirements
- Identify performance bottlenecks and propose data-driven optimizations
- Drive architecture-to-implementation alignment with design team
- Collaborate with compiler, OS, and system architects on cross-stack performance issues
- Mentor senior and staff engineers and provide technical leadership across projects
- Contribute to patents and publications
What You Bring
- Master’s degree with 18+ years of experience in Computer Engineering, Computer Science, or related field, or PhD with 15+ years preferred
- 10+ years of experience in CPU microarchitecture and/or performance engineering
- Experience with RISC-V, ARM, or X86 architectures
- Strong understanding of out-of-order execution, branch prediction, pipelines, speculation, cache coherence, memory systems, prefetching, and NUMA effects
- Hands-on experience with architectural simulators such as gem5
- Strong programming skills in C/C++ and Python
- Familiarity with compiler optimizations and hardware/software co-design
- Familiarity with SIMD, vectors, and VME for AI inference workloads
- Experience analyzing large performance datasets and traces
- Proven ability through tapeouts, patents, or publications to influence architecture and microarchitectural decisions through quantitative analysis
Preferred Qualifications
- Background in power/performance/area (PPA) trade-off analysis
- Experience with SIMD and vector technologies
- Prior technical leadership at Senior Staff or Principal level
Benefits
- Competitive compensation and incentive opportunities
- Medical, dental, vision, and 401(k)
- Charitable giving match and community involvement opportunities
- 4+ weeks of paid time off plus holidays and sick leave
- Family support benefits including fertility, adoption, and medical travel support
- Emotional wellness support including therapy sessions and wellness apps
- Onsite café, gym, and virtual fitness classes
- Flexible work environment
Base Pay Range: $219,000 - $351,000 USD