New offer - be the first one to apply!
August 29, 2025
Senior • Hybrid • On-site • Remote
$224,000 - $356,500/yr
Santa Clara, CA
We are seeking a Software Engineering Manager to lead the development for the Dynamo engineering team, NVIDIA’s high-performance, low-latency inference platform for serving generative AI and reasoning workloads at scale. The team accelerates deployment of cutting-edge models across diverse engines and architectures, enabling breakthroughs from real-time LLM serving to complex multi-GPU, multi-node pipelines. Ideal candidate is strong in software development, designing and creating fault-tolerant distributed systems, and has the ability to implement well thought out long term maintenance strategy.
What you'll be doing:
Mentor, grow, and develop the Dynamo engineering team and be responsible for planning and execution of projects and workflows..
Work across several teams and orgs to build platforms that use the latest developments in LLM inferencing. In this role, you will be collaborating with research and development teams and serve a large user base (software teams both internal and external to NVIDIA).
Align priorities across collaborators and define metrics for measuring the success of the product/team.
Stay updated with the latest trends in AI, ML, and infrastructure, proactively seeking opportunities to integrate advancements into NVIDIA's LLM and AI infrastructure solutions.
What we need to see:
Masters or PhD or equivalent experience in Computer Science, computer architecture, or related field.
10+ years of overall experience in developing large distributed systems.
2+ years of experience managing of AI and SW development teams.
Experience in developing and maintaining LLM or GenAI infrastructure
Excellent communication, collaboration and problem-solving skills, with a dedication to encouraging an inclusive and diverse workplace.
Hands-on experience developing large-scale distributed systems
Ways to stand out from the crowd:
Strong technical background in cloud/distributed systems.
Experience working in a globally distributed organization.
Good knowledge of CPU and/or GPU hardware architecture
Background in developing LLM inference systems.
Experience with LLM frameworks like vLLM & TRT-LLM.
NVIDIA is widely considered to be one of the technology world’s most desirable employers. We have some of the most expert and passionate people in the world working for us. Are you creative and autonomous? Do you love a challenge? If so, we want to hear from you. Come help us build the real-time, efficient computing platform driving our success in the multifaceted and quickly growing field Deep Learning and Artificial Intelligence.
#LI-Hybrid
Your base salary will be determined based on your location, experience, and the pay of employees in similar positions. The base salary range is 224,000 USD - 356,500 USD.You will also be eligible for equity and benefits.