New offer - be the first one to apply!
September 3, 2025
Senior • Hybrid • On-site • Remote
$160,000 - $253,000/yr
Santa Clara, CA
Hardware Infrastructure is seeking a Technical Program Manager to lead capacity programs that enable us to scale how internal compute is managed and allocated across NVIDIA's Infrastructure. The infrastructure we build and operate enables NVIDIA's most advanced AI researchers and EDA engineers to create the future of computing - this growing demand for compute resources requires us to rethink and shape how resources are distributed and managed. This is a fast paced and evolving landscape that requires a TPM to identify bottlenecks and lead engineering teams to deliver high quality outcomes that focus on speed, performance and resource efficiency. They will partner both internally within Hardware Infrastructure and externally with customer and partner teams. They will also develop and standardize planning, reporting and execution methodologies and metrics to enable meeting the challenging objectives.
What You'll Be Doing:
Work across multiple internal customer teams to identify gaps and challenges in capacity allocation - these inputs play a key role in shaping the capacity tooling roadmap
Nurture a culture of continuous improvement, finding new opportunities across tooling, automation and processes to scale overall capacity management
Take lead in defining strategies that will help increase the efficiency and utilization of resources across internal clusters to minimize capacity waste
Guide a diverse set of engineering efforts in an agile program methodology across planning, prioritization, design, dependency management, implementation and execution.
Bring a data-first approach to programs (metrics, OKRs, KPIs) to measure program success and for identifying areas of improvement
Create effective communication channels to provide varying audience levels insights into program status, risks and opportunities.
Act as an effective technical and non-technical liaison between developers, customers and partners to drive organization alignment across a multi-functional matrixed set of leads
What We Need To See:
B.S. (or equivalent experience) in Computer Science or a related technical field
10+ years of experience across software engineering and/or technical program management roles with demonstrated expertise and mastery of technical and management practices
Prior experience developing process and programs focused on the allocation and management of infrastructure resources that span a diverse and large portfolio ($billions)
Prior experience leading programs that span across multiple teams and engineers (100+)
Experience handling large scale HPC and/or AI Infrastructure deployments that stretch across hardware and software
Exceptional communication and presentation skills for diverse technical and non-technical audiences
Strong multitasking abilities with a focus on thoroughness and rapid context switching
Knowledge of agile methodologies and the best in class project management tools
Proactive and enthusiastic in identifying and implementing positive changes in software engineering and release management within a fast-paced environment
Ways To Stand Out From The Crowd:
Prior experience bringing up new datacenter capacity across cloud service providers and on-premise locations
Prior experience in working with AI researchers and/or EDA developers
Software development, release and support methodology and devops
NVIDIA offers highly competitive salaries and a comprehensive benefits package. We have some of the most forward-thinking and hardworking people in the world on our team and our collaborative talent continues to drive NVIDIA's growth. We are seeking creative and independent engineers with real passion for technology!
Your base salary will be determined based on your location, experience, and the pay of employees in similar positions. The base salary range is 160,000 USD - 253,000 USD.You will also be eligible for equity and benefits.