New offer - be the first one to apply!

August 18, 2025

Senior Technical Program Manager, Cloud Infrastructure

Senior • Hybrid • On-site • Remote

$160,000 - $253,000/yr

Santa Clara, CA

NVIDIA's deep learning platforms are at the forefront of innovation, profoundly impacting various fields and widely adopted by leading academic institutions, startups, and major Internet companies globally. We're seeking an accomplished and highly skilled Technical Program Manager (TPM) to join our NVIDIA DGX Cloud team. This is a fantastic opportunity for a passionate, creative individual to deliver outstanding value to our DGX Cloud customers. We are specifically looking for a TPM with extensive experience in cloud infrastructure bring-up and relationship management. You'll be instrumental in partnering with companies and engineering teams internally to help build AI capacity and infrastructure across the globe

What you'll be doing:

As a DGX Cloud Technical Program Manager, you'll be a key partner to our Engineering, Infrastructure, Software teams and their leadership, driving critical programs related to AI capacity enablement and management. You'll play a pivotal role in developing and maturing foundational capabilities and processes for DGX Cloud, spanning critical areas such as cluster/capacity bring-up and maintenance. This is a dynamic, fast-paced environment where TPMs are expected to apply fungible skillsets to a range of high-impact programs across DGX Cloud. Your responsibilities will include:

  • Gathering technical requirements, developing comprehensive roadmaps, achieving breakthroughs, and ensuring adherence to our Product Lifecycle (PLC) process.

  • Leveraging Jira and other program management platforms to instill rigor and structure in the management of engineering deliverables.

  • Collaborating cross-functionally internally and externally to understand partner capabilities, build a bridge to NVIDIA reference architectures and drive execution

  • Identifying and driving opportunities to onboard the adoption of third-party and in-house solutions for deployments, support, security, compliance and observability across DGX Cloud

  • Establishing metrics and key performance indicators (KPIs) and quantitatively demonstrating the value and impact delivered by your programs.

  • Proactively identifying, resolving, and mitigating risks and issues that could affect scope, schedule, and quality across all program aspects.

  • Developing and executing a robust communication strategy to ensure organizational visibility on overall program progress and engineering delivery, including presenting regularly to NVIDIA's executive leadership team.

  • Encouraging a culture of continuous improvement, consistently finding opportunities for process improvements within our cloud infrastructure operations.

What we need to see:

  • 10+ years of technical program management experience, specifically driving the planning and execution of large-scale engineering programs, with a strong focus on software engineering projects within a matrixed organization.

  • Extensive hands-on experience in cloud infrastructure, preferably gained from working at a major Cloud Service Provider (CSP) including AI/ML

  • Expert-level proficiency with Jira, Smartsheet, or similar program management tools, with the ability to confidently guide engineering teams on their effective use and execution within an Agile/Scrum framework.

  • Exceptional strategic and tactical thinking abilities, coupled with a strong capacity to build consensus and drive program success

  • Comfort and effectiveness in thriving within ambiguous environments.

  • Possess excellent communication and technical presentation skills, particularly for executive audiences.

  • BS or MS in Electrical Engineering or Computer Science, or equivalent experience.

Ways to stand out from the crowd:

  • Highly motivated with exceptional communication skills, and a proven ability to work successfully with multi-functional teams and coordinate effectively across organizational boundaries and geographies.

  • In depth knowledge of NVIDIA GPU products, including deployment and bring-up

  • Solid understanding of various cloud technologies (Kubernetes, API integration, Terraform, etc)

  • Significant experience with productivity tools and process automation is a major plus.

  • Deep familiarity with cloud-native product / services environments and familiarity with AI, ML infrastructure, and cloud/services

NVIDIA is widely considered to be one of the technology world’s most desirable employers. We have some of the most forward-thinking and hardworking people in the world working for us. If you're creative and autonomous, we want to hear from you!

Your base salary will be determined based on your location, experience, and the pay of employees in similar positions. The base salary range is 160,000 USD - 253,000 USD for Level 4, and 192,000 USD - 304,750 USD for Level 5.

You will also be eligible for equity and benefits.

Applications for this job will be accepted at least until August 21, 2025.

NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.