September 24, 2025

Senior Machine Learning Engineer

Senior • Hybrid • On-site • Remote

$184,000 - $287,500/yr

Santa Clara, CA, +1

As a Senior Machine Learning Engineer at NVIDIA, you will build the machine learning brain that keeps NVIDIA’s global DGX Cloud healthy, efficient and ready for the next waves of AI breakthroughs. DGX Cloud fuses NVIDIA GPUs, NVLink networking and the full AI software stack into elastic infrastructure powering large language models, drug discovery, autonomous driving and climate science. Your models will turn billions of telemetry signals into predictive insight. This frees customers to innovate while our platform runs smarter.

What you'll be doing:

Break new ground by developing innovative machine learning algorithms and models that propel our AI products.
Build production models for anomaly detection, predictive maintenance and usage optimization.
Develop tools surfacing real-time telemetry, efficiency metrics and long-term trends.
Develop forecasting and simulation models for global-scale planning.
Analyze complex datasets to determine the best approach for model training and optimization.
Translate findings into clear engineering actions with infrastructure, operations and product teams.
Participate in cross-functional projects to integrate machine learning capabilities into various NVIDIA products.

What we need to see:

Master's degree or PhD in Mathematics, Statistics, Machine Learning or related quantitative field (or equivalent experience).
8+ years of experience applying Machine Learning to operational systems.
Proven track record of building and deploying Machine Learning models in production environments.
Experience with time series analysis and optimization algorithms.
Familiarity with distributed systems and cloud platforms such as AWS and Kubernetes.
Strong software engineering skills and proficiency in Python.
Effective verbal and written communication and technical presentation skills.
Experience with machine learning frameworks such as TensorFlow, PyTorch, or similar.
A track record of delivering high-impact projects in a fast-paced environment.

Ways to stand out from the crowd:

Experience solving capacity planning problems.
Deep understanding of GPU performance metrics.
Familiarity with Prometheus and PromQL.

Your base salary will be determined based on your location, experience, and the pay of employees in similar positions. The base salary range is 184,000 USD - 287,500 USD.

You will also be eligible for equity and benefits.

Applications for this job will be accepted at least until November 1, 2025.

NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.

Nvidia

Founded in 1993 by Jen-Hsun Huang, Chris Malachowsky, and Curtis Priem, NVIDIA Corporation has carved out a leading position in the technology industry. Based in Santa Clara, California, NVIDIA is renowned for its GeForce series of GPUs, which cater to both gaming and professional applications. The company's innovative graphics processing units are integral to various sectors, from gaming to machine learning and data centers. As a frontrunner in the semiconductor industry, NVIDIA continues to leverage emerging technologies like AI and machine learning to stay ahead of the curve.