November 18, 2025

Senior Generative AI Software Engineer

Senior • Hybrid • On-site • Remote

$224,000 - $356,500/yr

Santa Clara, CA

At NVIDIA, we're not just building the future, we're generating it! Our Cosmos generative AI engineering team is pushing the boundaries of what’s possible across multimodal learning, video generation, synthetic data, intelligent simulation, and agentic systems. We are looking for exceptionally driven engineers and applied scientists with deep experience in generative modeling to help define the next era of AI computing.

What you'll be doing:

Own and evolve the Cosmos open-source and internal research codebases, crafting core infrastructure that supports our foundation model research and deployment.
Refactor and modularize large research-driven code into clean, testable, maintainable libraries for use across teams.
Integrate and adapt off-the-shelf models into our pipelines as preprocessors, postprocessors, or evaluation components.
Build model-serving endpoints (e.g., with Gradio or FastAPI) to enable researchers and internal users to experiment with models interactively.
Design, implement, and maintain evaluation pipelines, providing high-quality tooling to the broader team to measure model quality and track improvements.
Improve configuration hygiene and reproducibility using systems like Hydra, and ensure smooth overrides, templates, and environment switching.
Lead efforts in packaging and release of Python modules using modern tools (uv, just, pydantic) for both OSS and internal consumption.
Set the standard for code health, test coverage, and release readiness across the team. Write documentation and automation to scale good practices.

What we need to see:

Expert-level proficiency in Python, with a strong foundation in modular design, abstraction boundaries, and collaborative codebase evolution.
Fluency with PyTorch, including the ability to run, debug, and patch inference-time model behavior in research-level codebases. Comfort modifying pre/post-processors, model wrappers, and checkpoint logic.
Proven experience in refactoring large codebases—cleaning up legacy implementations, eliminating anti-patterns, and paying down tech debt to improve long-term maintainability.
Strong grasp of configuration systems, especially Hydra, with an emphasis on reproducibility, override logic, and environment scoping.
Familiarity with modern Python packaging and tooling such as uv, just, and pydantic, including experience managing environment consistency and shipping libraries as artifacts.
Strong instincts around code health: API design, directory structure, writing unit and integration tests, exception hygiene, docstrings, and dependency isolation.
Comfortable deploying models internally via Gradio or similar frameworks to enable interactive evaluation and feedback from researchers or downstream users.
BS or MS (or equivalent experience) in Computer Science, Software Engineering, or a related technical field, plus 10+ years of industry experience.

Ways to stand out from the crowd:

Proficiency in model configs, especially Hydra! Comfortable crafting hierarchical config systems with reusable templates, environment scoping, and overrides for evaluation, inference, or release.
Prior work cleaning up sophisticated generative model codebases—adding tests, improving wrappers, and instrumenting code for observability and debugging.
Demonstrated success raising engineering quality in a research setting: taking exploratory code and evolving it into a robust, production-friendly module.
Track record of mentoring teammates on software engineering best practices and proactively identifying long-term structural risks in fast-moving teams.
Passion for building ML tooling that is not only functional, but also elegant, intuitive, and maintainable by others.

Your base salary will be determined based on your location, experience, and the pay of employees in similar positions. The base salary range is 224,000 USD - 356,500 USD for Level 5, and 272,000 USD - 425,500 USD for Level 6.

You will also be eligible for equity and benefits.

Applications for this job will be accepted at least until December 23, 2025.

NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.

NVIDIA

Founded in 1993 by Jen-Hsun Huang, Chris Malachowsky, and Curtis Priem, NVIDIA Corporation has carved out a leading position in the technology industry. Based in Santa Clara, California, NVIDIA is renowned for its GeForce series of GPUs, which cater to both gaming and professional applications. The company's graphics processing units are integral to sectors ranging from gaming to machine learning and data centers. As a frontrunner in the semiconductor industry, NVIDIA continues to leverage emerging technologies like AI and machine learning to stay ahead of the curve.