New offer - be the first one to apply!
September 20, 2025
Senior • Hybrid • On-site • Remote
$148,000 - $235,750/yr
Santa Clara, CA
NVIDIA is looking for an expert software engineer to help us expand our catalog of Device eXtension (Dx) APIs for our math libraries. For over a decade, NVIDIA's accelerated computing platform has revolutionized HPC and AI with applications ranging from COVID-19 research to autonomous machines. Did you know that our team develops the GPU-accelerated mathematical libraries that make all of this possible?
The hardware and software accelerated computing ecosystem is constantly evolving, including shifts towards hybrid backends, deep integration with high-level languages and ecosystems (such as Python, Numpy, JAX, MLIR…), and optimization at runtime for maximum flexibility and performance. Our Dx APIs allow developers to embed highly-optimized mathematical operations in their applications for these and other scenarios. You will be part of a team designing, developing, and optimizing math libraries for the future. If you are passionate about designing modern HPC libraries and want to build software that will stand the test-of-time as it accelerates countless applications, we might have the dream job you have been waiting for!
What you'll be doing:
Design modern, flexible, and easy to use APIs for math libraries and lead design reviews with all collaborators.
Work closely with internal (e.g., engineering, Product Management) and external partners such as researchers to understand their use cases and requirements.
Become a domain expert by continuously surveying current trends in software systems.
What we need to see:
PhD or MSc degree in Computer Science, Applied Math, or a related science or engineering field is preferred (or equivalent experience).
3+ years of experience designing and developing software for high-performance computing and/or AI applications.
Advanced C++ skills, including modern design paradigms (e.g., template meta-programming, RAII).
Parallel programming experience with CUDA or OpenCL.
Strong collaboration, communication, and documentation habits.
Ways to stand out from the crowd:
Experience using graph compilers and/or Just In Time compilation workflows (e.g. XLA, LLVM, MLIR, Numba, NVRTC).
Programming skills with Python, and modern automation setups for both building software (e.g. cmake) as well as testing it (e.g. CI/CD, sanitizers).
Experience with CCCL, OpenMP, OpenACC, multi-threading, MPI, PGAS.
Strong background in numerical methods (e.g., FFT, numerical linear algebra).
Experience with scientific and deep learning libraries and frameworks such as PyTorch, JAX, MKL, MAGMA, PETSc, Kokkos, etc.
With competitive salaries and a generous benefits package, we are widely considered to be one of the technology world’s most desirable employers. We have some of the most forward-thinking and hardworking people in the world working for us and, due to unprecedented growth, our exclusive engineering teams are rapidly growing. If you're a creative and autonomous engineer with a real passion for technology, we want to hear from you!
Your base salary will be determined based on your location, experience, and the pay of employees in similar positions. The base salary range is 148,000 USD - 235,750 USD for Level 3, and 184,000 USD - 287,500 USD for Level 4.You will also be eligible for equity and benefits.