New offer - be the first one to apply!
August 5, 2025
Senior • On-site
$168,000 - $264,500/yr
Santa Clara, CA , +3
NVIDIA is looking for an engineer who thrives at the intersection of innovative technology and real-world customer applications to join our team of Solution Engineers supporting the NVIDIA AI Enterprise product line. In this role, you will work directly with customers to deliver solutions on the latest NVIDIA hardware including the GB200. We are looking for an experienced engineer to triage customers' datacenter-scale AI/ML workloads, resolve customer issues, and contribute to products and support software. You will solve complex issues related to AI training and inference at scale, drive multi-functional engineering alignment, and lead multiple strategic projects.
We’re looking for a hands-on leader with the drive to make an impact. Someone who brings creative solutions, improves processes, mentors early-career engineers, and takes ownership of complex projects.
What you'll be doing
Directly support NVIDIA’s Enterprise customers and work to answer questions, reproduce, resolve, or advance customer issues.
Collaborate with multiple engineering teams on customer issues to drive root-cause analysis and deliver real solutions to customers.
Contribute to product and support improvements through code, debugging, design feedback, or tools that accelerate issue resolution.
Own and drive customer issues from inception to resolution.
Document customer interactions to enrich our knowledge base and improve support effectiveness.
Mentor early-career engineers on their journey to become successful Solution Engineers
Define new standard methodologies, streamline cross-team collaboration, and take ownership of leading both projects and people.
Occasional work on weekends and holidays to support customers
What we need to see
Minimum of a BS in Computer Science, Electrical Engineering, or equivalent experience.
At least 8 years of engineering experience with a proven track record in AI/ML-focused projects or enterprise-grade solutions, including at least 5 years leading engineering teams
Deep understanding of Linux and the ability to analyze, optimize, and customize Linux environments for AI/ML workloads.
Strong expertise in AI/ML training and inference, with experience deploying models at scale and applying them to real-world use cases such as chatbots, RAG pipelines, and vector search.
Experience with common deep learning frameworks such as PyTorch or TensorFlow
Exceptional communication skills with the ability to tailor technical depth to any audience, and remain calm and focused under pressure.
Exceptionally organized and execution-focused, with a strong sense of follow-through and a genuine passion for solving complex problems.
Proficient in Python and C/C++ programming, with experience contributing to large existing projects as well as building new tooling
Experience building and deploying containerized solutions using Docker, Kubernetes, or Slurm.
Ways to stand out from the crowd
Background with parallel programming or GPU acceleration (e.g., CUDA)
Experience developing in GPU accelerated / cloud / virtualized environments
Experience analyzing software performance of distributed workloads
Clustering or HPC data center technologies including Upper Layer Protocols (NCCL, MPI)
You will also be eligible for equity and benefits.