New offer - be the first one to apply!
July 21, 2025
Senior • On-site
$184,000 - $287,500/yr
Santa Clara, CA
NVIDIA is searching for AI/ML Solutions Architect for Hyperscale and cloud providers focus. Primary responsibilities will be to lead AI/ML software customer technical engagement for systems being deployed at vast scale. Working with multiple organizations within NVIDIA as well as at the customer we will ensure a successful and trouble-free deployment. Would you like to partner with a large company to build automation and management to create a robust large scale artificial intelligence infrastructure? Interested in the optimization and characterization of customer specific AI models and pipelines? Then read on!
What you’ll be doing:
As a key technical member of a focused account team, you will serve as the main point of contact for NVIDIA products, enabling internet giants and cloud providers to have an innovative AI/ML software infrastructure.
Work directly with outstanding engineering teams to secure design wins, address challenges, bring solutions to production, and support them throughout their lifecycle.
Become a trusted advisor to your customer by understanding their environment, constraints, and long-term strategy. Translate these insights into product requirements and innovative solutions.
Help your customer enhance the value of NVIDIA technology, and provide feedback to NVIDIA for future product improvements.
Facilitate the resolution of customer issues, offering timely and proactive communications to mitigate risks.
Lead workshops, demos, and proof-of-concepts to showcase NVIDIA’s AI/ML capabilities.
Guide customers on standard processes for scalable AI model deployment and inference optimization.
What we need to see:
Minimum of a BS/MS in Computer Science, Electrical Engineering, or equivalent experience.
8+ years of engineering experience with a proven record in AI/ML-focused projects or enterprise-grade solutions.
Solid understanding of Linux, including fixing, optimization, and customization for AI/ML workloads.
Strong understanding of data science and machine learning infrastructure—software and hardware.
Excellent follow-up and interpersonal skills, with a true passion for problem-solving.
Proficient in Python, with the ability to develop scripts and build custom tools. Experience with parallel programming or GPU acceleration (e.g., CUDA) is helpful.
Ways to stand out from the crowd:
Background with Chatbots, RAG pipelines, vector databases, and distributed training or inference workloads.
Experience or background in HPC (High Performance Computing) environments for AI or ML applications.
Familiarity with multi-node GPU clusters and performance tuning for large-scale AI workloads.
Experience developing in cloud and/or virtualized environments, containerized solutions, with knowledge of Docker, Kubernetes
Deep learning and AI experience with common deep learning frameworks such as PyTorch or TensorFlow.
We make extensive use of conferencing tools, but occasional travel is required for local on-site visit to customers and industry events.
NVIDIA is widely considered to be one of the technology world’s most desirable employers. We have some of the most forward-thinking and hardworking people in the world working for us. If you're creative and autonomous, we want to hear from you!
The base salary range is 184,000 USD - 287,500 USD. Your base salary will be determined based on your location, experience, and the pay of employees in similar positions.You will also be eligible for equity and benefits. NVIDIA accepts applications on an ongoing basis.