New offer - be the first one to apply!
September 3, 2025
Senior • Hybrid • On-site • Remote
$148,000 - $235,750/yr
Santa Clara, CA
Do you want to be part of the team that brings Artificial Intelligence (AI) technology to the field? We are looking for a Solution Architect (SA) or Data Scientist to join the Applied AI SA Segment team. We specialize on the newest technology and advances in deep learning, Generative AI, and Cloud. The vision of the AI Segment team is to use our deep expertise to guide and enable the successful adoption at data center scale of NVIDIA AI Enterprise Software!
If you are passionate about AI and how it can be applied to solve real-world problems, we should talk. NVIDIA is the world leader in GPU accelerated computing and AI, and is looking for developers like you to design and build enterprise AI solutions using our newest technology. As a member of the NVAIE Segment Solution Architecture team, you will work closely with customers and partners to tackle hard problems in customizing and deploying AI workloads in production at scale.
What you’ll be doing:
A huge part of our work involves developing end-to-end AI solutions for enterprise use cases. We help customers adopt NVIDIA AI SDKs and APIs by offering deep technical expertise and designing GPU-accelerated pipelines that optimize compute resource utilization and improve workload performance.
We solve customer problems by building solutions using deep learning technology including language and multimodal models, information retrieval, domain customization, reinforcement learning, reasoning, inferencing, agentic systems, and other sophisticated AI workloads.
As we work with customers across multiple industries, we build the reference architectures needed to deploy and optimize workloads at large scale. With this knowledge, we help improve NVIDIA products and build creative solutions to overcome scaling challenges.
We contribute to the wider organization and community by sharing our expert knowledge with others. This can vary from product engineering contributions to building and delivering hands-on training.
Above all, you will be part of the team that helps bring NVIDIA technology to life in the Enterprise! We empower you and give you the tools to achieve this with the backing of all of NVIDIA, including other Solution Architects, Product, Engineering and Research teams. You’ll get to be the face and trusted expert advisor that our customers and partners rely on.
What we need to see:
Strong foundational expertise, from a BS, MS, or Ph.D. degree in Engineering, Mathematics, Physics, Computer Science, Data Science, or similar (or equivalent experience).
5+ years experience using deep learning frameworks and libraries such as PyTorch, Tensorflow/Keras, Hugging Face Transformers, Megatron-LM, and DeepSpeed.
Expertise running deep learning jobs on GPUs using SLURM and Kubernetes.
Demonstrated coding and debugging skills, including 5+ years experience with Python and Linux.
Hands-on experience with customizing AI models, including distillation, pre-training, supervised finetuning, reinforcement learning, reasoning, evaluation, guard railing, and data curation.
Demonstrated expertise in accuracy and performance profiling and optimization for AI training and inference workloads.
Ability to learn fast and quickly adapt to change.
Clear written and oral communications skills with the ability to effectively collaborate with executives and engineering teams.
Ways to stand out from the crowd:
Background with NVIDIA AI Enterprise software with emphasis on NeMo.
Experience training foundational models
Experience on high-performance NVIDIA GPU computing clusters.
Extensive engineering and customer experience on projects with multiple collaborators.
Show willingness and ability to dig into unfamiliar territories to solve complex problems relying on experience from previous work.
You will also be eligible for equity and benefits.