New offer - be the first one to apply!
April 25, 2025
Senior • Hybrid • On-site • Remote
$148,000 - $287,500/yr
Santa Clara, CA , +2
We are looking for a Senior Build and Release Engineer to join our Cloud Engineering Services team! In this highly cross-functional role, you will work with product engineering teams, as well as the infrastructure and security teams, to bring up and maintain foundational cloud services, cloud infrastructure, release processes, developer tools, and workflow automation. Ideal candidate will not only have experience improving existing systems and tooling but also selecting technologies and standing up new systems.
What you'll be doing:
As the first Release Engineer on the team, you will serve as the primary point of contact for all release engineering activities! In this role, we develop sophisticated tooling to automate the build and deployment processes for microservices and cloud infrastructure. We will be responsible for identifying and integrating new technologies that improve build and release efficiency, seamlessly incorporating them into CI/CD workflows. Collaboration with product engineering teams will be essential as you architect solutions tailored to specific project requirements. You will proactively seek opportunities to accelerate development velocity by automating common development tasks. Additionally, you will design systems with a focus on high reliability, redundancy, fault tolerance, and security. Continuous monitoring of the infrastructure will be a key part of your responsibilities, ensuring timely alerts on significant events and maintaining the highest levels of system performance and reliability.
What we need to see:
Bachelor’s or Master’s degree in Computer Science, Engineering, or related field (or equivalent experience).
5+ years of experience in release engineering focused on deploying microservices and infrastructure in cloud environments.
5+ years of experience with programming in Python or similar languages.
Strong experience with cloud infrastructure platforms such as AWS.
High proficiency in infrastructure as code (IaC) and configuration management tools such as Terraform.
Expertise in administering, operating, and configuring Kubernetes and Envoy.
Demonstrated experience with Continuous Integration/Continuous Delivery (CI/CD) tools, including GitLab, Flux CD, and implementing GitOps-based deployment models
Proficiency in monitoring tools such as Prometheus, Grafana, Cloudwatch, and Thanos.
Experienced in working with Linux-based operating systems, including system administration and troubleshooting a wide range of issues.
Ways to Stand Out from the Crowd:
Expertise in administering and operating Kubernetes clusters and Envoy Ingress Gateways.
Experience with blue/green, canary, or progressive delivery strategies.
Background in site reliability engineering or platform engineering is a plus.
You will also be eligible for equity and benefits. NVIDIA accepts applications on an ongoing basis.