NVIDIA’s invention of the GPU in 1999 sparked the growth of the PC gaming market, redefined modern computer graphics, and revolutionized parallel computing. More recently, GPU deep learning ignited modern deep learning — the next era of computing — with the GPU acting as the brain of computers, robots, and self-driving cars that can perceive and understand the world. Today, we are increasingly known as “the AI computing company.” We're looking to grow our company and establish teams with the most thoughtful people in the world.
We are looking for an excellent engineering manager to own and deliver an end to end manageability stack for Data Center Systems. We are seeking an experienced manager who is deeply technical, hands-on, and has a wide system view. You will manage a team of experts, design & build OpenBMC based manageability software stack for NVIDIA’s next generation Data Center Compute Systems. We want to grow our teams with the smartest people in the world. If you're creative and autonomous, we want to hear from you!
What you’ll be doing:
- Own and deliver OpenBMC based manageability stack for next generation Data Center Compute Systems.
- Own firmware delivered to data centers in terms of quality, reliability and telemetry performance.
- Manage and lead a distributed team of software engineers to deliver firmware stack with high quality.
- Work with data center architects and cloud customers for correct requirements and scope implementation to ensure speed of light product development.
- Work closely with cross functional teams to ensure scalable manageability architecture for all data centers products
- Drive efficiency, reliability and optimization in firmware architecture from a data center view point.
- Work closely with customers and internal teams to resolve issues at Speed of Light.
What we need to see:
- BS, MS, or PhD in EE/CS or related field of education or equivalent experience.
- 10+ overall years of relevant experience working on server firmware (BMC) and platform software development
- 5+ years of experience in managing a software/firmware engineering team
- Hands on experience with data center health management workflow. Proven record of delivering server firmware for large data centers.
- Strong knowledge of data center management, server architecture and server manageability in data centers.
- Strong and demonstrable skill in C/C++ and Python. Experience programming and debugging skills for server platforms.
- Experience in SCM (e.g. Git, Perforce) and project management tools like Jira.
- Possess excellent written and oral communication skills, good work ethics, high sense of team-work, love to produce quality work. Self-starter who loves to find creative solutions to complicated problems
Ways to stand out from the crowd:
- Hands on experience with BMC firmware/software stack for data center health management and server manageability.
- Proven engineering managers driving large complex problem with 25+ engineers working
NVIDIA is widely considered to be one of the technology world’s most desirable employers. We have some of the most forward-thinking and hardworking people on the planet working for us. If you're creative and autonomous, we want to hear from you!
Your base salary will be determined based on your location, experience, and the pay of employees in similar positions. The base salary range is 224,000 USD - 356,500 USD for Level 3, and 272,000 USD - 425,500 USD for Level 4.
You will also be eligible for equity and benefits.
Applications for this job will be accepted at least until October 16, 2025.
NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.