Description

NVIDIA is looking for a Principal architect to work on a scalable hybrid cloud system used for infrastructure services across multiple teams at NVIDIA. As a team we work with various groups within NVIDIA such as Graphics Processors, Mobile Processors, Deep Learning, Artificial Intelligence and Autonomous Vehicles to cater to their various infrastructure needs. These cloud services will be scaled to run on thousands of servers and running millions of automated jobs per day helping with the efficiency of thousands of NVIDIA’s software engineers worldwide. As part of these services, we host heterogeneous mix of machines with various operating systems (Windows/Linux/Android), multitude of hardware platforms (x86/ARM) having both NVIDIA GPUs and Tegra Processors.

Are you passionate about infrastructure and looking for complicated problems, ready to build the next generation of cloud services, craft innovative solutions with Kubernetes across on-premises and public cloud platforms we are excited to have you on board!

What you’ll be doing:

  • Craft creative scalable cloud solutions to scale to millions of jobs and thousands of systems.

  • Challenging problems in area of infrastructure such as NIMs (NVIDIA Inference Microservice), Kubernetes, job scheduling, resource management and automated recovery.

  • Build observability solutions to measure and improve the availability, reliability and latency of the systems.

  • Work with customers to understand their needs from the system and come up with innovative solutions.

What we need to see:

  • Experience in architecting scalable cloud infrastructure solutions from concept to production.

  • Expertise in Kubernetes.

  • Strong object-oriented programming background, Java or Go preferred.

  • Ability to collaborate across multiple team and across people working in different time zones.

  • Bachelors degree or equivalent experience.

  • Strong software/hardware engineering background

  • 12+ years of experience in infrastructure.

Ways to stand out from the crowd:

  • Experience in design, implementation and deployment of major infrastructure features.

  • Experience with AI/ML, Data Analytics and application of them in Infrastructure.

  • Experience in creating and scaling service for large scale and multiple Kubernetes clusters.

  • Ability to design robust distributed systems for heterogeneous platforms.

Come and work with us at NVIDIA where we have most resourceful and talented people in the world to advance Artificial Intelligence. If you interested in infrastructure, we love to hear from you!

The base salary range is 272,000 USD – 419,750 USD. Your base salary will be determined based on your location, experience, and the pay of employees in similar positions.

You will also be eligible for equity and benefits (https://www.nvidia.com/en-us/benefits/) . NVIDIA accepts applications on an ongoing basis.

NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.