Taro Logo

Senior Distributed Systems Engineer, AI Infrastructure

NVIDIA is the world leader in accelerated computing, pioneering solutions in AI and digital twins.
Senior Software Engineer
In-Person
5+ years of experience
AI · Automotive
This job posting may no longer be active. You may be interested in these related jobs instead:

Description For Senior Distributed Systems Engineer, AI Infrastructure

NVIDIA is seeking a Senior Distributed Systems Engineer to lead the development of their exa-scale AI infrastructure for Autonomous Vehicles. This role combines cutting-edge distributed systems work with AI applications, focusing on building the foundation for autonomous driving technology. The position requires expertise in cloud technologies, distributed storage & compute systems, and strong technical leadership skills.

The role involves architecting and developing scalable services that power AI infrastructure for deep learning platforms, handling petabyte-scale datasets, and designing next-generation dataset management services. You'll work at the intersection of distributed systems and AI, enabling smart data selection capabilities crucial for machine learning success.

As a technical leader, you'll collaborate with multiple AI teams, contributing to the platform's architecture while ensuring it meets current and future requirements. The position offers the opportunity to work on one of technology's most ambitious challenges - autonomous vehicles - with potential applications in medical imaging, data science, and genomics.

NVIDIA offers highly competitive compensation and is renowned as one of the technology industry's most desirable employers. You'll join forward-thinking teams working on state-of-the-art fields including Deep Learning, Artificial Intelligence, and Autonomous Vehicles. The role provides an excellent opportunity to impact critical projects while working with cutting-edge technology in a collaborative environment.

The ideal candidate combines strong programming skills with distributed systems expertise, security knowledge, and technical leadership experience. This position offers the chance to shape the future of AI infrastructure while working with some of the industry's most advanced technologies and talented professionals.

Last updated 8 months ago

Responsibilities For Senior Distributed Systems Engineer, AI Infrastructure

  • Architect and build scalable and distributed services for AI infrastructure
  • Design and build infrastructure for PB sized deep learning datasets
  • Design next generation dataset management services
  • Enable smart data selection for machine learning
  • Collaborate with AI teams to understand requirements
  • Be a technical leader on platform projects
  • Support platform users

Requirements For Senior Distributed Systems Engineer, AI Infrastructure

Go
Java
Python
Scala
Kubernetes
  • BS, MS, or PhD in Computer Architecture, Computer Science, Electrical Engineering or related field
  • 5+ years of experience in distributed systems development and design
  • Strong programming background in data structures, design patterns, OOP, and TDD
  • Experience with distributed computing and storage systems
  • Knowledge of authentication and authorization technologies
  • Advanced programming skills in distributed systems and microservices
  • Specialist programmer in Go, Java or C/C++
  • Strong interpersonal skills and ability to work with cross-functional teams
  • Track record of successful technical leadership

Interested in this job?