Taro Logo

Senior Distributed Systems Engineer, AI Infrastructure

NVIDIA is the world leader in accelerated computing, pioneering AI and digital twins technology.
Senior Software Engineer
In-Person
5,000+ Employees
5+ years of experience
AI · Automotive

Job Description

NVIDIA is seeking a Senior Distributed Systems Engineer to lead the development of their exa-scale AI infrastructure and deep learning platform for Autonomous Vehicles. This role sits at the intersection of distributed systems and artificial intelligence, focusing on building scalable solutions for one of technology's most challenging problems. The position requires expertise in cloud technologies, distributed systems, and strong programming skills, with opportunities to work on cutting-edge applications in autonomous vehicles, medical imaging, and genomics.

The role involves architecting and developing scalable distributed services, managing petabyte-sized datasets, and implementing next-generation dataset management solutions. You'll be working with LLMs and AI agents to create intelligent assistants, while collaborating with multiple AI teams to build a future-proof platform. Technical leadership is a key component, as you'll be guiding various projects and contributing to the platform's overall architecture.

NVIDIA, recognized as a leader in accelerated computing, offers a competitive compensation package and the opportunity to work on transformative technology. The position requires 5+ years of experience in distributed systems, strong programming skills particularly in Python, Go, or C/C++, and a deep understanding of distributed computing architectures. Knowledge of security principles and experience with technologies like Kubernetes and Docker are valuable assets.

This role presents an exciting opportunity to shape the future of AI infrastructure while working with some of the industry's most forward-thinking professionals. You'll be at the forefront of developing solutions that power autonomous vehicles and other critical AI applications, making a significant impact on multiple industries.

Last updated 2 days ago

Responsibilities For Senior Distributed Systems Engineer, AI Infrastructure

  • Architect and build scalable and distributed services for AI infrastructure
  • Design and build infrastructure for PB sized deep learning datasets
  • Design next generation dataset management services
  • Create AI assistants using LLM and AI agents
  • Collaborate with AI teams to build future-proof platform
  • Provide technical leadership across projects
  • Support platform users

Requirements For Senior Distributed Systems Engineer, AI Infrastructure

Python
Go
Kubernetes
  • BS, MS, or PhD in Computer Architecture, Computer Science, Electrical Engineering or related field
  • 5+ years of Work or Research Experience in distributed systems
  • Strong programming background in data structures, design patterns, OOP
  • Experience with distributed computing and storage systems
  • Knowledge of authentication and authorization technologies
  • Advanced programming skills in Python, Go or C/C++
  • Strong interpersonal skills and ability to work in teams
  • Track record of successful technical leadership