Taro Logo

Senior DGX Cloud Software Engineer - Infrastructure Automation and Distributed Systems

NVIDIA is the world leader in accelerated computing, pioneering AI and digital twins technology.
$144,000 - $270,250
Cloud
Senior Software Engineer
Remote
5,000+ Employees
5+ years of experience
AI · Enterprise SaaS

Description For Senior DGX Cloud Software Engineer - Infrastructure Automation and Distributed Systems

NVIDIA is seeking an experienced Senior DGX Cloud Software Engineer to join their Infrastructure Automation and Distributed Systems team. This role is central to supporting NVIDIA's AI training and inference development initiatives through building and maintaining platforms, tools, and services for their bare-metal, accelerated compute infrastructure.

The position offers an opportunity to work at the forefront of AI and cloud computing technology, specifically focusing on the DGX Cloud platform ecosystem. The role combines hands-on technical work with strategic system design, requiring expertise in cloud infrastructure, automation, and distributed systems. You'll be working with cutting-edge technologies including Kubernetes, Linux, and various cloud infrastructure tools.

As a senior engineer, you'll be responsible for designing and implementing cloud infrastructure services, participating in defining service level objectives, and working on automation initiatives. The role involves both independent work and collaboration with peer teams, requiring strong technical skills and excellent communication abilities.

NVIDIA offers a competitive compensation package with a base salary range of $144,000 to $270,250 USD, plus equity and additional benefits. The company is known for its innovative work in AI, High-Performance Computing, and Visualization, making it an ideal place for engineers passionate about working with cutting-edge technology.

The position offers flexibility with remote work options while being part of a team that's driving the next wave of artificial intelligence development. NVIDIA's culture emphasizes innovation, autonomy, and technical excellence, making it an attractive destination for engineers looking to make a significant impact in the field of AI and cloud computing.

This role would be ideal for someone with a strong background in infrastructure engineering who wants to work on large-scale systems that power AI and machine learning applications. The position offers opportunities for both technical growth and leadership development, with exposure to advanced technologies and complex technical challenges.

Last updated 7 hours ago

Responsibilities For Senior DGX Cloud Software Engineer - Infrastructure Automation and Distributed Systems

  • Design, build, and run cloud infrastructure services
  • Participate in defining internal facing service level objectives and error budgets
  • Eliminate or automate toil where ROI justifies it
  • Practice sustainable blameless incident prevention and response
  • Participate in on-call rotation
  • Consult with peer teams on systems design best practices

Requirements For Senior DGX Cloud Software Engineer - Infrastructure Automation and Distributed Systems

Python
Go
Kubernetes
Linux
  • Proficiency in Python or Go
  • BS degree in Computer Science or related technical field
  • 5+ years of experience in infrastructure and fleet management engineering
  • Experience with infrastructure automation and distributed systems design
  • Track record of project initiation and collaboration
  • In-depth knowledge of Linux, Slurm, Kubernetes, Local and Distributed Storage, and Systems Networking

Benefits For Senior DGX Cloud Software Engineer - Infrastructure Automation and Distributed Systems

Equity
  • Equity
  • Additional benefits mentioned but not specified in detail

Interested in this job?

Jobs Related To NVIDIA Senior DGX Cloud Software Engineer - Infrastructure Automation and Distributed Systems

Senior AI Infrastructure Engineer - DGX Cloud

Senior AI Infrastructure Engineer position at NVIDIA focusing on DGX Cloud services, requiring 5+ years of experience in cloud infrastructure and software development.

Senior Cloud Software Engineer

Senior Cloud Software Engineer position at NVIDIA working on DGX Cloud Engineering Team, building cloud services and virtualization frameworks for AI workloads.

Senior Software Engineer - DGX Cloud

Senior Software Engineer position at NVIDIA's DGX Cloud team, focusing on building and managing cloud infrastructure software with competitive compensation and benefits.

Senior Systems Software Engineer, Containers and Kubernetes

Senior Systems Software Engineer position at NVIDIA focusing on container and Kubernetes technologies, offering competitive salary and the opportunity to work with cutting-edge GPU/DPU technologies.

Senior Systems Software Engineer, Containers and Kubernetes

Senior Systems Software Engineer role at NVIDIA focusing on container runtimes and Kubernetes technologies, requiring expertise in Go, distributed systems, and cloud computing.