Senior AI Infrastructure Engineer - DGX Cloud

NVIDIA is the world leader in accelerated computing, pioneering AI and digital twin technology.
$144,000 - $270,250
Cloud
Senior Software Engineer
Remote
5,000+ Employees
5+ years of experience
AI · Enterprise SaaS

Description For Senior AI Infrastructure Engineer - DGX Cloud

NVIDIA is seeking a Senior AI Infrastructure Engineer for its DGX Cloud team to ensure maximum reliability and uptime of GPU cloud services. This role combines SRE principles with cutting-edge AI infrastructure, focusing on building tooling, reporting, and automation to enable operational excellence. The position offers an opportunity to work with state-of-the-art technology at NVIDIA, a leader in accelerated computing and AI innovation.

The role involves designing and implementing critical infrastructure tools and data pipelines that directly impact business decisions at the executive level. You'll be working with cloud infrastructure, incident management systems, and modern DevOps tools while collaborating with various teams to improve operational efficiency.

As a Senior AI Infrastructure Engineer, you'll be responsible for maintaining high-reliability systems while enabling developer productivity. The position requires a strong background in distributed systems, infrastructure automation, and programming languages like Python, Go, or TypeScript. Knowledge of Kubernetes, Terraform, and ML concepts is highly valued.

NVIDIA offers a competitive compensation package with a base salary range of $144,000 - $270,250 USD, plus equity and comprehensive benefits. The company is known for its innovative work in AI, High-Performance Computing, and Visualization, making it an ideal place for those passionate about cutting-edge technology and scalable infrastructure.

The role offers both technical challenges and leadership opportunities, requiring someone who can balance independent initiative with strong collaboration skills. Working at NVIDIA means being at the forefront of AI and cloud computing innovation, with the chance to impact how some of the world's most advanced computing systems are operated and maintained.

Responsibilities For Senior AI Infrastructure Engineer - DGX Cloud

  • Design, build, deploy, and run internal tooling built on top of cloud infrastructure
  • Design, implement, ship, and maintain essential data pipelines for executive leadership
  • Integrate tooling with internal and customer workflows
  • Reduce the toil of running an incident, writing a postmortem, and being on call
  • Evangelize sustainable, blameless incident prevention and incident response
  • Consult with peer teams on operations best practices

Requirements For Senior AI Infrastructure Engineer - DGX Cloud

Python · Go · TypeScript · Kubernetes
  • BS degree in Computer Science or related technical field
  • 5+ years of experience
  • Experience with infrastructure automation and distributed systems design
  • Experience in Python, Go, TypeScript, C/C++, or Java
  • In-depth knowledge of Linux, Networking, Storage, and Containers
  • Track record of project initiation and collaboration
