Taro Logo

Senior Software Engineer, Bare Metal Automation - DGX Cloud

NVIDIA is the world leader in accelerated computing, pioneering GPU technology and AI solutions.
$148,000 - $287,500
Cloud
Senior Software Engineer
Remote
5,000+ Employees
5+ years of experience
AI · Enterprise SaaS

Description For Senior Software Engineer, Bare Metal Automation - DGX Cloud

NVIDIA, the world leader in accelerated computing, is seeking a Senior Software Engineer to join their DGX Cloud team. This role focuses on bare metal automation and is critical to scaling NVIDIA's AI Infrastructure. The position offers a competitive salary range of $148,000 - $287,500 USD, plus equity and benefits.

The role involves working with cutting-edge GPU technology and AI infrastructure, where you'll be responsible for managing and automating large fleets of bare metal hardware. You'll be part of a team that ensures the reliability and performance of production AI clusters, implementing sophisticated monitoring and health management capabilities.

As a Senior Software Engineer, you'll be working with various data streams, from GPU hardware diagnostics to cluster and network telemetry. The position requires strong expertise in systems programming languages like Go and Python, along with deep understanding of bare metal hardware APIs and frameworks.

NVIDIA's culture encourages creativity, autonomy, and out-of-the-box thinking. You'll be working at the forefront of the AI computing era, contributing to technology that transforms industries from gaming to scientific research. The company has a strong focus on innovation and has been pioneering visual computing for two decades.

The ideal candidate will have 5+ years of experience in similar roles, strong communication skills, and a proven track record of working with large-scale production systems. A background in Computer Science, Engineering, Physics, or Mathematics is required. The position offers the flexibility of remote work and the opportunity to work with some of the most forward-thinking professionals in the technology industry.

This is an excellent opportunity for someone passionate about GPU hardware and AI infrastructure who wants to make a significant impact in the field of accelerated computing. You'll be part of a company that's driving the future of AI and digital twins, working on challenges that no one else can solve.

Last updated 9 days ago

Responsibilities For Senior Software Engineer, Bare Metal Automation - DGX Cloud

  • Work on DGX Cloud team managing production systems for large scalable GPU clusters
  • Implement monitoring and health management capabilities for GPU assets
  • Work with cross-functional teams to ensure production AI clusters run reliably
  • Evaluate system failures and improve services based on incident management process
  • Manage and automate bare metal hardware

Requirements For Senior Software Engineer, Bare Metal Automation - DGX Cloud

Python
Go
  • 5+ years experience in similar role with large-scale production systems
  • BS in Computer Science, Engineering, Physics, Mathematics or equivalent experience
  • Experience with systems programming languages (Go, Python)
  • Strong understanding of data structures and algorithms
  • Direct experience in software engineering with bare metal hardware APIs
  • Strong communication skills and ability to work with cross-functional teams

Benefits For Senior Software Engineer, Bare Metal Automation - DGX Cloud

Equity
  • Equity
  • Comprehensive benefits package

Interested in this job?

Jobs Related To NVIDIA Senior Software Engineer, Bare Metal Automation - DGX Cloud