Senior Site Reliability Engineer - DGX Cloud

NVIDIA is a global technology company and leader in AI computing, graphics, and accelerated computing.
$150,000 - $220,000
Site Reliability
Senior Software Engineer
5,000+ Employees
5+ years of experience
AI · Enterprise SaaS

Description For Senior Site Reliability Engineer - DGX Cloud

NVIDIA, a world leader in artificial intelligence computing and graphics technology, is seeking a Senior Site Reliability Engineer for their DGX Cloud platform. This role sits at the intersection of cloud infrastructure and AI computing, working with NVIDIA's cutting-edge DGX systems that power some of the most advanced AI workloads in the industry. The position requires expertise in cloud infrastructure, site reliability engineering practices, and a deep understanding of distributed systems. As part of NVIDIA's cloud operations team, you'll be responsible for ensuring the reliability, scalability, and performance of the DGX Cloud platform that serves enterprise customers worldwide. This is an opportunity to work with state-of-the-art technology in AI and cloud computing, while being part of a company that's driving innovation in multiple industries including artificial intelligence, gaming, autonomous vehicles, and scientific computing. The role offers exposure to complex distributed systems at scale and the chance to work with a team of highly skilled engineers who are passionate about building reliable, performant cloud infrastructure for AI workloads.

Last updated 2 days ago

Interested in this job?

Jobs Related To NVIDIA Senior Site Reliability Engineer - DGX Cloud

Site Reliability Engineer - Cloud

Senior Site Reliability Engineer position at NVIDIA focusing on AWS infrastructure and cloud services, offering competitive compensation and opportunity to work with cutting-edge technology.

Platform Reliability Engineer

Senior Platform Reliability Engineer role at NVIDIA focusing on maintaining and improving the reliability of their Unified Commerce Platform through automated testing and monitoring solutions.

Site Reliability Engineer

Senior Site Reliability Engineer position at Wheely, focusing on infrastructure security, monitoring, and DevOps practices in Nicosia, Cyprus.

Senior Software Engineer, Site Reliability Engineering

Senior SRE position at Adobe working on Identity Services, focusing on scalability, reliability and zero downtime for systems handling millions of requests.

Site Reliability Engineer - Cloud

Senior Site Reliability Engineer position at NVIDIA focusing on AWS infrastructure and cloud services, offering competitive compensation and opportunity to work with cutting-edge technology.