NVIDIA, the world leader in accelerated computing, is seeking a Senior Site Reliability Engineer for their DGX Cloud team. This role combines software and systems engineering practices to design, build, and maintain large-scale production systems. As an SRE at NVIDIA, you'll work with cutting-edge technologies like Kubernetes and OpenStack to ensure maximum reliability of GPU cloud services. The position requires expertise in systems, networking, coding, database management, and continuous deployment. You'll be part of a diverse, intellectually curious team that values problem-solving and openness. The role offers opportunities to work on meaningful projects with support and mentorship for growth. NVIDIA provides a blame-free environment that encourages innovation and risk-taking. The company's work in AI and digital twins is transforming major industries, making this an opportunity to impact society through technology. The position offers competitive compensation including a base salary range of $144,000-$333,500, plus equity and comprehensive benefits. NVIDIA values diversity and maintains an inclusive work environment, making it one of technology's most desirable employers.