NVIDIA, a global leader in AI and accelerated computing, is seeking a Site Reliability Engineer to join its Digital Marketing Organization. This role combines technical expertise with operational excellence, focusing on maintaining and improving AWS infrastructure and ensuring the reliability of NVIDIA's Digital Marketing Services.
The position offers an opportunity to work with cutting-edge technology at a company that has continuously reinvented itself over two decades. From inventing the GPU in 1999 to becoming "the AI computing company," NVIDIA has been at the forefront of technological advancement. The role involves working with AWS infrastructure, Kubernetes, and several programming languages to keep services reliable and efficient.
As an SRE, you'll be responsible for critical tasks including debugging user-reported issues, implementing monitoring solutions, and automating deployment pipelines. The role requires a blend of technical skills in Python, Java, and cloud technologies, along with strong problem-solving abilities and excellent communication skills. You'll be part of a team that values innovation and autonomous thinking, with opportunities to make significant impacts on service reliability and performance.
The position offers a competitive compensation package with a base salary range of $136,000 to $212,750, plus equity and comprehensive benefits. NVIDIA is known for its inclusive work environment and commitment to diversity, making it an ideal place for professionals looking to advance their careers in technology while working on meaningful projects that shape the future of computing.