NVIDIA is seeking an exceptional Senior Site Reliability Engineer to join their Infrastructure, Planning and Processes organization. This role is part of a dynamic team responsible for developing and maintaining sophisticated build & test environments for various hardware platforms including NVIDIA GPUs and Tegra Processors across multiple operating systems. The position offers an opportunity to work with cutting-edge technologies in AI, Robotics, and Autonomous Vehicles.
The ideal candidate will be responsible for implementing and managing Kubernetes architectures, establishing high-availability clusters, and developing automation tools. They will work with various business units within NVIDIA Software, including Graphics Processors, Mobile Processors, Deep Learning, and Artificial Intelligence teams. The role requires expertise in infrastructure as code, monitoring solutions, and cloud infrastructure development.
This is an excellent opportunity for a seasoned SRE professional who thrives in a fast-paced environment and wants to work with state-of-the-art technology. The position offers competitive compensation and benefits, making it an attractive opportunity for those looking to advance their career at one of the technology world's most desirable employers. NVIDIA's commitment to innovation in accelerated computing and AI makes this an exciting opportunity to work on transformative technologies that impact various industries.