NVIDIA is seeking a Senior AI Infrastructure Engineer for their DGX Cloud team to ensure maximum reliability and uptime of GPU cloud services. This role combines infrastructure engineering with AI technology, focusing on building and maintaining cloud-based systems that power NVIDIA's cutting-edge AI solutions.
The position requires a strong background in computer science fundamentals and offers the opportunity to work with state-of-the-art AI infrastructure. You'll be responsible for designing and implementing critical data pipelines, creating internal tooling, and leveraging AI/ML to improve operational efficiency. The role involves working with modern technologies including Kubernetes, terraform, and various ML frameworks.
As part of NVIDIA, one of technology's most desirable employers, you'll be at the forefront of AI and High-Performance Computing. The company's invention, the GPU, serves as the foundation for breakthrough developments in these fields. The position offers competitive compensation, including a base salary range of $148,000 - $287,500 USD, plus equity and benefits.
This role is perfect for someone who combines technical expertise with creative problem-solving abilities. You'll work in a dynamic environment that values initiative and collaboration, contributing to projects that have real impact on the AI infrastructure landscape. NVIDIA's commitment to diversity and inclusion ensures a welcoming workplace for all professionals.
The ideal candidate will have 5+ years of experience, strong infrastructure automation skills, and expertise in languages like Python, Go, or TypeScript. Knowledge of Linux, networking, and containers is essential. Experience with incident tooling, Backstage, and ML concepts would be particularly valuable. This position offers both technical challenges and professional growth opportunities in a company that's driving the future of AI technology.