NVIDIA, the world leader in accelerated computing, is seeking a Senior AI Infrastructure Engineer for its Compute Architecture Group. The role involves managing a diverse cluster of GPU-accelerated systems that supports AI and software development, and it requires expertise in system administration, performance analysis, automation, and architecture. You'll work with cutting-edge technology and enable groundbreaking experimentation in the design of the world's most powerful systems.
The role combines hands-on technical work with strategic planning: you will administer AI clusters, maintain SLURM configurations, and implement DevOps practices using tools like Ansible and GitLab. You'll work directly with developers and hardware architects, making a meaningful impact at a company spearheading the next wave in computing technology.
Ideal candidates have 5+ years of experience with large-scale clusters, strong technical knowledge of distributed systems, and expertise in Linux administration. The position offers competitive compensation ($144,000-$270,250) plus equity and benefits. This is an excellent opportunity for someone passionate about AI infrastructure who wants to work at the forefront of technology innovation.
NVIDIA's commitment to diversity and inclusion, combined with its position as a leader in AI and digital twin technology, makes this an attractive opportunity for those looking to make a significant impact in accelerated computing. The role offers the chance to work with a technically diverse team of GPU architects, software engineers, and infrastructure experts in a fast-paced, innovation-driven environment.