NVIDIA, the pioneer in visual computing and GPU technology, is seeking a Software Engineer Intern for their Fleet Health Instrumentation team for Fall 2025. This role sits at the intersection of cloud infrastructure and GPU computing, where you'll help design and implement systems that monitor and maintain NVIDIA's global GPU fleet.
The position offers a unique opportunity to work with cutting-edge technology in a company that's leading the AI and visual computing revolution. You'll be involved in developing microservices and data pipelines that process millions of records daily, using modern technologies like Go, Python, Kafka, and Kubernetes. The role combines software engineering with infrastructure monitoring, giving you exposure to both development and operations aspects of cloud computing.
As an intern, you'll be immersed in NVIDIA's engineering culture, working on high-impact features that keep GPU-accelerated platforms running smoothly at a global scale. You'll gain hands-on experience with service design, development, system instrumentation, and data-pipeline engineering. The position emphasizes writing robust, performant code and automation, ensuring NVIDIA's cloud offerings maintain world-class reliability.
The ideal candidate should be pursuing a BS or MS in Computer Science or related field, with a strong foundation in distributed systems and modern software engineering practices. You'll need proficiency in either Python or Go, along with knowledge of Linux and Kubernetes. This internship offers competitive compensation ($18-71/hour) and benefits, making it an excellent opportunity for students looking to gain real-world experience in cloud infrastructure and systems engineering at a leading technology company.
What makes this role particularly exciting is the chance to work on systems that directly impact NVIDIA's core infrastructure, supporting the company's mission in AI and visual computing. You'll be part of a team that values automation, reliability, and continuous improvement, while getting exposure to enterprise-scale systems and modern cloud technologies.