NVIDIA is seeking a Senior DGX Cloud AI Infrastructure Software Engineer to join their innovative AI research team. This role is crucial in developing and optimizing the infrastructure that powers NVIDIA's AI initiatives. The position focuses on building and maintaining scalable AI systems that enable large-scale training and inferencing operations.
The role combines deep technical expertise with strategic thinking, requiring the ability to design and implement robust infrastructure solutions while ensuring high efficiency and availability. You'll be working with cutting-edge AI technologies, including LLMs and GenAI, while contributing to the development of tools and services that support NVIDIA's AI platforms.
As a senior engineer, you'll be responsible for complex problem-solving, from application-level issues to hardware-level challenges. The position offers significant autonomy while providing the support and mentorship needed for success. NVIDIA's culture promotes blameless postmortems, continuous improvement, and innovative thinking.
The ideal candidate brings extensive experience in software infrastructure for AI systems, strong debugging capabilities, and a proven track record in scaling distributed systems. Knowledge of GPU technologies, network protocols, and deep learning frameworks is highly valued. The role offers competitive compensation, including equity and comprehensive benefits, reflecting NVIDIA's position as a leader in the AI and accelerated computing space.
Working at NVIDIA means being at the forefront of AI innovation, contributing to groundbreaking developments that transform industries. The company's commitment to diversity and inclusion, combined with its focus on pushing technological boundaries, creates an exciting and rewarding environment for professional growth and impact.