NVIDIA is seeking a Senior AI Infrastructure Engineer for their DGX Cloud group, focusing on designing and maintaining large-scale production systems. This role combines software and systems engineering, requiring expertise in systems, networking, coding, database management, and cloud technologies. The position is part of NVIDIA's DGX Cloud SRE team, ensuring reliable GPU cloud services while managing system changes and capacity.
The role offers an opportunity to work with cutting-edge AI infrastructure at one of technology's most respected companies. You'll be responsible for building and maintaining the backbone of NVIDIA's AI training and inferencing platforms, working with multi-GPU clusters and distributed systems. The position combines hands-on technical work with strategic system design and planning.
NVIDIA's culture emphasizes diversity, intellectual curiosity, and problem-solving in a blame-free environment. The company encourages collaboration and risk-taking while providing support and mentorship for professional growth. As a leader in AI and accelerated computing, NVIDIA offers the chance to work on meaningful projects that impact various industries.
The position includes competitive compensation with a base salary range of $148,000 - $287,500 USD, plus equity and benefits. The role can be performed from Santa Clara, CA, or remotely from WA or CA, offering flexibility in work location while being part of a team that's pushing the boundaries of AI infrastructure.