NVIDIA, a global leader in accelerated computing and AI technology, is seeking a forward-thinking HPC and AI Software Architect to shape the future of scalable AI infrastructure. This role focuses on distributed training, real-time inference, and communication optimization across large-scale systems. You'll join a world-class team building next-generation software and hardware systems powering the most demanding AI workloads.
The position involves designing scalable software systems, optimizing communication libraries, and collaborating with AI framework teams to enhance performance. You'll work on cutting-edge technologies, including NCCL, UCX, and UCC, while co-designing hardware features for accelerated data movement. The ideal candidate brings deep expertise in distributed systems, AI frameworks, and high-performance computing.
At NVIDIA, you'll work alongside industry leaders, pushing the boundaries of what's possible in AI and high-performance computing. The company offers competitive compensation and benefits while promoting a diverse and inclusive workplace. This role presents an exceptional opportunity to impact the future of AI infrastructure, working with state-of-the-art technology and contributing to groundbreaking advancements in distributed computing and artificial intelligence.
The position requires a Ph.D. or equivalent experience, strong programming skills, and practical experience with AI frameworks. You'll be part of a team driving innovation in areas like large model training, inference serving at scale, and communication optimization. NVIDIA's commitment to technological advancement and professional growth makes this an ideal opportunity for those passionate about distributed systems and AI technology.