Distinguished Software Architect - Deep Learning and HPC Communications

NVIDIA is the world leader in accelerated computing, pioneering GPU technology and AI solutions.
Principal Software Engineer
Remote
5,000+ Employees
15+ years of experience
AI · Enterprise SaaS

Description For Distinguished Software Architect - Deep Learning and HPC Communications

NVIDIA is seeking a Distinguished Software Architect to lead its Deep Learning and HPC Communications initiatives. This role is crucial for scaling Deep Learning and HPC applications across thousands of GPUs. The position involves working with NVIDIA technologies such as NCCL, NVSHMEM, and GPUDirect, focusing on high-speed interconnects and networking solutions. The ideal candidate will co-design next-generation data center platforms, push the boundaries of communication performance between GPUs, and directly impact end-to-end application performance at massive scale. This role offers an opportunity to work at the intersection of AI, HPC, and distributed systems, contributing to NVIDIA's vision of advancing accelerated computing. The position requires deep expertise in parallel computing, network technologies, and machine learning, with opportunities to influence industry standards and open-source software development. Working at NVIDIA means joining one of technology's most desirable employers, with some of the industry's most innovative minds tackling unprecedented challenges in AI and digital twins.

Last updated a month ago

Responsibilities For Distinguished Software Architect - Deep Learning and HPC Communications

  • Research new communication technologies and design features for communication libraries
  • Propose innovative solutions in HW and SW for next-gen platforms
  • Inspire changes based on quantitative data and technical analysis
  • Drive adoption of new communication technologies across application verticals
  • Keep up with the latest DL research and collaborate with diverse teams

Requirements For Distinguished Software Architect - Deep Learning and HPC Communications

  • PhD in Computer Science, Computer Engineering, or a related field, or strong equivalent experience
  • 15+ years of relevant experience in academia or industry
  • Expertise in HPC, parallel programming models (MPI, SHMEM), and communication runtimes
  • Deep understanding of high-performance networking
  • Strong knowledge of ML/DL fundamentals
  • Programming fluency with C or C++ for systems software development
  • Flexibility to work across different teams and time zones
