Senior Software Architect - Deep Learning and HPC Communications

NVIDIA is the world leader in accelerated computing, pioneering GPU technology and AI solutions.
Santa Clara, CA, USA · Westford, MA 01886, USA · Austin, TX, USA
$184,000 - $356,500
Staff Software Engineer
Hybrid
5,000+ Employees
5+ years of experience
AI · Enterprise SaaS

Description For Senior Software Architect - Deep Learning and HPC Communications

NVIDIA is seeking a Senior Software Architect to join their GPU Communications Libraries and Networking team. This role focuses on building crucial communication libraries like NCCL, NVSHMEM, and UCX for scaling Deep Learning and HPC applications. The position involves working with cutting-edge technology in GPU communications, designing and implementing solutions that power AI and HPC workloads at massive scales.

The ideal candidate will be responsible for advancing state-of-the-art communication technologies, working on systems that scale to tens of thousands of GPUs. They will be involved in co-design efforts with GPU, Networking, and SW architects to develop next-generation platforms. The role requires expertise in parallel programming, system architecture, and network communications.

This is an exceptional opportunity to work at NVIDIA, a company leading groundbreaking developments in Artificial Intelligence, High Performance Computing, and Visualization. The position offers competitive compensation, including a base salary range of $184,000 - $356,500 USD, plus equity and benefits. The role can be based in several US locations including Santa Clara, CA, Westford, MA, Austin, TX, or Durham, NC, with hybrid work options available.

The position requires strong technical skills in C/C++, Linux, and parallel programming models, combined with deep understanding of system architecture and networking. The successful candidate will work on projects that directly impact the performance and scalability of AI and HPC applications, making this an ideal role for someone passionate about high-performance computing and communication systems.

Last updated a day ago

Responsibilities For Senior Software Architect - Deep Learning and HPC Communications

  • Investigate opportunities to improve communication performance
  • Design and implement new communication technologies to accelerate AI and HPC workloads
  • Explore innovative solutions in HW and SW for next generation platforms
  • Build proofs-of-concept, conduct experiments, and perform quantitative modeling
  • Use simulation to explore performance of large GPU clusters

Requirements For Senior Software Architect - Deep Learning and HPC Communications

  • M.S./Ph.D. degree in CS/CE or equivalent experience
  • 5+ years of relevant experience
  • Excellent C/C++ programming and debugging skills
  • Experience with parallel programming models (MPI, SHMEM)
  • Deep understanding of operating systems, computer and system architecture
  • Solid grasp of the fundamentals of network architecture, topology, and algorithms
  • Strong experience with Linux
  • Ability to work and communicate effectively in a multi-national environment
