Senior Software Architect - Deep Learning and HPC Communications

NVIDIA is the world leader in accelerated computing, pioneering GPU technology and AI solutions.
Santa Clara, CA, USA · Westford, MA 01886, USA · Austin, TX, USA
$184,000 - $356,500
Staff Software Engineer
Hybrid
5,000+ Employees
5+ years of experience
AI · Enterprise SaaS
This job posting may no longer be active.

Description For Senior Software Architect - Deep Learning and HPC Communications

NVIDIA, the pioneer in GPU technology and accelerated computing, is seeking a Senior Software Architect to join their GPU Communications Libraries and Networking team. This role focuses on building crucial communication libraries like NCCL, NVSHMEM, and UCX that are essential for scaling Deep Learning and HPC applications.

The position offers an opportunity to work at the cutting edge of technology, designing and implementing next-generation data center platforms and scalable communications software. The role involves working with systems that operate at massive scales, handling workloads across tens of thousands of GPUs connected through high-speed interconnects like NVLink and InfiniBand.

As a Senior Software Architect, you'll be responsible for advancing the state-of-the-art in GPU communications, identifying and solving performance bottlenecks, and contributing to co-design efforts involving GPU, Networking, and Software architects. The role requires deep technical expertise in parallel programming, system architecture, and high-performance computing.

The position offers a competitive compensation package with a base salary range of $184,000 - $356,500 USD, plus equity and comprehensive benefits. This is an excellent opportunity for experienced professionals who want to make a significant impact on the future of AI and HPC infrastructure.

Working at NVIDIA means joining a company at the forefront of artificial intelligence, high-performance computing, and visualization technology. The company culture emphasizes innovation, technical excellence, and collaborative problem-solving in a multi-national environment.

Last updated 3 days ago

Responsibilities For Senior Software Architect - Deep Learning and HPC Communications

  • Investigate opportunities to improve communication performance
  • Design and implement new communication technologies to accelerate AI and HPC workloads
  • Explore innovative solutions in HW and SW for next generation platforms
  • Build proofs-of-concept, conduct experiments, and perform quantitative modeling
  • Use simulation to explore performance of large GPU clusters

Requirements For Senior Software Architect - Deep Learning and HPC Communications

  • M.S./Ph.D. degree in CS/CE or equivalent experience
  • 5+ years of relevant experience
  • Excellent C/C++ programming and debugging skills
  • Experience with parallel programming models (MPI, SHMEM)
  • Deep understanding of operating systems, computer and system architecture
  • Solid fundamentals of network architecture, topology, and algorithms
  • Strong experience with Linux
  • Ability to work and communicate effectively in a multi-national environment

Benefits For Senior Software Architect - Deep Learning and HPC Communications

  • Equity compensation
  • Medical insurance
  • Comprehensive benefits package
