
Senior System Software Engineer, NCCL - Partner Enablement

NVIDIA is the world leader in accelerated computing and GPU technology.
$148,000 - $287,500
Senior Software Engineer
Remote
5,000+ Employees
5+ years of experience
AI · Enterprise SaaS

Description For Senior System Software Engineer, NCCL - Partner Enablement

NVIDIA, the pioneer in GPU technology and accelerated computing, is seeking a Senior System Software Engineer for its GPU Communications Libraries and Networking team. This role focuses on the NCCL and NVSHMEM communication runtimes for Deep Learning (DL) and HPC applications. The position sits at the intersection of AI and high-performance networking, supporting large-scale GPU clusters with advanced networking technologies.

The role involves collaborating closely with partners and customers to optimize performance and resolve technical issues, conducting in-depth analysis of NCCL and DL applications, and developing automation tools for various cloud platforms. The ideal candidate will bring strong expertise in parallel programming, C/C++ development, and high-performance networking, particularly InfiniBand, RoCE, and Ethernet networks.

This position offers the chance to work with cutting-edge AI and HPC technology while contributing to NVIDIA's vision of advancing accelerated computing. The role combines technical depth with customer interaction, requiring both strong engineering skills and excellent communication abilities. Working at NVIDIA means being part of a team that is transforming industries through AI and digital twin technology.

The compensation package includes a competitive base salary range of $148,000 - $287,500 USD, plus equity and benefits. This is an excellent opportunity for experienced engineers passionate about high-performance computing and AI to make a significant impact in a leading technology company.


Responsibilities For Senior System Software Engineer, NCCL - Partner Enablement

  • Engage with partners and customers to root-cause functional and performance issues reported with NCCL
  • Conduct performance characterization and analysis of NCCL and DL applications on GPU clusters
  • Develop tools and automation to isolate issues on new systems and platforms
  • Share HPC knowledge with customers and support teams
  • Write documentation and conduct trainings and webinars for NCCL
  • Engage with internal teams in different time zones

Requirements For Senior System Software Engineer, NCCL - Partner Enablement

  • B.S./M.S. degree in CS/CE, or equivalent experience, with 5+ years of relevant experience
  • Experience with parallel programming and communication runtimes
  • Excellent C/C++ programming skills
  • Experience working with an engineering or academic research community supporting HPC or AI
  • Practical experience with high performance networking
  • Expert in Linux fundamentals and Python
  • Familiar with containers, cloud provisioning and scheduling tools
  • Adaptability and passion to learn new areas and tools
  • Flexibility to work and communicate effectively across different teams and time zones

Benefits For Senior System Software Engineer, NCCL - Partner Enablement

  • Equity

