Taro Logo

Senior System Software Engineer, NCCL - Partner Enablement

NVIDIA is the world leader in accelerated computing and GPU technology.
$148,000 - $287,500
Senior Software Engineer
Remote
5,000+ Employees
5+ years of experience
AI · Enterprise SaaS

Description For Senior System Software Engineer, NCCL - Partner Enablement

NVIDIA, the pioneer in GPU technology and accelerated computing, is seeking a Senior System Software Engineer for their GPU Communications Libraries and Networking team. This role focuses on NCCL and NVSHMEM communication runtimes for Deep Learning and HPC applications. The position offers a unique opportunity to work at the intersection of AI and high-performance networking, supporting large-scale GPU clusters.

The role involves close collaboration with partners and customers to optimize performance and resolve technical issues. You'll be conducting sophisticated performance analysis on cutting-edge GPU clusters and developing tools for system optimization. The position requires expertise in parallel programming, high-performance networking, and strong C/C++ programming skills.

This is an ideal opportunity for someone with a strong background in HPC or AI infrastructure who wants to work on groundbreaking technology. The role offers competitive compensation ($148,000 - $287,500) based on experience level, plus equity and benefits. NVIDIA's commitment to innovation in AI, HPC, and visualization makes this an exciting opportunity to work on technology that's transforming multiple industries.

Working at NVIDIA means joining a team that's leading the way in artificial intelligence and high-performance computing. The company culture values diversity, innovation, and technical excellence. With locations in Santa Clara, Austin, and remote work options, this role offers flexibility while working on some of the most challenging problems in computing.

Last updated 2 days ago

Responsibilities For Senior System Software Engineer, NCCL - Partner Enablement

  • Engage with partners and customers to root cause functional and performance issues reported with NCCL
  • Conduct performance characterization and analysis of NCCL and DL applications on GPU clusters
  • Develop tools and automation to isolate issues on new systems and platforms
  • Guide customers and support teams on HPC knowledge
  • Document and conduct trainings/webinars for NCCL
  • Engage with internal teams on networking, GPUs, storage, infrastructure and support

Requirements For Senior System Software Engineer, NCCL - Partner Enablement

Linux
Python
  • B.S./M.S. degree in CS/CE or equivalent experience with 5+ years of relevant experience
  • Experience with parallel programming and communication runtime
  • Excellent C/C++ programming skills
  • Experience working with engineering or academic research community supporting HPC or AI
  • Practical experience with high performance networking
  • Expert in Linux fundamentals and Python
  • Familiar with containers, cloud provisioning and scheduling tools
  • Adaptability and passion to learn new areas and tools
  • Flexibility to work and communicate effectively across different teams and timezones

Benefits For Senior System Software Engineer, NCCL - Partner Enablement

Equity
  • Equity

Jobs Related To NVIDIA Senior System Software Engineer, NCCL - Partner Enablement