Taro Logo

Senior HPC Performance Engineer

NVIDIA is the world leader in accelerated computing, pioneering GPU technology and breakthroughs in AI and digital twins.
Senior Software Engineer
Remote
5,000+ Employees
3+ years of experience
AI · Enterprise SaaS

Job Description

NVIDIA, a global leader in accelerated computing and GPU technology, is seeking a Senior HPC Performance Engineer to join their innovative team. This role focuses on optimizing GPU communication libraries crucial for scaling Deep Learning and HPC applications. The position offers an opportunity to work with cutting-edge technology, influencing the roadmap of communication libraries that power applications running on thousands of GPUs.

The ideal candidate will be responsible for performance optimization and analysis on large-scale GPU clusters, working with technologies like NVLink, PCIe, and high-speed networking solutions. This role combines deep technical expertise in HPC with practical performance engineering, requiring both analytical skills and hands-on programming experience.

Working at NVIDIA means being at the forefront of AI, High-Performance Computing, and Visualization. The company's work extends from fundamental GPU technology to groundbreaking applications in artificial intelligence and autonomous vehicles. The team environment is dynamic and collaborative, with opportunities to work across multiple time zones and influence the future of computing technology.

The position offers the flexibility of remote work from several European locations, including Germany, Poland, UK, and Switzerland. NVIDIA provides competitive compensation and benefits, fostering a diverse and inclusive workplace where innovation thrives. This role is perfect for someone passionate about performance engineering who wants to make a significant impact on the future of AI and HPC computing infrastructure.

Last updated 2 days ago

Responsibilities For Senior HPC Performance Engineer

  • Conduct in-depth performance characterization and analysis on large multi-GPU and multi-node clusters
  • Study the interaction of libraries with HW and SW components in the stack
  • Evaluate proof-of-concepts, conduct trade-off analysis
  • Triage and root-cause performance issues reported by customers
  • Collect performance data and build tools for visualization and analysis
  • Collaborate with team across multiple time zones

Requirements For Senior HPC Performance Engineer

Python
Linux
Kubernetes
  • M.S. or PHD in Computer Science or related field
  • 3+ years experience with parallel programming and communication runtime
  • Experience with performance benchmarking on large scale HPC clusters
  • Understanding of computer system architecture and operating systems principles
  • Ability to implement micro-benchmarks in C/C++
  • Proficiency in Python
  • Experience with containers, cloud provisioning and scheduling tools
  • Effective communication and collaboration skills