Senior HPC Performance Engineer

NVIDIA

NVIDIA is the world leader in accelerated computing, pioneering GPU technology and breakthroughs in AI and digital twins.

Senior Software Engineer

Remote

5,000+ Employees

3+ years of experience

AI · Enterprise SaaS

Job Description

NVIDIA, a global leader in accelerated computing and GPU technology, is seeking a Senior HPC Performance Engineer to join their innovative team. This role focuses on optimizing GPU communication libraries crucial for scaling Deep Learning and HPC applications. The position offers an opportunity to work with cutting-edge technology, influencing the roadmap of communication libraries that power applications running on thousands of GPUs.

The ideal candidate will be responsible for performance optimization and analysis on large-scale GPU clusters, working with technologies like NVLink, PCIe, and high-speed networking solutions. This role combines deep technical expertise in HPC with practical performance engineering, requiring both analytical skills and hands-on programming experience.

Working at NVIDIA means being at the forefront of AI, High-Performance Computing, and Visualization. The company's work extends from fundamental GPU technology to groundbreaking applications in artificial intelligence and autonomous vehicles. The team environment is dynamic and collaborative, with opportunities to work across multiple time zones and influence the future of computing technology.

The position offers the flexibility of remote work from several European locations, including Germany, Poland, UK, and Switzerland. NVIDIA provides competitive compensation and benefits, fostering a diverse and inclusive workplace where innovation thrives. This role is perfect for someone passionate about performance engineering who wants to make a significant impact on the future of AI and HPC computing infrastructure.

Last updated 2 days ago

Responsibilities For Senior HPC Performance Engineer

Conduct in-depth performance characterization and analysis on large multi-GPU and multi-node clusters
Study the interaction of libraries with HW and SW components in the stack
Evaluate proof-of-concepts, conduct trade-off analysis
Triage and root-cause performance issues reported by customers
Collect performance data and build tools for visualization and analysis
Collaborate with team across multiple time zones

Requirements For Senior HPC Performance Engineer

Python

Linux

Kubernetes

M.S. or PHD in Computer Science or related field
3+ years experience with parallel programming and communication runtime
Experience with performance benchmarking on large scale HPC clusters
Understanding of computer system architecture and operating systems principles
Ability to implement micro-benchmarks in C/C++
Proficiency in Python
Experience with containers, cloud provisioning and scheduling tools
Effective communication and collaboration skills