NVIDIA is seeking a Senior High Performance Computing (HPC) and AI Networking Performance Research and Analysis Engineer to join their Performance group. This role sits at the intersection of AI, distributed systems, and high-performance computing, focusing on optimizing networking performance for large-scale deep learning and LLM training.
The position offers an opportunity to work with cutting-edge technology in AI and GPU computing, profiling and analyzing AI workloads on large-scale GPU clusters. You'll be working with various hardware platforms including HCAs, Switches, CPUs, and GPUs, developing performance analysis tools and methodologies to understand and optimize system performance.
As part of NVIDIA, you'll be joining a company at the forefront of AI and accelerated computing innovation. NVIDIA has been redefining computer graphics and computing for over 25 years and is now leading the charge in AI development. The company offers competitive salaries and comprehensive benefits in a diverse, supportive environment.
The ideal candidate will bring strong expertise in high-performance networking, deep learning frameworks, and performance analysis. You'll need to demonstrate proficiency in Python, Bash, and C languages, along with experience in Linux environments. Knowledge of CUDA, NCCL libraries, and congestion control algorithms would be particularly valuable.
This role offers the chance to make a significant impact on the future of AI computing, working with some of the most advanced technology in the field. You'll be part of a team pushing the boundaries of what's possible in distributed deep learning and high-performance computing, while contributing to NVIDIA's mission of solving challenges no one else can solve.