Taro Logo

Senior Performance Analysis Engineer

NVIDIA is the world leader in accelerated computing, pioneering AI and digital twins technology.
Senior Software Engineer
In-Person
5,000+ Employees
5+ years of experience
AI · Enterprise SaaS

Description For Senior Performance Analysis Engineer

NVIDIA is seeking a Senior Performance Analysis Engineer to join their Performance group, focusing on profiling and analyzing AI workloads on large GPU and CPU scale clusters for distributed Deep Learning LLM training. This role sits at the intersection of high-performance computing and artificial intelligence, working with cutting-edge technology including GPUs, CPUs, and networking systems.

The position involves deep technical work with NVIDIA's supercomputers and distributed systems, with a particular focus on high-performance networking and the NVIDIA Collective Communications Library (NCCL). You'll be responsible for benchmarking, profiling, and analyzing performance to optimize large-scale AI systems, while developing new tools and methodologies for performance analysis.

As a Senior Performance Analysis Engineer, you'll collaborate across multiple teams, from hardware to software, providing crucial insights that drive performance improvements. The role requires extensive experience with high-performance networking protocols and technologies, combined with a strong understanding of GPU computing and deep learning frameworks.

NVIDIA offers a unique opportunity to work at the forefront of AI and accelerated computing, with access to the latest technology and the chance to make a significant impact on the future of computing. The company provides competitive salaries and comprehensive benefits, fostering an environment where innovation and technical excellence are highly valued.

The ideal candidate will bring a combination of technical expertise in performance analysis, networking protocols, and AI systems, along with strong analytical and problem-solving skills. This role offers the opportunity to work on challenging problems at scale, contributing to NVIDIA's mission of advancing the field of artificial intelligence and accelerated computing.

Last updated a day ago

Responsibilities For Senior Performance Analysis Engineer

  • Exploring and researching AI workloads and DL models for large-scale deep learning LLM training
  • Benchmarking, Profiling, and Analyzing performance to find bottlenecks
  • Implementing performance analysis tools
  • Collaborating with hardware and software teams
  • Defining performance test planning and setting performance expectations

Requirements For Senior Performance Analysis Engineer

Python
Linux
  • B.Sc. in Computer Science or Software Engineering or equivalent experience
  • 5+ years of experience with high-performance Networking (RDMA, MPI, NCCL, Congestion Control Algorithms)
  • Demonstrated Performance Analysis skills and methodologies
  • Experience with NVIDIA GPUs, CUDA library, deep learning frameworks
  • Fast and self-learning capabilities with strong analytical and problem-solving skills
  • Programming Languages: Python, Bash and C languages
  • Experience with Linux OS distros
  • Great teammate with good communication and interpersonal skills

Interested in this job?

Jobs Related To NVIDIA Senior Performance Analysis Engineer