Taro Logo

Senior Performance Software Engineer, Deep Learning Libraries

NVIDIA is the world leader in accelerated computing, pioneering solutions for AI and digital twins.
Machine Learning
Senior Software Engineer
Remote
5,000+ Employees
4+ years of experience
AI · Enterprise SaaS

Description For Senior Performance Software Engineer, Deep Learning Libraries

NVIDIA is seeking a Senior Performance Software Engineer to join their Deep Learning Libraries team. This role focuses on developing optimized code to accelerate linear algebra and deep learning operations on NVIDIA GPUs, contributing to crucial libraries like cuDNN, cuBLAS, and TensorRT. The position is integral to enabling breakthroughs in image classification, speech recognition, and natural language processing. The team works on low-level GPU optimization, writing highly efficient code for current and future-generation GPUs. The role involves close collaboration with various NVIDIA teams, including the CUDA compiler team, deep learning performance teams, and hardware architecture teams. The ideal candidate should have strong C++ programming skills, experience with parallel programming, and a deep understanding of computer architecture. The position offers the opportunity to work at one of technology's most desirable employers, contributing to cutting-edge developments in AI and deep learning. The role is remote-friendly, with options to work from several European locations. This is a chance to join a team that's directly impacting the future of AI computing and hardware optimization.

Last updated a day ago

Responsibilities For Senior Performance Software Engineer, Deep Learning Libraries

  • Writing highly tuned compute kernels in C++ CUDA for core deep learning operations
  • Following software engineering best practices including regression testing and CI/CD flows
  • Collaborating with CUDA compiler team on generating optimal assembly code
  • Working with deep learning training and inference performance teams
  • Collaborating with hardware and architecture teams on programming models

Requirements For Senior Performance Software Engineer, Deep Learning Libraries

  • PhD in Computer Science, Computer Engineering, Applied Math, or related field OR Bachelors/Masters plus 4-6 years experience
  • Strong C++ programming and software design skills
  • Experience with performance-oriented parallel programming
  • Solid understanding of computer architecture and assembly programming experience

Interested in this job?

Jobs Related To NVIDIA Senior Performance Software Engineer, Deep Learning Libraries

Senior Deep Learning Software Engineer, Inference

Senior Deep Learning Software Engineer position at NVIDIA focusing on inference systems and AI technology development.

Senior Deep Learning Software Engineer, Inference

Senior Deep Learning Software Engineer position at NVIDIA focusing on inference optimization and development of GPU-accelerated AI software.

Developer Technology Engineer - HPC and AI

Senior Developer Technology Engineer position at NVIDIA focusing on HPC and AI, requiring 3+ years experience and expertise in parallel programming and algorithms.

Senior Prediction and Planning Machine Learning Engineer - Autonomous Vehicles

Senior ML Engineer role at NVIDIA focusing on prediction and planning systems for autonomous vehicles, combining AI expertise with automotive technology.

Senior Perception Engineer

Senior Perception Engineer position at NVIDIA focusing on developing and implementing perception algorithms for autonomous systems.