Senior Software Engineer, Deep Learning Inference, TensorRT

NVIDIA is the world leader in accelerated computing, pioneering AI and digital twins technology.
$148,000 - $287,500
Machine Learning
Senior Software Engineer
Hybrid
5,000+ Employees
3+ years of experience
AI

Description For Senior Software Engineer, Deep Learning Inference, TensorRT

NVIDIA, the world leader in accelerated computing, is seeking a Senior Software Engineer to join their Deep Learning Inference TensorRT software team. This role presents an exciting opportunity to make a significant impact in the field of Deep Learning by developing state-of-the-art inference frameworks for accelerating Deep Learning models, particularly Large Language Models, on NVIDIA GPUs.

The position involves working with cutting-edge technology at the intersection of deep learning and high-performance computing. As a Senior Software Engineer, you'll be responsible for developing components of TensorRT, NVIDIA's SDK for high-performance deep learning inference, using C++ and Python to build graph parsers, optimizers, and deployment tools for trained deep learning models.

The ideal candidate brings at least 3 years of software development experience, with strong expertise in C++11/C++14/C++17 and a solid foundation in Machine Learning concepts. Knowledge of Computer Architecture, Data Structures, and Algorithms is essential. The role requires excellent communication skills and the ability to collaborate effectively with diverse teams of deep learning experts, GPU architects, and DevOps engineers.

What makes this opportunity particularly compelling is the chance to work at NVIDIA, widely recognized as one of the technology world's most desirable employers. The company offers competitive compensation, including a base salary range of $148,000 - $287,500 USD, equity, and comprehensive benefits. The position is hybrid, based in Santa Clara, CA, allowing for both collaborative in-person work and flexible remote options.

Additional valuable skills include experience with system software development, GPU kernel programming using CUDA or OpenCL, software performance optimization, compiler development, and familiarity with ML frameworks like TensorRT, PyTorch, TensorFlow, and ONNX Runtime. This role offers the opportunity to work on real-world applications of deep learning technology while contributing to NVIDIA's mission of transforming industries through AI and accelerated computing.

Last updated a day ago

Responsibilities For Senior Software Engineer, Deep Learning Inference, TensorRT

  • Develop components of TensorRT, NVIDIA's SDK for high-performance deep learning inference
  • Use C++ and Python to build graph parsers, optimizers, and tools for deployment of trained deep learning models
  • Collaborate with teams of deep learning experts, GPU architects and DevOps engineers

Requirements For Senior Software Engineer, Deep Learning Inference, TensorRT

Python
  • Bachelor's, Master's, PhD or equivalent experience in Computer Science, Computer Engineering, Electrical Engineering or related field
  • 3+ years of software development experience
  • Strong experience with C++11/C++14/C++17
  • Strong grasp of Machine Learning concepts
  • Experience and knowledge in Computer Architecture, Data Structures, Algorithms
  • Excellent communication skills, and an aptitude for collaboration and teamwork

Benefits For Senior Software Engineer, Deep Learning Inference, TensorRT

Equity
  • Equity
  • Benefits package available

Interested in this job?

Jobs Related To NVIDIA Senior Software Engineer, Deep Learning Inference, TensorRT

Deep Learning Performance Architect

Senior Deep Learning Performance Architect position at NVIDIA, focusing on AI workload optimization and hardware architecture analysis, requiring 2+ years of experience in parallel computing.

AI Computing Software Development Engineer, TensorRT

Senior AI Software Engineer role at NVIDIA developing TensorRT inferencing software, requiring 3+ years experience and expertise in C++, deep learning frameworks, and AI technologies.

Deep Learning Performance Architect

Senior Deep Learning Performance Architect role at NVIDIA, focusing on optimizing deep learning kernels and developing GPU-accelerated AI software solutions.

Senior Software Engineer - Conversational AI

Senior Software Engineer role at NVIDIA focusing on building next-generation Multimodal Conversational AI systems using Speech and LLM models.

Senior Software Engineer, TensorRT-LLM

Senior Software Engineer position at NVIDIA focusing on TensorRT-LLM development, offering competitive salary and the opportunity to work on cutting-edge AI technology.