Taro Logo

Senior Software Engineer, Deep Learning Inference Workflows

NVIDIA is the world leader in accelerated computing, pioneering solutions in AI and digital twins.
$184,000 - $356,500
Machine Learning
Senior Software Engineer
Hybrid
6+ years of experience
AI
This job posting may no longer be active. You may be interested in these related jobs instead:

Description For Senior Software Engineer, Deep Learning Inference Workflows

NVIDIA, the world leader in accelerated computing, is seeking a Senior Software Engineer for their Deep Learning Inference Workflows team. This role focuses on building state-of-the-art inference frameworks for accelerating Deep Learning models, particularly Large Language Models, on NVIDIA GPUs. The position involves developing TensorRT components, NVIDIA's SDK for high-performance deep learning inference, using C++ and Python to create graph parsers, optimizers, and deployment tools.

The ideal candidate will have 6+ years of software development experience with strong C++ skills and deep understanding of Machine Learning concepts. They'll work with deep learning experts, GPU architects, and DevOps engineers in a collaborative environment. Additional valuable skills include system software development, GPU kernel programming with CUDA or OpenCL, and experience with ML frameworks like PyTorch and TensorFlow.

NVIDIA offers a competitive compensation package with a base salary range of $184,000 - $356,500 USD, plus equity and benefits. The company is known for being one of technology's most desirable employers, fostering innovation and tackling challenging problems in AI and digital twins that transform major industries. This hybrid role, based in Santa Clara, provides an opportunity to make significant contributions to NVIDIA's success in the rapidly growing field of deep learning inference.

Last updated 6 months ago

Responsibilities For Senior Software Engineer, Deep Learning Inference Workflows

  • Develop components of TensorRT, NVIDIA's SDK for high-performance deep learning inference
  • Use C++ and Python to build graph parsers, optimizers, and tools for effective deployment of trained deep learning models
  • Collaborate with teams of deep learning experts, GPU architects and DevOps engineers across diverse teams

Requirements For Senior Software Engineer, Deep Learning Inference Workflows

Python
  • Bachelor's, Master's, PhD or equivalent experience in Computer Science, Computer Engineering, Electrical Engineering or related field
  • 6+ years of software development experience
  • Strong experience with C++11/C++14/C++17
  • Strong grasp of Machine Learning concepts, especially Natural Language Processing
  • Excellent communication skills, and an aptitude for collaboration and teamwork

Benefits For Senior Software Engineer, Deep Learning Inference Workflows

Equity
  • Equity
  • Benefits package available

Interested in this job?