Senior Software Engineer, Deep Learning Inference Workflows

NVIDIA

NVIDIA is the world leader in accelerated computing, pioneering solutions in AI and digital twins.

San Francisco, CA, USA

$184,000 - $356,500

Machine Learning

Senior Software Engineer

Hybrid

6+ years of experience

This job posting may no longer be active. You may be interested in these related jobs instead:

Description For Senior Software Engineer, Deep Learning Inference Workflows

NVIDIA, the world leader in accelerated computing, is seeking a Senior Software Engineer for their Deep Learning Inference Workflows team. This role focuses on building state-of-the-art inference frameworks for accelerating Deep Learning models, particularly Large Language Models, on NVIDIA GPUs. The position involves developing TensorRT components, NVIDIA's SDK for high-performance deep learning inference, using C++ and Python to create graph parsers, optimizers, and deployment tools.

The ideal candidate will have 6+ years of software development experience with strong C++ skills and deep understanding of Machine Learning concepts. They'll work with deep learning experts, GPU architects, and DevOps engineers in a collaborative environment. Additional valuable skills include system software development, GPU kernel programming with CUDA or OpenCL, and experience with ML frameworks like PyTorch and TensorFlow.

NVIDIA offers a competitive compensation package with a base salary range of $184,000 - $356,500 USD, plus equity and benefits. The company is known for being one of technology's most desirable employers, fostering innovation and tackling challenging problems in AI and digital twins that transform major industries. This hybrid role, based in Santa Clara, provides an opportunity to make significant contributions to NVIDIA's success in the rapidly growing field of deep learning inference.

Last updated 6 months ago

Responsibilities For Senior Software Engineer, Deep Learning Inference Workflows

Develop components of TensorRT, NVIDIA's SDK for high-performance deep learning inference
Use C++ and Python to build graph parsers, optimizers, and tools for effective deployment of trained deep learning models
Collaborate with teams of deep learning experts, GPU architects and DevOps engineers across diverse teams

Requirements For Senior Software Engineer, Deep Learning Inference Workflows

Python

Bachelor's, Master's, PhD or equivalent experience in Computer Science, Computer Engineering, Electrical Engineering or related field
6+ years of software development experience
Strong experience with C++11/C++14/C++17
Strong grasp of Machine Learning concepts, especially Natural Language Processing
Excellent communication skills, and an aptitude for collaboration and teamwork

Benefits For Senior Software Engineer, Deep Learning Inference Workflows

Equity

Equity
Benefits package available