Senior Software Engineer, Deep Learning Inference

NVIDIA is the world leader in accelerated computing, pioneering solutions in AI and digital twins.
$184,000 - $287,500
Machine Learning
Senior Software Engineer
Hybrid
5,000+ Employees
6+ years of experience
AI

Description For Senior Software Engineer, Deep Learning Inference

NVIDIA, the world leader in accelerated computing, is seeking a Senior Software Engineer to join its Deep Learning Inference team. This role is an opportunity to make a significant impact in the field of Deep Learning by developing state-of-the-art inference frameworks for Large Language Models (LLMs) on NVIDIA GPUs.

The position involves working with TensorRT-LLM, NVIDIA's premier library for optimizing LLM inference performance. You'll be at the forefront of AI technology, collaborating with diverse teams of deep learning experts, GPU architects, and DevOps engineers, while also engaging with the broader deep learning community through open-source development.

The ideal candidate brings 6+ years of software development experience, strong Python skills, and a deep understanding of Machine Learning concepts, particularly in Large Language Models. Experience with C++, open-source development, and frameworks like vLLM, TensorRT, PyTorch, or JAX would be highly valuable. The role offers a competitive base salary range of $184,000 - $287,500 USD, plus equity and benefits.

NVIDIA's culture emphasizes creativity, autonomy, and technical innovation. As part of one of technology's most desirable employers, you'll help build the computing platforms driving success in AI and deep learning. The company is committed to diversity and inclusion, fostering an environment where all employees can thrive and contribute to solving challenges no one else can tackle.

Located in Santa Clara, CA, with a hybrid work arrangement, this position offers the opportunity to work with cutting-edge technology while contributing to NVIDIA's mission of transforming the world's largest industries through AI and digital twins. If you're passionate about deep learning and software optimization and want to work with some of the industry's brightest minds, this role is a chance to advance your career while making meaningful contributions to the field.

Responsibilities For Senior Software Engineer, Deep Learning Inference

  • Develop components of TensorRT-LLM for optimizing LLM inference performance on NVIDIA GPUs
  • Provide expert solutions to internal and external TensorRT-LLM users
  • Manage TensorRT-LLM's Open Source Software repo on GitHub
  • Collaborate with deep learning experts, GPU architects and DevOps engineers

Requirements For Senior Software Engineer, Deep Learning Inference

  • Bachelor's, Master's, PhD or equivalent experience in Computer Science, Computer Engineering, Electrical Engineering or related field
  • 6+ years of software development experience
  • Strong experience with Python
  • Strong grasp of Machine Learning concepts, especially related to Large Language Models
  • Excellent communication skills and aptitude for collaboration

Benefits For Senior Software Engineer, Deep Learning Inference

  • Equity
  • Benefits package offered

Jobs Related To NVIDIA Senior Software Engineer, Deep Learning Inference

Senior Software Engineer, Digital Human Technology

Senior Software Engineer position at NVIDIA focusing on digital human technology, combining AI research with high-performance computing implementation.

Senior Software Engineer, Robotics Foundation Models

Senior Software Engineer position at NVIDIA focusing on robotics foundation models, AI, and computer vision.

Senior Software Engineer, Deep Learning Inference

Senior Software Engineer position at NVIDIA focusing on developing and optimizing Deep Learning inference frameworks, particularly for Large Language Models on NVIDIA GPUs.

Senior Software Engineer, Robotics Foundation Models

Senior Software Engineer role at NVIDIA focusing on robotics foundation models, AI, and computer vision for humanoid robots.