Taro Logo

Senior Software Engineer, Deep Learning Inference

NVIDIA is the world leader in accelerated computing, pioneering solutions in AI and digital twins.
$184,000 - $287,500
Machine Learning
Senior Software Engineer
Hybrid
5,000+ Employees
6+ years of experience
AI

Description For Senior Software Engineer, Deep Learning Inference

NVIDIA is seeking a Senior Software Engineer to join their Deep Learning Inference team, focusing on building and optimizing their state-of-the-art inference framework TensorRT-LLM. This role combines cutting-edge AI technology with high-performance computing, specifically targeting Large Language Models (LLMs) on NVIDIA GPUs. The position offers an opportunity to work with industry-leading technology while contributing to open-source development.

The role involves developing components for TensorRT-LLM, NVIDIA's premier library for optimizing LLM inference performance, managing the open-source repository, and providing expert technical support to users. You'll collaborate with diverse teams including deep learning experts, GPU architects, and DevOps engineers, both within NVIDIA and the broader deep learning community.

Ideal candidates should bring 6+ years of software development experience, strong Python skills, and deep understanding of Machine Learning concepts, particularly in LLMs. Experience with C++, open-source development, and ML frameworks like vLLM, TensorRT, PyTorch, or JAX is highly valued. The position offers competitive compensation ranging from $184,000 to $287,500, plus equity and benefits.

NVIDIA is renowned as one of technology's most desirable employers, offering the chance to work with forward-thinking professionals in a collaborative environment. This role presents an excellent opportunity for those passionate about AI and high-performance computing to make significant contributions to the field of deep learning while working with cutting-edge technology at a leading company.

Last updated a day ago

Responsibilities For Senior Software Engineer, Deep Learning Inference

  • Develop components of TensorRT-LLM for optimizing LLM inference performance on NVIDIA GPUs
  • Provide expert solutions to internal and external TensorRT-LLM users
  • Manage TensorRT-LLM's Open Source Software repo on GitHub
  • Collaborate with deep learning experts, GPU architects and DevOps engineers

Requirements For Senior Software Engineer, Deep Learning Inference

Python
  • Bachelor's, Master's, PhD or equivalent experience in Computer Science, Computer Engineering, Electrical Engineering or related field
  • 6+ years of software development experience
  • Strong experience with Python
  • Strong grasp of Machine Learning concepts, especially related to Large Language Models
  • Excellent communication skills and aptitude for collaboration

Benefits For Senior Software Engineer, Deep Learning Inference

Equity
  • Equity
  • Benefits package available at nvidia.com/benefits

Interested in this job?

Jobs Related To NVIDIA Senior Software Engineer, Deep Learning Inference

Senior Software Engineer - Robotics and AI

Senior Software Engineer position at NVIDIA focusing on robotics and AI, developing advanced solutions for humanoid robots and embodied agents.

Senior Performance Software Engineer, Deep Learning Libraries

Senior Performance Software Engineer role at NVIDIA focusing on optimizing deep learning libraries and GPU performance, offering competitive compensation and opportunity to work on cutting-edge AI technology.

Senior Software Engineer, Digital Human Technology

Senior Software Engineer position at NVIDIA focusing on Digital Human Technology, combining AI research with software engineering to develop cutting-edge digital human solutions.

Senior Deep Learning Software Engineer, LLM Performance

Senior Deep Learning Software Engineer position at NVIDIA focusing on LLM performance optimization, offering competitive salary and opportunity to work with cutting-edge AI technology.

Senior Software Engineer, Robotics Foundation Models

Senior Software Engineer role at NVIDIA focusing on robotics foundation models, AI, and computer vision for humanoid robots.