Taro Logo

Senior Deep Learning Software Engineer, LLM Performance

NVIDIA is the world leader in accelerated computing and AI technology.
$184,000 - $356,500
Machine Learning
Senior Software Engineer
Hybrid
5,000+ Employees
8+ years of experience
AI

Description For Senior Deep Learning Software Engineer, LLM Performance

NVIDIA is seeking an experienced Senior Deep Learning Engineer focused on analyzing and improving LLM inference performance. As a key member of the team building GPU-accelerated Deep Learning software, you'll work on cutting-edge technologies like TensorRT, DL benchmarking software, and performant solutions for model deployment and serving. The role involves collaborating with the deep learning community to implement latest algorithms in TensorRT LLM, VLLM, and SGLang while identifying and optimizing performance opportunities across NVIDIA's accelerator spectrum. You'll be working at the intersection of deep learning and high-performance computing, contributing to NVIDIA's position as a leader in AI computing. The role offers competitive compensation, including a base salary range of $184,000-$356,500 USD, plus equity and benefits. This is an excellent opportunity to join a company at the forefront of AI and accelerated computing, working on technology that's transforming industries and society. The hybrid work environment and collaborative culture make this an ideal position for someone passionate about deep learning and performance optimization.

Last updated 23 minutes ago

Responsibilities For Senior Deep Learning Software Engineer, LLM Performance

  • Performance optimization, analysis, and tuning of LLM, VLM and GenAI models for DL inference, serving and deployment
  • Scale performance of LLM models across different architectures and types of NVIDIA accelerators
  • Scale performance for max throughput, minimum latency and throughput under latency constraints
  • Contribute features and code to NVIDIA/OSS LLM frameworks, inference benchmarking frameworks, TensorRT, and Triton
  • Work with cross-collaborative teams across generative AI, automotive, image understanding, and speech understanding

Requirements For Senior Deep Learning Software Engineer, LLM Performance

Python
Linux
  • Bachelors, Masters, PhD, or equivalent experience in relevant fields (Computer Engineering, Computer Science, EECS, AI)
  • At least 8 years of relevant software development experience
  • Excellent Python/C/C++ programming, software design and software engineering skills
  • Experience with a DL framework like PyTorch, JAX, TensorFlow

Benefits For Senior Deep Learning Software Engineer, LLM Performance

Equity
Medical Insurance
  • Equity
  • Medical Insurance

Interested in this job?

Jobs Related To NVIDIA Senior Deep Learning Software Engineer, LLM Performance

Senior DL Algorithms Engineer - Inference Performance

Senior DL Algorithms Engineer position at NVIDIA focusing on optimizing Deep Learning workloads and inference performance, offering competitive compensation and opportunity to work with cutting-edge AI technology.

Full Stack Developer, AI and LLM

Senior Full Stack Developer position at NVIDIA focusing on AI and LLM development, offering competitive salary and hybrid work arrangement in Santa Clara, CA.

Senior Software Engineer, TensorRT Inference

Senior Software Engineering role at NVIDIA focusing on TensorRT inference optimization and development.

Senior Deep Learning Algorithm Engineer

Senior Deep Learning Algorithm Engineer position at NVIDIA focusing on developing and implementing advanced deep learning algorithms.

Senior Software Engineer - AI Infrastructure

Senior Software Engineering role focused on AI Infrastructure at NVIDIA, working on developing and maintaining AI computing solutions.