Taro Logo

Senior DL Algorithms Engineer - Inference Performance

NVIDIA is the world leader in accelerated computing, pioneering AI and digital twins technology.
$148,000 - $287,500
Machine Learning
Senior Software Engineer
Hybrid
5,000+ Employees
3+ years of experience
AI

Description For Senior DL Algorithms Engineer - Inference Performance

NVIDIA is seeking a Senior DL Algorithms Engineer to optimize Deep Learning workloads and maximize performance across their hardware/software stack. This role is perfect for someone who thrives on performance analysis and optimization, working with cutting-edge AI technology. As part of NVIDIA's datacenter business, you'll play a crucial role in optimizing datacenter deployments and contributing to hardware design and system software development.

The position involves working with NVIDIA Inference Microservices (NIMs), delivering optimized DL inference solutions, and collaborating across multiple teams including DL research, CUDA Kernel development, and Silicon Architecture. You'll be responsible for analyzing performance characteristics, benchmarking state-of-the-art DL models, and developing tools to scale the delivery of optimized models.

NVIDIA offers a competitive base salary range of $148,000 - $287,500 USD, plus equity and benefits. The company is known for being one of technology's most desirable employers, with some of the industry's most innovative minds. They foster a diverse work environment and are committed to equal opportunity employment.

This role is ideal for candidates with a PhD in CS/EE or equivalent experience, strong background in deep learning and neural networks, and expertise in computer architecture. Experience with LLMs, VLMs, RAG, and drug discovery models is highly valued, as is knowledge of MLOps and GPU programming. You'll be at the forefront of AI innovation, working with cutting-edge technology and directly impacting NVIDIA's hardware and software roadmap.

Last updated 10 minutes ago

Responsibilities For Senior DL Algorithms Engineer - Inference Performance

  • Deliver hyper-optimized recipes for DL inference as part of NVIDIA Inference Microservices (NIMs)
  • Analyze, validate and debug performance and accuracy characteristics of optimized models
  • Benchmark state-of-the-art offerings in various DL models inference
  • Develop software, tooling and processes across multiple layers of the stack
  • Collaborate with SW/HW co-design teams

Requirements For Senior DL Algorithms Engineer - Inference Performance

Python
  • PhD in CS, EE or CSEE or equivalent experience
  • 3+ years of experience
  • Experience with delivering results under tight timelines
  • Strong background in deep learning and neural networks
  • Deep understanding of computer architecture
  • Programming skills in C++ and Python

Benefits For Senior DL Algorithms Engineer - Inference Performance

Equity
  • Equity

Interested in this job?

Jobs Related To NVIDIA Senior DL Algorithms Engineer - Inference Performance

Senior Deep Learning Software Engineer, LLM Performance

Senior Deep Learning Software Engineer role at NVIDIA focusing on LLM performance optimization, offering $184K-$356.5K salary plus equity and benefits.

Full Stack Developer, AI and LLM

Senior Full Stack Developer position at NVIDIA focusing on AI and LLM development, offering competitive salary and hybrid work arrangement in Santa Clara, CA.

Senior Software Engineer, TensorRT Inference

Senior Software Engineering role at NVIDIA focusing on TensorRT inference optimization and development.

Senior Deep Learning Algorithm Engineer

Senior Deep Learning Algorithm Engineer position at NVIDIA focusing on developing and implementing advanced deep learning algorithms.

Senior Software Engineer - AI Infrastructure

Senior Software Engineering role focused on AI Infrastructure at NVIDIA, working on developing and maintaining AI computing solutions.