Taro Logo

Senior Deep Learning Software Engineer, Inference

NVIDIA is the world leader in accelerated computing, pioneering AI and digital twins technology.
$148,000 - $287,500
Machine Learning
Senior Software Engineer
In-Person
5,000+ Employees
5+ years of experience
AI

Job Description

NVIDIA is seeking a Senior Deep Learning Software Engineer specializing in inference to join their growing team. This role focuses on designing, building, and optimizing GPU-accelerated software that powers sophisticated AI applications. The position involves working with open-source frameworks and tools like CUTLASS, OAI Triton, NCCL, and CUDA kernels to implement and optimize model serving pipelines.

The ideal candidate will have strong expertise in C/C++ programming, deep learning model optimization, and GPU programming. They will work closely with the deep learning community to implement cutting-edge algorithms for public release in inference frameworks. The role involves identifying and driving performance improvements for state-of-the-art LLM and Generative AI models across NVIDIA's range of accelerators.

NVIDIA offers highly competitive compensation, with base salary ranging from $148,000 to $287,500 depending on level and experience, plus equity and comprehensive benefits. The company is known for being one of the technology world's most desirable employers, with forward-thinking teams and outstanding growth opportunities.

The position is based in Santa Clara, CA, and requires at least 5 years of relevant software development experience. Key responsibilities include performance optimization of deep learning models, scaling solutions across different architectures, and contributing to NVIDIA's inference libraries. The role offers the opportunity to work at the forefront of AI technology, implementing solutions that power the next generation of AI applications.

NVIDIA values diversity and maintains an inclusive work environment, providing equal opportunities to all qualified candidates. The company's work in AI and digital twins is transforming major industries and making a significant societal impact. This role presents an excellent opportunity for those passionate about deep learning and high-performance computing to contribute to groundbreaking developments in AI technology.

Last updated 10 hours ago

Responsibilities For Senior Deep Learning Software Engineer, Inference

  • Performance optimization, analysis, and tuning of DL models in various domains like LLM, Multimodal and Generative AI
  • Scale performance of DL models across different architectures and types of NVIDIA accelerators
  • Contribute features and code to NVIDIA's inference libraries, vLLM and SGLang, FlashInfer and LLM software solutions
  • Work with cross-collaborative teams across frameworks, NVIDIA libraries and inference optimization innovative solutions

Requirements For Senior Deep Learning Software Engineer, Inference

Python
Linux
  • Masters or PhD or equivalent experience in relevant field (Computer Engineering, Computer Science, EECS, AI)
  • 5+ years of relevant software development experience
  • Excellent C/C++ programming and software design skills
  • Software Agile skills and Python experience
  • Experience with training, deploying or optimizing the inference of DL models in production
  • Background with performance modeling, profiling, debug, and code optimization
  • GPU programming experience (CUDA, OAI TRITON or CUTLASS)

Benefits For Senior Deep Learning Software Engineer, Inference

Equity
  • Equity
  • Competitive base salary

Related Jobs

AI and ML Infra Software Engineer, GPU Clusters

Senior AI/ML Infrastructure Software Engineer role at NVIDIA, focusing on GPU cluster optimization and infrastructure development for AI research teams.

Senior DFX Software Engineer - Machine Learning

Senior DFX Software Engineer role at NVIDIA focusing on machine learning applications in silicon testing and failure analysis, offering competitive compensation and the opportunity to work with cutting-edge AI technology.

Senior Solution Engineer, AI Enterprise

Senior Solution Engineer position at NVIDIA focusing on AI Enterprise solutions, requiring 8+ years of experience in AI/ML, offering competitive compensation and opportunity to work with cutting-edge technology.

Senior Deep Learning Software Engineer, Inference

Senior Deep Learning Software Engineer position at NVIDIA focusing on inference optimization and high-performance computing for AI applications.

Senior Computer Vision System Performance Engineer

Senior Computer Vision System Performance Engineer role at NVIDIA, focusing on optimizing computer vision applications with Python and CUDA, offering competitive compensation and hybrid work arrangement.