Senior Deep Learning Software Engineer, Inference

NVIDIA

NVIDIA is the world leader in accelerated computing, pioneering AI and digital twins technology.

Santa Clara, CA, USA

$148,000 - $287,500

Machine Learning

Senior Software Engineer

In-Person

5,000+ Employees

5+ years of experience

Job Description

NVIDIA is seeking a Senior Deep Learning Software Engineer specializing in inference to join its growing team. This role focuses on designing, building, and optimizing GPU-accelerated software that powers sophisticated AI applications. The position involves working with open-source frameworks and tools such as CUTLASS, OpenAI Triton, NCCL, and CUDA kernels to implement and optimize model serving pipelines.

The ideal candidate will have strong expertise in C/C++ programming, deep learning model optimization, and GPU programming. They will work closely with the deep learning community to implement cutting-edge algorithms for public release in inference frameworks. The role involves identifying and driving performance improvements for state-of-the-art LLM and Generative AI models across NVIDIA's range of accelerators.

NVIDIA offers highly competitive compensation, with base salary ranging from $148,000 to $287,500 depending on level and experience, plus equity and comprehensive benefits. The company is known for being one of the technology world's most desirable employers, with forward-thinking teams and outstanding growth opportunities.

The position is based in Santa Clara, CA, and requires at least 5 years of relevant software development experience. Key responsibilities include performance optimization of deep learning models, scaling solutions across different architectures, and contributing to NVIDIA's inference libraries. The role offers the opportunity to work at the forefront of AI technology, implementing solutions that power the next generation of AI applications.

NVIDIA values diversity and maintains an inclusive work environment, providing equal opportunities to all qualified candidates. The company's work in AI and digital twins is transforming major industries and making a significant societal impact. This role presents an excellent opportunity for those passionate about deep learning and high-performance computing to contribute to groundbreaking developments in AI technology.

Last updated 10 hours ago

Responsibilities For Senior Deep Learning Software Engineer, Inference

Performance optimization, analysis, and tuning of DL models in domains such as LLM, multimodal, and generative AI
Scale performance of DL models across different architectures and types of NVIDIA accelerators
Contribute features and code to NVIDIA's inference libraries, as well as vLLM, SGLang, FlashInfer, and LLM software solutions
Collaborate with cross-functional teams across frameworks, NVIDIA libraries, and innovative inference optimization solutions

Requirements For Senior Deep Learning Software Engineer, Inference

Python

Linux

Master's or PhD or equivalent experience in a relevant field (Computer Engineering, Computer Science, EECS, AI)
5+ years of relevant software development experience
Excellent C/C++ programming and software design skills
Agile software development skills and Python experience
Experience with training, deploying or optimizing the inference of DL models in production
Background in performance modeling, profiling, debugging, and code optimization
GPU programming experience (CUDA, OpenAI Triton, or CUTLASS)

Benefits For Senior Deep Learning Software Engineer, Inference

Equity
Competitive base salary
