Taro Logo

Senior Software Engineer, Deep Learning Inference

NVIDIA is the world leader in accelerated computing, pioneering AI and digital twins technology.
Machine Learning
Senior Software Engineer
In-Person
5,000+ Employees
5+ years of experience
AI

Job Description

NVIDIA, the pioneer in accelerated computing and AI technology, is seeking a Senior Software Engineer specializing in Deep Learning Inference. This role sits at the intersection of cutting-edge AI development and performance optimization, working with the latest generative AI models. The position involves building software solutions for efficient inference, from server-level request batching to GPU kernel fusion. You'll be working with NVIDIA's opensource AI runtimes, including Triton Inference Server and TensorRT-LLM, optimizing inference workloads, and implementing low-level GPU code. The ideal candidate combines strong software engineering principles with deep machine learning knowledge and performance optimization expertise. You'll collaborate with global teams, contributing to production-grade products that push the boundaries of AI acceleration. NVIDIA offers the opportunity to work with some of the most forward-thinking professionals in technology, in an environment that values creativity, autonomy, and diversity. This role is perfect for someone passionate about both software engineering excellence and advancing the state of AI technology.

Last updated 2 months ago

Responsibilities For Senior Software Engineer, Deep Learning Inference

  • Cooperate with research teams to onboard new LLMs and VLMs into Nvidia's opensource AI runtimes
  • Optimize inference workloads using sophisticated profiling and simulation tools
  • Build SOLID, extendable inference software systems, and refine robust APIs
  • Implement and debug low-level GPU code to harness the latest HW features
  • Own end-to-end inference acceleration features and work with teams around the world

Requirements For Senior Software Engineer, Deep Learning Inference

Python
  • B.Sc., M.Sc. or equivalent experience in Computer Science or Computer Engineering
  • 5+ years of relevant hands-on software engineering experience
  • Profound knowledge of software design principles
  • Strong proficiency in at least one system and one scripting language
  • Strong grasp of machine learning concepts
  • People person with excellent communication skills that enjoys collaboration and teamwork

Related Jobs

Senior Software Engineer, AI Agent Platform

Senior Software Engineer position at NVIDIA focusing on AI Agent Platform development, requiring 7+ years of experience and expertise in Python, AI/ML frameworks, and system architecture.

Senior Software Engineer, AI Agent Platform

Senior Software Engineer position at NVIDIA focusing on AI Agent Platform development, requiring 7+ years of experience and expertise in Python, AI/ML frameworks, and system architecture.

Senior Software Engineer, AI-Driven Performance Engineering

Senior Software Engineering role at NVIDIA focusing on AI-driven performance engineering, combining ML expertise with systems engineering to develop autonomous benchmarking and analysis tools.

Senior Software Engineer, AI-Driven Performance Engineering

Senior Software Engineering role at NVIDIA focusing on AI-driven performance engineering, combining systems engineering with cutting-edge generative AI to optimize networking technologies.

Senior Gen AI Software Engineer

Senior Gen AI Software Engineer position at Dell Technologies, focusing on AI-powered backend systems and engineering workflows, based in Israel.