Senior Software Engineer, Deep Learning Inference

NVIDIA is the world leader in accelerated computing, pioneering AI and digital twins technology.
Machine Learning
Senior Software Engineer
In-Person
5+ years of experience
AI

Description For Senior Software Engineer, Deep Learning Inference

NVIDIA, the pioneer in accelerated computing and AI technology, is seeking a Senior Software Engineer specialized in Deep Learning Inference. This role is at the forefront of AI innovation, working with the latest generative AI models and LLMs. The position involves optimizing performance at all stack levels, from server-level request batching to GPU kernel fusion. You'll be working with cutting-edge technology, collaborating with research teams to implement and optimize AI runtimes, and developing sophisticated software solutions. The ideal candidate combines strong software engineering principles with deep ML knowledge and performance optimization expertise. NVIDIA offers a collaborative environment working with some of the most forward-thinking professionals in the technology field. The role provides an opportunity to impact the future of AI computing while working with state-of-the-art hardware and software. The company is committed to diversity and inclusion, fostering an environment where creativity and autonomy are highly valued. This position is perfect for those passionate about pushing the boundaries of AI technology and performance optimization.

Last updated 3 days ago

Responsibilities For Senior Software Engineer, Deep Learning Inference

  • Cooperate with research teams to onboard new LLMs and VLMs into Nvidia's opensource AI runtimes
  • Optimize inference workloads using sophisticated profiling and simulation tools
  • Build SOLID, extendable inference software systems, and refine robust APIs
  • Implement and debug low-level GPU code to harness the latest HW features
  • Own end-to-end inference acceleration features and work with teams around the world

Requirements For Senior Software Engineer, Deep Learning Inference

Python
  • B.Sc., M.Sc. or equivalent experience in Computer Science or Computer Engineering
  • 5+ years of relevant hands-on software engineering experience
  • Profound knowledge of software design principles
  • Strong proficiency in at least one system and one scripting language
  • Strong grasp of machine learning concepts
  • People person with excellent communication skills that enjoys collaboration and teamwork

Interested in this job?

Jobs Related To NVIDIA Senior Software Engineer, Deep Learning Inference

Senior Scientific Machine Learning Software Engineer - Physics

Senior ML Engineer role at NVIDIA developing AI frameworks for scientific computing and digital twins, offering competitive salary and cutting-edge work environment.

Python Software Engineer, GPU - Accelerated LLM Data Applications

Senior Python Software Engineer role at NVIDIA focusing on GPU-accelerated LLM data applications and machine learning solutions.

Senior DevTech Engineer - Windows LLM and GenAI Open-Source Ecosystem

Senior DevTech Engineer role at NVIDIA focusing on Windows LLM and GenAI optimization, requiring 5+ years of GPU experience and strong AI expertise.

Senior On-Device Model Inference Optimization Engineer

Senior role focusing on optimizing AI model inference for autonomous vehicles at NVIDIA

DevTech Engineer - Windows LLM and GenAI Open-Source Ecosystem

Senior DevTech Engineer position at NVIDIA focusing on Windows LLM and GenAI optimization, requiring 5+ years of GPU deployment experience and strong AI knowledge.