Senior Software Engineer, Deep Learning Inference

NVIDIA is the world leader in accelerated computing and AI technology.
Machine Learning
Senior Software Engineer
In-Person
5,000+ Employees
5+ years of experience
AI

Description For Senior Software Engineer, Deep Learning Inference

NVIDIA, the pioneer in accelerated computing and AI technology, is seeking a Senior Software Engineer specializing in Deep Learning Inference. This role sits at the intersection of AI innovation and performance optimization, working with cutting-edge generative AI models. The position involves building software solutions that enable efficient inference on state-of-the-art models, tackling challenges across the entire stack from server-level request batching to GPU kernel fusion. The ideal candidate will collaborate with research teams to integrate new Large Language Models (LLMs) and Vision Language Models (VLMs) into NVIDIA's opensource AI runtimes, optimize inference workloads, and build robust, scalable systems. The role requires expertise in performance optimization, strong software engineering principles, and deep understanding of machine learning concepts. Working at NVIDIA offers the opportunity to be at the forefront of AI innovation, collaborating with world-class teams to push the boundaries of what's possible with hardware acceleration. The company is known for its inclusive culture and commitment to diversity, offering a chance to work on transformative technology that impacts industries worldwide. This position is based in Tel Aviv, Israel, and requires hands-on experience with GPU programming and AI frameworks.

Last updated 7 hours ago

Responsibilities For Senior Software Engineer, Deep Learning Inference

  • Cooperate with research teams to onboard new LLMs and VLMs into Nvidia's opensource AI runtimes
  • Optimize inference workloads using sophisticated profiling and simulation tools
  • Build SOLID, extendable inference software systems, and refine robust APIs
  • Implement and debug low-level GPU code to harness the latest HW features
  • Own end-to-end inference acceleration features and work with teams around the world to deliver production-grade products

Requirements For Senior Software Engineer, Deep Learning Inference

Python
  • B.Sc., M.Sc. or equivalent experience in Computer Science or Computer Engineering
  • 5+ years of relevant hands-on software engineering experience
  • Profound knowledge of software design principles
  • Strong proficiency in at least one system and one scripting language
  • Strong grasp of machine learning concepts
  • People person with excellent communication skills that enjoys collaboration and teamwork

Interested in this job?

Jobs Related To NVIDIA Senior Software Engineer, Deep Learning Inference

Senior Software Engineer, Deep Learning Inference

Senior Software Engineer position at NVIDIA focusing on Deep Learning Inference optimization and implementation, requiring 5+ years of experience in software engineering and machine learning.

AI Computing Software Development Engineer, TensorRT

Senior AI Software Engineer role at NVIDIA developing TensorRT inferencing software, requiring 3+ years experience and expertise in C++, deep learning frameworks, and AI technologies.

Deep Learning Performance Architect

Senior Deep Learning Performance Architect position at NVIDIA, focusing on optimizing deep learning kernels and developing GPU-accelerated software solutions.

Senior Performance Engineer - Deep Learning

Senior Performance Engineer position at NVIDIA focusing on optimizing Deep Learning tools and frameworks, offering competitive salary and opportunity to work with cutting-edge AI technology.

Deep-Learning Software Engineer, Performance Optimization

Senior Deep Learning Software Engineer role at NVIDIA focusing on AI performance optimization and implementation, requiring 5+ years of experience in machine learning and software development.