Senior Software Engineer, Deep Learning Inference

NVIDIA

NVIDIA is the world leader in accelerated computing and AI technology.

Tel Aviv-Yafo, Israel

Machine Learning

Senior Software Engineer

In-Person

5,000+ Employees

5+ years of experience

Description For Senior Software Engineer, Deep Learning Inference

NVIDIA, the pioneer in accelerated computing and AI technology, is seeking a Senior Software Engineer specializing in Deep Learning Inference. This role sits at the intersection of AI innovation and performance optimization, working with cutting-edge generative AI models. The position involves building software solutions that enable efficient inference on state-of-the-art models, tackling challenges across the entire stack from server-level request batching to GPU kernel fusion. The ideal candidate will collaborate with research teams to integrate new Large Language Models (LLMs) and Vision Language Models (VLMs) into NVIDIA's opensource AI runtimes, optimize inference workloads, and build robust, scalable systems. The role requires expertise in performance optimization, strong software engineering principles, and deep understanding of machine learning concepts. Working at NVIDIA offers the opportunity to be at the forefront of AI innovation, collaborating with world-class teams to push the boundaries of what's possible with hardware acceleration. The company is known for its inclusive culture and commitment to diversity, offering a chance to work on transformative technology that impacts industries worldwide. This position is based in Tel Aviv, Israel, and requires hands-on experience with GPU programming and AI frameworks.

Last updated 7 hours ago

Responsibilities For Senior Software Engineer, Deep Learning Inference

Cooperate with research teams to onboard new LLMs and VLMs into Nvidia's opensource AI runtimes
Optimize inference workloads using sophisticated profiling and simulation tools
Build SOLID, extendable inference software systems, and refine robust APIs
Implement and debug low-level GPU code to harness the latest HW features
Own end-to-end inference acceleration features and work with teams around the world to deliver production-grade products

Requirements For Senior Software Engineer, Deep Learning Inference

Python

B.Sc., M.Sc. or equivalent experience in Computer Science or Computer Engineering
5+ years of relevant hands-on software engineering experience
Profound knowledge of software design principles
Strong proficiency in at least one system and one scripting language
Strong grasp of machine learning concepts
People person with excellent communication skills that enjoys collaboration and teamwork

NVIDIA

NVIDIA is the world leader in accelerated computing and AI technology.

Tel Aviv-Yafo, Israel

Machine Learning

Senior Software Engineer

In-Person

5,000+ Employees

5+ years of experience

Interested in this job?

Jobs Related To NVIDIA Senior Software Engineer, Deep Learning Inference

Senior Software Engineer, Deep Learning Inference

NVIDIA

Senior Software Engineer position at NVIDIA focusing on Deep Learning Inference optimization and implementation, requiring 5+ years of experience in software engineering and machine learning.

AI Computing Software Development Engineer, TensorRT

NVIDIA

Senior AI Software Engineer role at NVIDIA developing TensorRT inferencing software, requiring 3+ years experience and expertise in C++, deep learning frameworks, and AI technologies.

Deep Learning Performance Architect

NVIDIA

Senior Deep Learning Performance Architect position at NVIDIA, focusing on optimizing deep learning kernels and developing GPU-accelerated software solutions.

Senior Performance Engineer - Deep Learning

NVIDIA

Senior Performance Engineer position at NVIDIA focusing on optimizing Deep Learning tools and frameworks, offering competitive salary and opportunity to work with cutting-edge AI technology.

Deep-Learning Software Engineer, Performance Optimization

NVIDIA

Senior Deep Learning Software Engineer role at NVIDIA focusing on AI performance optimization and implementation, requiring 5+ years of experience in machine learning and software development.