AI and ML Performance Engineer

NVIDIA

NVIDIA is the world leader in accelerated computing, pioneering solutions in AI and digital twins that transform industries.

Redmond, WA, USA

$148,000 - $287,500

Machine Learning

Senior Software Engineer

In-Person

5,000+ Employees

3+ years of experience

This job posting may no longer be active. You may be interested in these related jobs instead:

Description For AI and ML Performance Engineer

NVIDIA, the world leader in accelerated computing, is seeking an AI/ML Performance Engineer to drive the development of next-generation inference optimizations. This role sits at the intersection of artificial intelligence advancement and system performance optimization, focusing on developing scalable inference strategies and cross-stack optimizations. The position involves working with cutting-edge AI technologies, including attention mechanisms, speculative decoding, and system-level techniques for model deployment.

The role requires collaboration across multiple teams, including deep learning research, framework development, compiler engineering, and silicon architecture. The successful candidate will be responsible for developing performance models, designing optimizations for inference deployment, and quantifying performance benefits to guide future software and hardware roadmaps. This position offers an opportunity to shape the future of datacenter technology and AI infrastructure at one of technology's most innovative companies.

The ideal candidate should possess strong technical skills in computer architecture and AI/ML systems, with experience in performance analysis and optimization. They should be comfortable with Python programming and have a solid understanding of LLM internals. The role offers competitive compensation, including a base salary range of $148,000 - $287,500, equity, and additional benefits.

NVIDIA's commitment to diversity and inclusion, combined with its position at the forefront of AI innovation, makes this an excellent opportunity for someone looking to make a significant impact in the field of AI and machine learning optimization. The role offers exposure to the latest developments in AI technology while working with some of the industry's most forward-thinking professionals.

Last updated 2 months ago

Responsibilities For AI and ML Performance Engineer

Develop high-fidelity performance models for Generative AI
Design optimizations for inference deployment
Quantify performance benefits of targeted optimizations
Model end-to-end performance impact of emerging GenAI workflows
Collaborate with DL researchers, hardware architects, and software engineers

Requirements For AI and ML Performance Engineer

Python

Master's degree in Computer Science, Electrical Engineering or related fields
Strong background in computer architecture, roofline modeling, queuing theory
Solid understanding of LLM internals
3+ years of hands-on experience in system evaluation of AI/ML workloads
Proficiency in Python
Growth mindset and pragmatic approach

Benefits For AI and ML Performance Engineer

Equity

Equity
Competitive base salary
Additional benefits (not specified in detail)