Staff Software Engineer, ML Performance, GPUs

Google develops next-generation technologies that change how billions of users connect, explore, and interact with information and one another.
$197,000 - $291,000
Machine Learning
Staff Software Engineer
Hybrid
5,000+ Employees
8+ years of experience
AI

Description For Staff Software Engineer, ML Performance, GPUs

Google is seeking a Staff Software Engineer to join its ML, Systems, & Cloud AI organization, focusing on ML Performance and GPUs. This role is critical to optimizing Large Language Models (LLMs) and GPU infrastructure for Google's AI initiatives, including Gemini, Search, and Cloud services.

The position involves deep technical work with GPU programming, performance analysis, and machine learning systems. You'll be responsible for analyzing and optimizing LLM performance, maintaining benchmarks, and working closely with product teams to solve complex ML model performance challenges.

The role requires extensive experience in software development (8+ years), ML infrastructure optimization (5+ years), and a strong background in performance analysis and GPU programming. You'll be working in a global environment, contributing to Google's next-generation technologies that impact billions of users.

The base salary ranges from $197,000 to $291,000, plus bonus, equity, and comprehensive benefits. The position can be based in Kirkland, WA; Sunnyvale, CA; or Seattle, WA.

This is an excellent opportunity for a senior technical leader who wants to make a significant impact on Google's AI infrastructure and performance optimization, working with cutting-edge technology and some of the world's largest machine learning models.

Last updated 4 hours ago

Responsibilities For Staff Software Engineer, ML Performance, GPUs

  • Analyze Large Language Model (LLM) performance and optimizations for partner teams including Google Gemini, Search, Cloud LLM and APIs
  • Identify and maintain LLM training and serving benchmarks
  • Drive XLA:GPU/Triton performance and guide future XLA releases
  • Engage with Google Product teams to solve ML model performance challenges
  • Run architecture-level simulations on GPU designs
  • Analyze performance and efficiency metrics to identify bottlenecks

Requirements For Staff Software Engineer, ML Performance, GPUs

Python
  • Bachelor's degree or equivalent practical experience
  • 8 years of experience in software development and with data structures and algorithms
  • 5 years of experience testing and launching software products, and 3 years of experience with software design and architecture
  • 5 years of experience leading ML design and optimizing ML infrastructure
  • Experience with performance analysis and GPU programming
  • Experience with machine learning systems
  • Experience with architecture analysis and optimization

Benefits For Staff Software Engineer, ML Performance, GPUs

  • Bonus
  • Equity
  • Medical Insurance
  • Dental Insurance
  • Vision Insurance
