Staff Software Engineer, ML Performance, GPUs

A global technology company that develops AI, search, cloud computing, software and online advertising technologies.
$197,000 - $291,000
Machine Learning
Staff Software Engineer
In-Person
5,000+ Employees
8+ years of experience
AI · Enterprise SaaS

Description For Staff Software Engineer, ML Performance, GPUs

Google is seeking a Staff Software Engineer specializing in ML Performance and GPUs to join its ML, Systems, & Cloud AI (MSCA) organization. The role is central to optimizing Large Language Model (LLM) workloads, including Google Gemini, Search, and Cloud LLM applications. The position involves deep technical work in GPU programming, performance analysis, and machine learning infrastructure optimization.

The role sits within Google's MSCA organization, which is responsible for designing and implementing the hardware, software, machine learning, and systems infrastructure that powers all Google services and Google Cloud. This team's work has global impact, from developing TPUs to running global networks and shaping the future of hyperscale computing.

As a Staff Software Engineer, you'll be working on critical performance optimization for Google's most advanced AI models. You'll collaborate with product teams to solve complex ML model performance challenges, particularly in scaling LLM training across thousands of GPUs. The role requires deep expertise in both software engineering and machine learning systems, with a focus on performance analysis and optimization.

The position offers competitive compensation, including a base salary range of $197,000-$291,000 plus bonus, equity, and comprehensive benefits. This is an excellent opportunity for experienced engineers who want to work at the cutting edge of AI and machine learning infrastructure and make a direct impact on Google's most important AI initiatives.

The ideal candidate will bring strong technical depth in software development, machine learning systems, and GPU programming, combined with the leadership skills needed to drive technical direction and collaborate across Google's complex organization. This role offers the chance to shape the future of AI infrastructure at one of the world's leading technology companies.

Last updated 5 days ago

Responsibilities For Staff Software Engineer, ML Performance, GPUs

  • Analyze Large Language Model (LLM) performance and optimizations for partner teams
  • Identify and maintain LLM training and serving benchmarks
  • Engage with Google Product teams to solve ML model performance challenges
  • Run architecture-level simulations on GPU designs and perform roofline analysis
  • Analyze performance and efficiency metrics to identify bottlenecks, then design and implement solutions
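
For readers unfamiliar with the roofline analysis named in the responsibilities above, here is a minimal Python sketch of the idea. It is an illustration only and not taken from the posting: the function names, the accelerator numbers (PEAK_FLOPS, MEM_BANDWIDTH), and the simplified matrix-multiply traffic model are all assumptions.

```python
# Roofline model sketch. All hardware numbers below are hypothetical.

def attainable_flops(peak_flops, mem_bandwidth, arithmetic_intensity):
    """Upper bound on throughput (FLOP/s) for a kernel.

    arithmetic_intensity is FLOPs performed per byte moved to or from memory.
    The kernel is limited by whichever roof is lower: compute or memory.
    """
    return min(peak_flops, mem_bandwidth * arithmetic_intensity)


def ridge_point(peak_flops, mem_bandwidth):
    """Arithmetic intensity (FLOPs/byte) at which the two roofs meet."""
    return peak_flops / mem_bandwidth


def matmul_intensity(m, n, k, bytes_per_element=2):
    """Arithmetic intensity of an (m x k) @ (k x n) matrix multiply.

    Assumes each operand and the result cross the memory bus exactly once,
    which is an idealized traffic model.
    """
    flops = 2 * m * n * k
    bytes_moved = bytes_per_element * (m * k + k * n + m * n)
    return flops / bytes_moved


# Hypothetical accelerator: 100 TFLOP/s peak compute, 1 TB/s memory bandwidth.
PEAK_FLOPS = 100e12
MEM_BANDWIDTH = 1e12

if __name__ == "__main__":
    ridge = ridge_point(PEAK_FLOPS, MEM_BANDWIDTH)
    for shape in [(1, 4096, 4096), (4096, 4096, 4096)]:
        ai = matmul_intensity(*shape)
        bound = attainable_flops(PEAK_FLOPS, MEM_BANDWIDTH, ai)
        regime = "compute-bound" if ai >= ridge else "memory-bound"
        print(f"shape={shape} intensity={ai:.2f} FLOPs/byte "
              f"bound={bound / 1e12:.2f} TFLOP/s ({regime})")
```

Under these assumed numbers, a single-row multiply (typical of one-token-at-a-time LLM decoding) falls well below the ridge point and is limited by memory bandwidth, while a large square multiply sits above it and is limited by peak compute.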

Requirements For Staff Software Engineer, ML Performance, GPUs

Skills: Python
  • Bachelor's degree or equivalent practical experience
  • 8 years of experience in software development and with data structures and algorithms
  • 5 years of experience testing and launching software products
  • 3 years of experience with software design and architecture
  • 5 years of experience leading ML design and optimizing ML infrastructure
  • Experience with performance analysis and GPU programming

Benefits For Staff Software Engineer, ML Performance, GPUs

  • Medical insurance
  • 401(k)
  • Parental leave
  • Bonus
  • Equity
  • Benefits package
