Taro Logo

Staff Software Engineer, ML Performance, GPUs

Google develops next-generation technologies that change how billions of users connect, explore, and interact with information and one another.
$197,000 - $291,000
Machine Learning
Staff Software Engineer
In-Person
5,000+ Employees
8+ years of experience
AI · Enterprise SaaS
This job posting may no longer be active. You may be interested in these related jobs instead:

Description For Staff Software Engineer, ML Performance, GPUs

Google is seeking a Staff Software Engineer to join their ML, Systems, & Cloud AI (MSCA) organization, focusing on ML Performance and GPUs. This role is critical in optimizing Large Language Models (LLMs) including Google Gemini, Search, and Cloud LLM applications. The position requires deep expertise in machine learning infrastructure, GPU programming, and performance analysis.

The role involves working with cutting-edge ML technologies and infrastructure that powers Google's most important services. You'll be responsible for analyzing and optimizing LLM performance, maintaining benchmarks, and working closely with product teams to solve complex ML model performance challenges. The position offers the opportunity to impact billions of users through Google's services and Cloud platforms.

The ideal candidate will bring extensive experience in software development, ML infrastructure optimization, and GPU programming. You'll work in a complex, matrixed organization, collaborating with various teams to improve ML model efficiency and performance at scale. The role offers competitive compensation including base salary, bonus, equity, and comprehensive benefits.

This is an excellent opportunity for someone passionate about machine learning, performance optimization, and large-scale systems who wants to work at the forefront of AI technology. You'll be part of shaping the future of hyperscale computing while working on some of the most advanced ML systems in the industry.

Last updated 14 days ago

Responsibilities For Staff Software Engineer, ML Performance, GPUs

  • Analyze Large Language Model (LLM) performance and optimizations for partner teams including Google Gemini, Search, Cloud LLM and APIs
  • Identify and maintain LLM training and serving benchmarks
  • Engage with Google Product teams, to solve their ML model performance challenges
  • Run architecture-level simulations on GPU designs and perform roofline analysis
  • Analyze performance and efficiency metrics to identify bottlenecks, design, and implement solutions

Requirements For Staff Software Engineer, ML Performance, GPUs

Python
  • Bachelor's degree or equivalent practical experience
  • 8 years of experience in software development, and with data structures/algorithms
  • 5 years of experience testing, and launching software products
  • 3 years of experience with software design and architecture
  • 5 years of experience leading ML design and optimizing ML infrastructure
  • Experience with performance analysis and GPU programming

Benefits For Staff Software Engineer, ML Performance, GPUs

Medical Insurance
Equity
401k
  • Medical Insurance
  • Equity
  • 401k

Interested in this job?