Machine Learning Compiler Software Engineer, TPU Horizontal Scaling

Google

Google develops next-generation technologies that change how billions of users connect, explore, and interact with information worldwide.

London, UK

Machine Learning

Mid-Level Software Engineer

In-Person

5,000+ Employees

2+ years of experience

AI · Enterprise SaaS

This job posting may no longer be active. You may be interested in these related jobs instead:

Description For Machine Learning Compiler Software Engineer, TPU Horizontal Scaling

Google is seeking a Machine Learning Compiler Software Engineer to join their TPU Horizontal Scaling team, focusing on the development of the Accelerated Linear Algebra (XLA) TPU/GPU parallelizing compiler. This role is crucial for optimizing and scaling machine learning models across multiple accelerators for both internal and external customers.

The position involves working with cutting-edge technology in machine learning compilation, specifically on the XLA Single Program Multiple Data (SPMD) partitioner, collective and scheduling optimizations, and code generation. As part of Google Cloud, you'll be contributing to enterprise-grade solutions that leverage Google's advanced technology stack.

The ideal candidate should have a strong foundation in software development, data structures, and algorithms, with preferred experience in Machine Learning, High Performance Computing, and C++ programming. You'll be responsible for conducting performance analysis, implementing optimizations, and contributing to critical features that enhance production team efficiency.

This role offers the opportunity to work on projects that directly impact Google's core technologies and Cloud services, with the potential to influence how machine learning models are optimized and scaled across accelerator hardware. You'll be part of a team that serves customers in more than 200 countries, helping solve critical business problems through innovative technology solutions.

The position combines technical expertise with leadership opportunities, requiring both strong programming skills and the ability to collaborate with peers and stakeholders. You'll be at the forefront of developing solutions that enhance Google's machine learning infrastructure while working in an environment that values diversity, inclusion, and innovation.

Last updated 7 months ago

Responsibilities For Machine Learning Compiler Software Engineer, TPU Horizontal Scaling

Write product or system development code
Participate in, or lead design reviews with peers and stakeholders to decide amongst available technologies
Contribute to a compiler which scales-out machine learning models across accelerators such as TPU/GPU at Google and Cloud
Conduct static and runtime performance analysis of important large-scale production models
Design and implement performance optimizations and critical features, which increase the velocity of important production teams

Requirements For Machine Learning Compiler Software Engineer, TPU Horizontal Scaling

Bachelor's degree or equivalent practical experience
2 years of experience with software development in one or more programming languages, or 1 year of experience with an advanced degree
2 years of experience with data structures or algorithms
Master's degree or PhD in Computer Science, or a related technical field (preferred)
Experience in Machine Learning and High Performance Computing (HPC)
Experience optimizing programs at distributed scale
Experience in C++
Experience in compilers
Ability to debug and program concurrent/parallel computations