Machine Learning Compiler Software Engineer, TPU Horizontal Scaling

Google develops next-generation technologies that change how billions of users connect, explore, and interact with information worldwide.
Machine Learning
Mid-Level Software Engineer
In-Person
5,000+ Employees
2+ years of experience
AI · Enterprise SaaS

Description For Machine Learning Compiler Software Engineer, TPU Horizontal Scaling

Google is seeking a Machine Learning Compiler Software Engineer to join their TPU Horizontal Scaling team, focusing on the development of the Accelerated Linear Algebra (XLA) TPU/GPU parallelizing compiler. This role is crucial for optimizing and scaling machine learning models across multiple accelerators for both internal and external customers.

The position involves working with cutting-edge technology in machine learning compilation, specifically on the XLA Single Program Multiple Data (SPMD) partitioner, collective and scheduling optimizations, and code generation. As part of Google Cloud, you'll be contributing to enterprise-grade solutions that leverage Google's advanced technology stack.

The ideal candidate should have a strong foundation in software development, data structures, and algorithms, with preferred experience in Machine Learning, High Performance Computing, and C++ programming. You'll be responsible for conducting performance analysis, implementing optimizations, and contributing to critical features that enhance production team efficiency.

This role offers the opportunity to work on projects that directly impact Google's core technologies and Cloud services, with the potential to influence how machine learning models are optimized and scaled across accelerator hardware. You'll be part of a team that serves customers in more than 200 countries, helping solve critical business problems through innovative technology solutions.

The position combines technical expertise with leadership opportunities, requiring both strong programming skills and the ability to collaborate with peers and stakeholders. You'll be at the forefront of developing solutions that enhance Google's machine learning infrastructure while working in an environment that values diversity, inclusion, and innovation.

Last updated 16 days ago

Responsibilities For Machine Learning Compiler Software Engineer, TPU Horizontal Scaling

  • Write product or system development code
  • Participate in, or lead design reviews with peers and stakeholders to decide amongst available technologies
  • Contribute to a compiler which scales-out machine learning models across accelerators such as TPU/GPU at Google and Cloud
  • Conduct static and runtime performance analysis of important large-scale production models
  • Design and implement performance optimizations and critical features, which increase the velocity of important production teams

Requirements For Machine Learning Compiler Software Engineer, TPU Horizontal Scaling

  • Bachelor's degree or equivalent practical experience
  • 2 years of experience with software development in one or more programming languages, or 1 year of experience with an advanced degree
  • 2 years of experience with data structures or algorithms
  • Master's degree or PhD in Computer Science, or a related technical field (preferred)
  • Experience in Machine Learning and High Performance Computing (HPC)
  • Experience optimizing programs at distributed scale
  • Experience in C++
  • Experience in compilers
  • Ability to debug and program concurrent/parallel computations

Interested in this job?

Jobs Related To Google Machine Learning Compiler Software Engineer, TPU Horizontal Scaling

Research Scientist, Geo Algorithms, Research

Research Scientist position at Google focusing on Geo Algorithms research, requiring PhD and publication experience, offering $136K-$200K salary plus benefits.

Research Scientist, Natural Language Processing

Research Scientist position at Google focusing on improving large language models' factuality and grounding capabilities.

Research Scientist, Gemini

Research Scientist position at Google working on Gemini, focusing on conversational AI and machine learning research with opportunity to publish and collaborate globally.

Research Scientist, Multilingual Large Language Models

Research Scientist position at Google focusing on developing and improving multilingual Large Language Models, requiring PhD and expertise in AI/ML.

Software Developer III, AI/ML, Google Cloud AI

Software Developer III position at Google Cloud AI, focusing on machine learning implementation and research, requiring 2 years of software development experience.