Taro Logo

Senior Software Engineer, Cloud AI/Machine Learning Infrastructure

Google Cloud accelerates organizations' digital transformation by delivering enterprise-grade solutions leveraging cutting-edge technology and developer tools.
Machine Learning
Senior Software Engineer
In-Person
5,000+ Employees
5+ years of experience
AI · Enterprise SaaS · Cloud

Job Description

Google Cloud is seeking a Senior Software Engineer to join their Cloud ML Compute Services team, focusing on advancing AI infrastructure capabilities. This role combines deep technical expertise in machine learning infrastructure with performance optimization across the entire technical stack. The position involves working with cutting-edge AI technologies, including Generative AI and Large Language Models, while managing complex distributed computing systems utilizing GPUs and TPUs.

The ideal candidate will be responsible for optimizing ML workload performance across all technical layers, from networking and storage to model deployment. They will conduct performance profiling, debugging, and troubleshooting of AI/ML training and inference workloads, while collaborating with cross-functional teams globally to ensure customer success.

This role offers an opportunity to work at the forefront of AI technology, helping shape the future of machine learning infrastructure at scale. The position requires strong technical skills in software development, distributed systems, and machine learning, combined with the ability to adapt to rapidly evolving AI technologies.

Working at Google Cloud means joining a team that's driving innovation in enterprise-grade solutions and helping organizations across 200+ countries transform their businesses through technology. The role offers exposure to cutting-edge AI technologies and the chance to solve complex technical challenges while working with a diverse, global team of experts.

The position is based in Taipei, Taiwan, offering the opportunity to work with Google's world-class engineering teams while contributing to projects that have global impact. This role is perfect for someone who is passionate about AI/ML infrastructure and wants to help shape the future of cloud computing and machine learning technologies.

Last updated 9 days ago

Responsibilities For Senior Software Engineer, Cloud AI/Machine Learning Infrastructure

  • Measure and enhance performance on Google Cloud across the technical stack, including storage, networking, and model throughput
  • Conduct performance profiling, debugging, and troubleshooting of AI/ML training and inference workloads
  • Partner with cross-functional, cross-regional teams to ensure the AI/ML infrastructure delivers excellent value
  • Identify and resolve performance bottlenecks, ensuring the infrastructure operates at optimal levels
  • Support the future of the AI/ML infrastructure by identifying gaps in the existing products and recommending enhancements

Requirements For Senior Software Engineer, Cloud AI/Machine Learning Infrastructure

Python
Java
Go
  • Bachelor's degree in Computer Science or equivalent practical experience
  • 5 years of experience with software development in one or more programming languages, and with data structures/algorithms
  • 3 years of experience with full stack development, across back-end such as Python, Java, C++, or GO codebases
  • 3 years of experience testing, maintaining, or launching software products
  • 1 year of experience with software design and architecture