Taro Logo

Senior Software Engineer, Cloud AI/Machine Learning Infrastructure

Google Cloud accelerates organizations' digital transformation by delivering enterprise-grade solutions leveraging cutting-edge technology and developer tools.
Machine Learning
Senior Software Engineer
In-Person
5,000+ Employees
5+ years of experience
AI · Enterprise SaaS · Cloud

Job Description

Google Cloud is seeking a Senior Software Engineer to join their Cloud ML Compute Services team, focusing on advancing AI infrastructure capabilities. This role combines deep technical expertise in machine learning systems with large-scale distributed computing challenges. The position involves optimizing ML workload performance across the entire technical stack, from networking and storage to ML models, while working with cutting-edge AI technologies.

As a key member of the team, you'll be responsible for designing and implementing solutions that enhance the performance and reliability of Google Cloud's AI/ML infrastructure. You'll work with state-of-the-art technologies including GPUs and TPUs, and handle complex challenges in distributed computing and machine learning systems. The role requires strong collaboration with cross-functional teams and the ability to adapt to rapidly evolving AI technologies.

This is an excellent opportunity for someone passionate about AI infrastructure who wants to make a significant impact on Google Cloud's machine learning capabilities. You'll be working at the forefront of AI technology, helping to shape the future of cloud-based machine learning infrastructure while solving complex technical challenges that affect Google Cloud's global customer base.

The position offers the chance to work with world-class engineers and researchers, access to Google's advanced computing resources, and the opportunity to influence the direction of Google Cloud's AI infrastructure. You'll be part of a team that's driving innovation in cloud computing and machine learning, making these technologies more accessible and efficient for organizations worldwide.

Last updated 2 days ago

Responsibilities For Senior Software Engineer, Cloud AI/Machine Learning Infrastructure

  • Measure and enhance performance on Google Cloud across the technical stack, including storage, networking, and model throughput
  • Conduct performance profiling, debugging, and troubleshooting of AI/ML training and inference workloads
  • Partner with cross-functional, cross-regional teams to ensure the AI/ML infrastructure delivers excellent value
  • Identify and resolve performance bottlenecks, ensuring the infrastructure operates at optimal levels
  • Support the future of the AI/ML infrastructure by identifying gaps in the existing products and recommending enhancements

Requirements For Senior Software Engineer, Cloud AI/Machine Learning Infrastructure

Python
Java
Go
  • Bachelor's degree in Computer Science or equivalent practical experience
  • 5 years of experience with software development in one or more programming languages, and with data structures/algorithms
  • 3 years of experience with full stack development, across back-end such as Python, Java, C++, or GO codebases
  • 3 years of experience testing, maintaining, or launching software products
  • 1 year of experience with software design and architecture

Benefits For Senior Software Engineer, Cloud AI/Machine Learning Infrastructure

Medical Insurance
401k
Parental Leave
  • Comprehensive health benefits
  • Retirement plans
  • Parental leave support