Taro Logo

Senior Software Engineer, Cloud AI/Machine Learning Infrastructure

Google Cloud accelerates organizations' digital transformation by delivering enterprise-grade solutions leveraging Google's cutting-edge technology.
Machine Learning
Senior Software Engineer
In-Person
5,000+ Employees
5+ years of experience
AI · Enterprise SaaS · Cloud

Description For Senior Software Engineer, Cloud AI/Machine Learning Infrastructure

Google Cloud is seeking a Senior Software Engineer to join their Cloud ML Compute Services team, focusing on advancing AI infrastructure capabilities. This role combines deep technical expertise in machine learning infrastructure with performance optimization across the entire technical stack. The position involves working with cutting-edge AI/ML technologies, including model deployment, performance optimization, and distributed computing with GPUs/TPUs.

As a key member of the team, you'll be responsible for optimizing ML workload performance across all technical layers, from networking and storage to ML models. You'll design and implement custom ML solutions from prototype to production, while providing technical guidance to customers throughout their journey. The role requires strong collaboration with cross-functional and cross-regional teams to drive technology forward and ensure customer success.

The ideal candidate should have extensive experience in software development, particularly in backend technologies like Python, Java, C++, or Go. Experience with Generative AI, Large Language Models (LLM), or Machine Learning infrastructure is highly valued. The position offers the opportunity to work at the forefront of AI technology advancement while solving complex technical challenges in a dynamic environment.

This role at Google Cloud combines technical depth with customer impact, making it perfect for engineers passionate about both cutting-edge AI technology and delivering practical solutions. You'll be part of a team that's shaping the future of AI infrastructure while working for one of the world's leading technology companies, with access to state-of-the-art resources and the opportunity to make a significant impact in the field of AI/ML infrastructure.

Last updated a month ago

Responsibilities For Senior Software Engineer, Cloud AI/Machine Learning Infrastructure

  • Measure and enhance performance on Google Cloud across the technical stack, including storage, networking, and model throughput
  • Conduct performance profiling, debugging, and troubleshooting of AI/ML training and inference workloads
  • Partner with cross-functional, cross-regional teams to ensure the AI/ML infrastructure delivers excellent value
  • Identify and resolve performance bottlenecks, ensuring the infrastructure operates at optimal levels
  • Support the future of the AI/ML infrastructure by identifying gaps in the existing products and recommending enhancements

Requirements For Senior Software Engineer, Cloud AI/Machine Learning Infrastructure

Python
Java
Go
  • Bachelor's degree in Computer Science or equivalent practical experience
  • 5 years of experience with software development in one or more programming languages, and with data structures/algorithms
  • 3 years of experience with full stack development, across back-end such as Python, Java, C++, or GO codebases
  • 3 years of experience testing, maintaining, or launching software products
  • 1 year of experience with software design and architecture

Jobs Related To Google Senior Software Engineer, Cloud AI/Machine Learning Infrastructure