Google Cloud is seeking a Staff Software Engineer for their Cloud ML Compute Services team. This role is crucial in building and supporting the Google Cloud Platform (GCP) Cloud TPU and GPU services, as well as related ML models and frameworks. The position involves working on projects that provide ML infrastructure customers with large-scale, cloud-based access to Google's ML supercomputers for running training and inference workloads using PyTorch and JAX.
Key responsibilities include improving LLM training and inference performance on TPU, adding new features, publishing high-performance open-source kernels, and collaborating with various teams to design and implement new PyTorch features. The ideal candidate will have extensive experience in software development, machine learning algorithms, and technical leadership.
Google Cloud accelerates digital transformation for organizations worldwide, delivering enterprise-grade solutions that leverage cutting-edge technology. The role offers a competitive salary range of $189,000-$284,000 plus bonus, equity, and benefits, depending on factors such as location, skills, and experience.
This position requires a blend of technical expertise and leadership skills, with a focus on machine learning infrastructure and high-performance computing. The successful candidate will play a vital role in advancing Google Cloud's ML capabilities and supporting customers in solving critical business problems through innovative cloud solutions.