Google Cloud is seeking a Staff Software Engineer to lead GPU performance optimization efforts at Google scale. This role is part of the ML, Systems, & Cloud AI (MSCA) organization, which is responsible for the hardware, software, and ML infrastructure powering Google's services and Cloud offerings. The position requires deep expertise in GPU architecture, low-level programming, and ML infrastructure to optimize performance across Google's massive computing infrastructure.
The role combines technical leadership in GPU performance optimization with hands-on development work on critical systems that impact billions of users. You'll work on everything from low-level kernel optimization to high-level ML model design, helping bridge the gap between ML practitioners and hardware capabilities. This involves collaborating across teams to influence the entire GPU software stack while having access to cutting-edge GPU hardware and tools.
As a Staff Engineer, you'll shape technical direction for GPU performance at Google scale, working with teams across the company and external partners. The role offers the opportunity to impact major Google products and Cloud services while advancing the state-of-the-art in GPU computing. You'll need to balance deep technical work with technical leadership, mentoring other engineers while driving architectural decisions.
The position offers competitive compensation including base salary, bonus, equity, and comprehensive benefits. You'll work alongside world-class engineers and researchers while having the resources and scale of Google to tackle challenging technical problems. This is an opportunity to work on next-generation AI infrastructure while helping shape the future of GPU computing at one of the world's leading technology companies.