
Accelerator Architect and Performance Engineer, Generative AI

Google organizes the world's information and makes it universally accessible and useful, combining AI, Software, and Hardware to create helpful experiences.
$183,000 - $271,000
Machine Learning
Staff Software Engineer
In-Person
5,000+ Employees
8+ years of experience
AI

Description For Accelerator Architect and Performance Engineer, Generative AI

Google is seeking an experienced Accelerator Architect and Performance Engineer to join its Generative AI team. This role combines advanced machine learning architecture with hardware optimization to develop custom silicon solutions for Google's direct-to-consumer products. The position requires deep expertise in computer architecture, AI models, and programming, applied to optimizing performance for next-generation AI applications.

The ideal candidate will work at the intersection of hardware and software, collaborating with research teams, system architects, and compiler engineers to optimize future workloads across the entire tech stack. They will be responsible for defining system architecture requirements for future Generative AI use cases and applying cutting-edge research to achieve breakthrough performance improvements.

This is an advanced technical role that offers the opportunity to shape the future of AI hardware at one of the world's leading technology companies. The position comes with competitive compensation including a base salary range of $183,000-$271,000, plus bonus, equity, and comprehensive benefits. The role is based in either Mountain View or San Diego, California, and requires 8+ years of relevant experience.

The role involves working with state-of-the-art AI technologies, including Large Language Models, Vision Transformers, and Image Diffusion Models. The successful candidate will contribute to Google's mission of organizing the world's information while developing hardware solutions that power the next generation of AI applications. This is an excellent opportunity for someone passionate about the intersection of AI, hardware architecture, and performance optimization.

Last updated 2 days ago

Responsibilities For Accelerator Architect and Performance Engineer, Generative AI

  • Drive forward-looking GenAI machine learning architecture exploration for Tensor mobile SoCs
  • Work with researchers and program management teams to define system architecture requirements for future Generative AI use cases
  • Apply advanced research in architecture and process technology
  • Optimize performance of GenAI use cases by defining optimal model scheduling on TPU compute engines

Requirements For Accelerator Architect and Performance Engineer, Generative AI

  • Bachelor's degree in Electrical Engineering, Computer Engineering, Computer Science, a related field, or equivalent practical experience
  • 8 years of work or academic research experience in computer or chip architecture, performance, or compilers
  • Experience with Generative AI model architectures
  • Experience with C/C++ or Python and deep learning frameworks such as TensorFlow, JAX, or PyTorch

Benefits For Accelerator Architect and Performance Engineer, Generative AI

  • Medical Insurance
  • Equity
  • 401k
