Accelerator Architect and Performance Engineer, Generative AI

Google organizes the world's information and makes it universally accessible and useful, combining AI, Software, and Hardware to create helpful experiences.
$183,000 - $271,000
Machine Learning
Staff Software Engineer
In-Person
5,000+ Employees
8+ years of experience
AI

Description For Accelerator Architect and Performance Engineer, Generative AI

Google is seeking an experienced Accelerator Architect and Performance Engineer to join its Generative AI team. This role combines advanced hardware architecture with cutting-edge AI technology, focusing on developing custom silicon solutions for Google's direct-to-consumer products. The position requires deep expertise in computer architecture, AI models, and performance optimization, working at the intersection of hardware and machine learning.

The ideal candidate will drive the architecture exploration for Tensor mobile SoCs, collaborate with research teams, and optimize future workloads across the entire tech stack. They will work on breakthrough improvements in power and performance for Generative AI applications, particularly focusing on TPU compute engines and system architecture requirements.

This is an advanced technical role requiring both broad and deep expertise across computer architecture, AI/ML systems, and performance engineering. The position offers competitive compensation including base salary, bonus, equity, and comprehensive benefits. The role provides an opportunity to shape the future of AI hardware at one of the world's leading technology companies, working on products that impact millions of users.

The position requires significant experience in computer architecture and AI systems, with particular emphasis on Generative AI model architectures. The successful candidate will need to demonstrate strong programming skills, understanding of hardware/software co-design, and excellent communication abilities to work effectively across multiple technical teams.

Last updated 5 days ago

Responsibilities For Accelerator Architect and Performance Engineer, Generative AI

  • Drive forward-looking GenAI machine learning architecture exploration for Tensor mobile SoCs
  • Work with researchers and program management teams to define system architecture requirements for future Generative AI use cases
  • Apply advanced research in architecture and process technology to get breakthrough power and performance improvements on Generative AI workloads
  • Optimize performance of GenAI use cases by defining optimal model scheduling on the TPU compute engines

Requirements For Accelerator Architect and Performance Engineer, Generative AI

  • Bachelor's degree in Electrical Engineering, Computer Engineering, Computer Science, a related field, or equivalent practical experience
  • 8 years of work or academic research experience in computer or chip architecture, performance, or compilers
  • Experience with Generative AI model architectures (e.g., Large Language Models, Vision Transformers, Image Diffusion Models, etc.)
  • Experience with one or more general-purpose programming languages, including (but not limited to) C/C++ or Python, and deep learning frameworks like TensorFlow, JAX, or PyTorch

Benefits For Accelerator Architect and Performance Engineer, Generative AI

  • Medical Insurance
  • Equity
  • 401k
