Taro Logo

Compute Architect

NVIDIA is the world leader in accelerated computing, pioneering solutions in AI and digital twins.
Backend
Staff Software Engineer
In-Person
5,000+ Employees
5+ years of experience
AI

Job Description

NVIDIA, the world leader in accelerated computing, is seeking a Compute Architect to join their team. This role sits at the crucial intersection of compiler technology and computer architectures for deep learning. The position involves developing and optimizing MLIR-based compiler infrastructure that powers NVIDIA's deep learning libraries and influences future GPU architectures. The ideal candidate will have strong expertise in compiler development, particularly with MLIR or LLVM frameworks, and deep understanding of computer architecture and performance optimization. They will work on cutting-edge projects involving deep learning algorithms, especially matrix multiplication and convolution. This role offers the opportunity to make significant impact in a fast-moving, technology-focused company, working with world-class teams at the forefront of AI and hardware innovation. The position requires collaboration with hardware architects and software teams, making it perfect for someone who thrives in a cross-functional environment and wants to shape the future of machine learning infrastructure. Located in Shanghai, this role combines technical depth with strategic influence on NVIDIA's compiler technology direction.

Last updated a month ago

Responsibilities For Compute Architect

  • Design, implement, and optimize MLIR-based compiler passes for deep learning and data analytics workloads
  • Analyze and improve the performance of machine learning and deep learning algorithms
  • Identify performance bottlenecks in compiler-generated code and propose solutions
  • Collaborate with hardware architects and software teams to co-design features
  • Contribute to the evolution of NVIDIA's deep learning compiler stack and libraries

Requirements For Compute Architect

Python
  • MS or PhD in Computer Science, Electrical Engineering, Mathematics, or related field
  • 5+ years of working experience
  • Experience developing compilers or compiler infrastructure
  • Strong programming skills in C++ and Python
  • Understanding of computer architecture
  • Experience optimizing code for CPUs or GPUs
  • Experience with deep learning algorithms