Compute Architect

NVIDIA

NVIDIA is the world leader in accelerated computing, pioneering solutions in AI and digital twins.

Shanghai, China

Backend

Staff Software Engineer

In-Person

5,000+ Employees

5+ years of experience

Job Description

NVIDIA, the world leader in accelerated computing, is seeking a Compute Architect to join their team. This role sits at the crucial intersection of compiler technology and computer architectures for deep learning. The position involves developing and optimizing MLIR-based compiler infrastructure that powers NVIDIA's deep learning libraries and influences future GPU architectures. The ideal candidate will have strong expertise in compiler development, particularly with MLIR or LLVM frameworks, and deep understanding of computer architecture and performance optimization. They will work on cutting-edge projects involving deep learning algorithms, especially matrix multiplication and convolution. This role offers the opportunity to make significant impact in a fast-moving, technology-focused company, working with world-class teams at the forefront of AI and hardware innovation. The position requires collaboration with hardware architects and software teams, making it perfect for someone who thrives in a cross-functional environment and wants to shape the future of machine learning infrastructure. Located in Shanghai, this role combines technical depth with strategic influence on NVIDIA's compiler technology direction.

Last updated a month ago

Responsibilities For Compute Architect

Design, implement, and optimize MLIR-based compiler passes for deep learning and data analytics workloads
Analyze and improve the performance of machine learning and deep learning algorithms
Identify performance bottlenecks in compiler-generated code and propose solutions
Collaborate with hardware architects and software teams to co-design features
Contribute to the evolution of NVIDIA's deep learning compiler stack and libraries

Requirements For Compute Architect

Python

MS or PhD in Computer Science, Electrical Engineering, Mathematics, or related field
5+ years of working experience
Experience developing compilers or compiler infrastructure
Strong programming skills in C++ and Python
Understanding of computer architecture
Experience optimizing code for CPUs or GPUs
Experience with deep learning algorithms