NVIDIA, the world leader in accelerated computing, is seeking a Deep Learning Performance Architect to join their innovative team. This role focuses on developing GPU-accelerated Deep Learning software and optimizing deep learning kernels for inference. The position offers an opportunity to work with cutting-edge technology and collaborate with researchers worldwide who are using NVIDIA GPUs to power breakthroughs in numerous areas.
The role involves working with cross-collaborative teams across automotive, image understanding, and speech understanding domains to develop innovative solutions. You'll be responsible for performance optimization, analysis, and tuning of deep learning systems, while also having the opportunity to implement the latest algorithms for public release in Tensor-RT.
As a Deep Learning Performance Architect, you'll be part of a fast-paced, customer-oriented team where excellent communication skills are essential. The position requires strong technical expertise in C/C++ programming, GPU architecture, and deep learning frameworks. You'll have the chance to work with some of the most brilliant minds in the technology industry while contributing to NVIDIA's mission of advancing accelerated computing.
The ideal candidate should have at least 5 years of relevant experience, strong software development skills, and deep understanding of performance optimization. This role offers the opportunity to shape the future of AI and deep learning while working at one of the technology world's most desirable employers.