Deep Learning Performance Architect

NVIDIA is the world leader in accelerated computing, pioneering solutions in AI and digital twins.
Machine Learning
Senior Software Engineer
In-Person
5,000+ Employees
5+ years of experience
AI

Description For Deep Learning Performance Architect

NVIDIA, the world leader in accelerated computing, is seeking a Deep Learning Performance Architect to join its team. This role focuses on developing GPU-accelerated deep learning software and optimizing deep learning kernels for inference. The position offers an opportunity to work with cutting-edge technology and collaborate with researchers worldwide who use NVIDIA GPUs to power breakthroughs in numerous areas.

The role involves working with teams across the automotive, image understanding, and speech understanding domains to develop innovative solutions. You'll be responsible for performance optimization, analysis, and tuning of deep learning systems, and you'll have the opportunity to implement the latest algorithms for public release in TensorRT.

As a Deep Learning Performance Architect, you'll be part of a fast-paced, customer-oriented team where excellent communication skills are essential. The position requires strong technical expertise in C/C++ programming, GPU architecture, and deep learning frameworks. You'll have the chance to work with some of the most brilliant minds in the technology industry while contributing to NVIDIA's mission of advancing accelerated computing.

The ideal candidate has at least 5 years of relevant experience, strong software development skills, and a deep understanding of performance optimization. This role offers the opportunity to shape the future of AI and deep learning while working at one of the technology world's most desirable employers.

Responsibilities For Deep Learning Performance Architect

  • Develop highly optimized deep learning kernels for inference
  • Perform performance optimization, analysis, and tuning
  • Work with teams across automotive, image understanding, and speech understanding
  • Travel to conferences and customers for technical consultation and training

Requirements For Deep Learning Performance Architect

  • Master's or PhD, or equivalent experience, in a relevant discipline (CE, CS&E, CS, AI)
  • Agile software development skills
  • Excellent C/C++ programming and software design skills
  • Python experience
  • Performance modelling, profiling, debugging, and code optimization knowledge
  • GPU programming experience (CUDA or OpenCL)
  • 5 years of relevant work experience

Jobs Related To NVIDIA Deep Learning Performance Architect

AI Computing Software Development Engineer, TensorRT

Senior AI Software Engineer role at NVIDIA developing TensorRT inferencing software, requiring 3+ years experience and expertise in C++, deep learning frameworks, and AI technologies.

Senior Performance Engineer - Deep Learning

Senior Performance Engineer position at NVIDIA focusing on optimizing Deep Learning tools and frameworks, offering competitive salary and opportunity to work with cutting-edge AI technology.

Deep-Learning Software Engineer, Performance Optimization

Senior Deep Learning Software Engineer role at NVIDIA focusing on AI performance optimization and implementation, requiring 5+ years of experience in machine learning and software development.

AI Developer Technology Engineer

Senior AI Developer Technology Engineer role at NVIDIA focusing on deep learning, GPU optimization, and customer solutions.

Senior Compiler Engineer, Software - Deep Learning Accelerator

Senior Compiler Engineer position at NVIDIA focusing on Deep Learning Accelerator development, requiring 7+ years of experience in compiler or deep learning software projects.