Deep-Learning Software Engineer, Performance Optimization

NVIDIA is the world leader in accelerated computing, pioneering GPU technology and AI solutions.
Machine Learning
Senior Software Engineer
In-Person
5,000+ Employees
5+ years of experience
AI

Description For Deep-Learning Software Engineer, Performance Optimization

NVIDIA is seeking a Deep Learning Software Engineer to join their team in Tokyo, focusing on performance optimization of deep learning solutions. This role combines cutting-edge AI development with practical implementation, requiring expertise in both theoretical machine learning and hands-on software engineering. The position involves working with NVIDIA's latest hardware platforms to optimize and deploy deep neural networks for real-world applications. As a senior member of the team, you'll collaborate with researchers and engineers across NVIDIA to improve chip architecture while also working directly with partners to implement AI solutions. The role requires strong technical skills in C++, Python, and various deep learning frameworks, along with experience in model optimization techniques. This is an opportunity to work at the intersection of hardware and AI, helping shape the future of accelerated computing at a global technology leader. The position offers exposure to the latest developments in AI technology and the chance to work on projects that will impact various industries through NVIDIA's widespread presence in the AI and GPU computing space.

Last updated 3 hours ago

Responsibilities For Deep-Learning Software Engineer, Performance Optimization

  • Analyze, profile and optimize the latest DNN AI algorithms
  • Implement production-quality software libraries for latency-critical use-cases
  • Push boundaries in DNN performance and efficiency, including model compression and quantization
  • Collaborate with researchers and engineers across NVIDIA to improve chip architecture
  • Assist NVIDIA customers in implementing AI technology

Requirements For Deep-Learning Software Engineer, Performance Optimization

Python
  • University degree in Computer Science, Electrical Engineering, Physics or Mathematics
  • 5+ years of work experience in HPC, numeric computing, machine learning, or AI
  • Proficiency in C++, Python, data structures, algorithms
  • Knowledge of deep-learning toolchains (PyTorch, TensorFlow, Keras, ONNX, TensorRT)
  • Experience with neural network training, pruning and quantization
  • Experience optimizing algorithms on accelerated hardware
  • Familiarity with CNN, LLM and ViT architectures
  • Experience creating DNN models for production problems

Interested in this job?

Jobs Related To NVIDIA Deep-Learning Software Engineer, Performance Optimization

AI Computing Software Development Engineer, TensorRT

Senior AI Software Engineer role at NVIDIA developing TensorRT inferencing software, requiring 3+ years experience and expertise in C++, deep learning frameworks, and AI technologies.

Deep Learning Performance Architect

Senior Deep Learning Performance Architect position at NVIDIA, focusing on optimizing deep learning kernels and developing GPU-accelerated software solutions.

Senior Performance Engineer - Deep Learning

Senior Performance Engineer position at NVIDIA focusing on optimizing Deep Learning tools and frameworks, offering competitive salary and opportunity to work with cutting-edge AI technology.

AI Developer Technology Engineer

Senior AI Developer Technology Engineer role at NVIDIA focusing on deep learning, GPU optimization, and customer solutions.

Senior Compiler Engineer, Software - Deep Learning Accelerator

Senior Compiler Engineer position at NVIDIA focusing on Deep Learning Accelerator development, requiring 7+ years of experience in compiler or deep learning software projects.