Taro Logo

Deep-Learning Software Engineer, Performance Optimization

NVIDIA is the world leader in accelerated computing, pioneering GPU technology and AI solutions.
Machine Learning
Senior Software Engineer
In-Person
5,000+ Employees
5+ years of experience
AI · Enterprise SaaS

Job Description

NVIDIA is seeking a Deep Learning Software Engineer to join their team in Tokyo, focusing on performance optimization of deep learning solutions. This role combines cutting-edge AI development with practical implementation, requiring expertise in both theoretical machine learning and hands-on software engineering. The position involves working with NVIDIA's latest hardware platforms to optimize and deploy deep neural networks for real-world applications. The ideal candidate will have strong programming skills in C++ and Python, deep understanding of machine learning frameworks, and experience with hardware acceleration. This is an opportunity to work at the forefront of AI technology, collaborating with world-class partners and internal teams to develop innovative solutions. The role offers exposure to the latest developments in AI, from model compression to neural architecture search, while working for the world leader in accelerated computing. NVIDIA's work spans from groundbreaking AI research to practical applications in autonomous vehicles and high-performance computing, making this an exciting opportunity for someone passionate about pushing the boundaries of what's possible in AI and computing.

Last updated 11 hours ago

Responsibilities For Deep-Learning Software Engineer, Performance Optimization

  • Push the boundaries of state of the art in DNN performance and efficiency
  • Analyze, profile and optimize the latest DNN AI algorithms
  • Implement production-quality software libraries for latency-critical use-cases
  • Collaborate with researchers and engineers across NVIDIA
  • Assist NVIDIA customers to bring ground-breaking products to life

Requirements For Deep-Learning Software Engineer, Performance Optimization

Python
Kubernetes
  • University degree in Computer Science, Electrical Engineering, Physics or Mathematics
  • 5+ years of work experience in AI, machine learning, HPC, or numeric computing
  • Proficiency in C++, Python, data structures, algorithms
  • Knowledge of deep-learning toolchains (PyTorch, TensorFlow, Keras, ONNX, TensorRT)
  • Experience with neural network training, pruning and quantization
  • Experience optimizing algorithms on accelerated hardware
  • Familiarity with CNN, LLM and ViT architectures

Related Jobs

AI Developer Technology Engineer

Senior AI Developer Technology Engineer role at NVIDIA focusing on deep learning, GPU optimization, and customer solutions.

Machine Learning Solutions Engineer, Google Cloud Learning Services (Japanese, English)

Senior Machine Learning Solutions Engineer position at Google Cloud, focusing on delivering ML/AI training and solutions, requiring Japanese-English bilingual skills and 6 years of ML experience.

Cloud & AI Solution Engineer – AI Applications

Senior Cloud & AI Solution Engineer role at Microsoft, focusing on AI Applications and enterprise solutions, requiring 4+ years of technical pre-sales experience and expertise in AI/ML models.

Machine Learning Solutions Engineer, Google Cloud Learning Services (Japanese, English)

Senior Machine Learning Solutions Engineer position at Google Cloud, focusing on delivering ML/AI content and leading technical training while working with cutting-edge technology.

Machine Learning Solutions Engineer, Google Cloud Learning Services (Japanese, English)

Senior Machine Learning Solutions Engineer position at Google Cloud, focusing on delivering ML/AI content and leading technical training while working directly with customers to implement ML solutions.