Taro Logo

Senior Deep Learning Performance Engineer - Training at Scale

NVIDIA is the world leader in accelerated computing, pioneering solutions in AI and digital twins to transform industries.
Machine Learning
Senior Software Engineer
Remote
5,000+ Employees
5+ years of experience
AI

Job Description

NVIDIA is seeking a Senior Deep Learning Performance Engineer to join their team focused on optimizing Deep Learning training, inference, and NVIDIA AI Services. This role offers a unique opportunity to work across the entire hardware/software stack, from GPU architecture to Deep Learning Frameworks, to achieve maximum performance. As a senior engineer, you'll be implementing and optimizing deep learning models across various domains including Computer Vision, NLP/LLMs, ASR, and RecSys using multiple frameworks. You'll work with cutting-edge technology, including enterprise-grade GPU clusters capable of hundreds of Peta FLOPS, and have early access to unreleased hardware.

The position requires strong expertise in deep learning implementation and software development, with a focus on performance optimization. You'll be collaborating with world-class engineers and researchers, providing guidance on improving workload design, usability, and performance. The ideal candidate should have excellent Python programming skills, strong analytical capabilities, and a solid understanding of deep learning fundamentals.

Working at NVIDIA, one of technology's most desirable employers, you'll be at the forefront of the AI revolution, directly impacting hardware and software roadmaps. The company offers a diverse and inclusive work environment, with the opportunity to work remotely from various European locations. This role is perfect for someone passionate about high-performance computing, deep learning, and pushing the boundaries of what's possible in AI technology.

Last updated a day ago

Responsibilities For Senior Deep Learning Performance Engineer - Training at Scale

  • Implement deep learning models from multiple data domains (CV, NLP/LLMs, ASR, TTS, RecSys and others) in multiple DL frameworks
  • Implement and test new SW features that use the most recent HW functionalities
  • Analyze, profile, and optimize deep learning workloads on state-of-the-art hardware and software platforms
  • Collaborate with researchers and engineers across NVIDIA
  • Lead best-practices for building, testing, and releasing DL software

Requirements For Senior Deep Learning Performance Engineer - Training at Scale

Python
  • 5+ years of experience in DL model implementation and SW Development
  • BSc, MS or PhD degree in Computer Science, Computer Architecture, Mathematics, Physics or related technical field
  • Excellent Python programming skills
  • Extensive knowledge of at least one DL Framework
  • Strong problem solving and analytical skills
  • Algorithms and DL fundamentals

Related Jobs