Taro Logo

Senior Deep Learning Systems Engineer, Datacenters

NVIDIA is the world leader in accelerated computing, pioneering AI and digital twins technology.
$184,000 - $356,500
Machine Learning
Senior Software Engineer
Hybrid
5,000+ Employees
8+ years of experience
AI · Enterprise SaaS

Job Description

NVIDIA is seeking a Senior Deep Learning Systems Engineer to join their Datacenter team, playing a crucial role in optimizing their growing datacenter deployments and establishing data-driven approaches to hardware design and system software development. This position offers an opportunity to work at the intersection of deep learning and datacenter architecture, focusing on performance optimization for AI applications.

The role involves analyzing and improving the performance of deep learning applications on datacenter-class hardware, with a particular emphasis on Large Language Models (LLMs). You'll be working with cutting-edge technology in AI and deep learning, developing tools and methodologies to measure and enhance system performance.

As a Senior Deep Learning Systems Engineer, you'll be responsible for developing software infrastructure to analyze deep learning applications, creating profiling tools, and evolving cost-efficient datacenter architectures. The position requires expertise in system architecture, performance analysis, and programming skills in languages like C++, Python, and CUDA.

The ideal candidate will have 8+ years of experience, with a strong background in either system software (including Linux, compilers, and deep learning frameworks) or silicon architecture. A deep understanding of computer system architecture and performance analysis is essential. Experience with containerization platforms like Docker and datacenter workload managers like Slurm is advantageous.

NVIDIA offers competitive compensation, including a base salary range of $184,000 - $356,500 USD (depending on level), equity, and comprehensive benefits. The company is known for its innovative culture and commitment to pushing the boundaries of technology, particularly in AI and accelerated computing. This role presents an excellent opportunity to work with some of the most forward-thinking professionals in the industry while contributing to the development of next-generation AI infrastructure.

Last updated a day ago

Responsibilities For Senior Deep Learning Systems Engineer, Datacenters

  • Develop software infrastructure to characterize and analyze Deep Learning applications
  • Evolve cost-efficient datacenter architectures for Large Language Models (LLMs)
  • Develop analysis and profiling tools in Python, bash and C++ to measure performance metrics
  • Analyze system and software characteristics of DL applications
  • Develop analysis tools and methodologies to measure key performance metrics

Requirements For Senior Deep Learning Systems Engineer, Datacenters

Python
Linux
Kubernetes
  • Bachelor's degree in Electrical Engineering or Computer Science (Masters or PhD preferred)
  • 8+ years of relevant experience
  • Experience in System Software (Linux, Compilers, GPU kernels, DL Frameworks) or Silicon Architecture
  • Experience programming in C/C++ and Python
  • Deep understanding of computer system architecture and performance analysis
  • Ability to work in virtual environments

Benefits For Senior Deep Learning Systems Engineer, Datacenters

Equity
Medical Insurance
  • Equity
  • Medical Insurance

Related Jobs

Senior Architecture Energy Modeling Engineer

Senior Architecture Energy Modeling Engineer role at NVIDIA focusing on ML-based power modeling and energy efficiency optimization for GPUs, offering $168K-$310K base salary plus equity.

Senior DFX Software Engineer - Machine Learning

Senior DFX Software Engineer role at NVIDIA focusing on machine learning applications in silicon testing, offering $136K-$264.5K salary plus benefits.

Senior Software Engineer, Agentic AI

Senior Software Engineer position at NVIDIA focusing on developing the Agent Intelligence (AIQ) toolkit for enterprise AI applications, requiring 5+ years of Python experience and expertise in LLM frameworks.

Senior Deep Learning Frameworks Sustaining Engineer

Senior Deep Learning Engineer role at NVIDIA focusing on maintaining and improving machine learning frameworks and enterprise products.

Senior Computer Vision System Performance Engineer

Senior Computer Vision System Performance Engineer role at NVIDIA focusing on optimizing computer vision applications and developing hardware-accelerated pipelines.