NVIDIA is seeking a Senior Deep Learning Systems Engineer to join their Datacenter team, playing a crucial role in optimizing their growing datacenter deployments and establishing data-driven approaches to hardware design and system software development. This position offers an opportunity to work at the intersection of deep learning and datacenter architecture, focusing on performance optimization for AI applications.
The role involves analyzing and improving the performance of deep learning applications on datacenter-class hardware, with a particular emphasis on Large Language Models (LLMs). You'll be working with cutting-edge technology in AI and deep learning, developing tools and methodologies to measure and enhance system performance.
As a Senior Deep Learning Systems Engineer, you'll be responsible for developing software infrastructure to analyze deep learning applications, creating profiling tools, and evolving cost-efficient datacenter architectures. The position requires expertise in system architecture, performance analysis, and programming skills in languages like C++, Python, and CUDA.
The ideal candidate will have 8+ years of experience, with a strong background in either system software (including Linux, compilers, and deep learning frameworks) or silicon architecture. A deep understanding of computer system architecture and performance analysis is essential. Experience with containerization platforms like Docker and datacenter workload managers like Slurm is advantageous.
NVIDIA offers competitive compensation, including a base salary range of $184,000 - $356,500 USD (depending on level), equity, and comprehensive benefits. The company is known for its innovative culture and commitment to pushing the boundaries of technology, particularly in AI and accelerated computing. This role presents an excellent opportunity to work with some of the most forward-thinking professionals in the industry while contributing to the development of next-generation AI infrastructure.