NVIDIA is seeking an experienced Senior Deep Learning Engineer focused on analyzing and improving LLM inference performance. As a world leader in accelerated computing and AI, NVIDIA's GPUs power breakthroughs in deep learning, particularly in LLM, Generative AI, Recommenders and Vision technologies. The role involves working with cutting-edge LLM frameworks like TensorRT LLM, VLLM, and SGLang to optimize performance across NVIDIA's GPU portfolio.
The position requires strong expertise in deep learning software development, with emphasis on performance optimization and scaling of large language models. You'll be collaborating with diverse teams on performance modeling, analysis, and kernel development, while contributing to both NVIDIA and open-source LLM frameworks.
This is an exciting opportunity to work at the forefront of AI computing, helping build the platforms that enable real-time, cost-effective computing solutions. You'll be part of the team that's driving innovation in GPU deep learning, which has become fundamental to machine perception, reasoning, and natural language processing.
The role offers competitive compensation with a base salary range of $184,000 - $356,500 USD, plus equity and benefits. NVIDIA's commitment to diversity and inclusion, combined with their position as "the AI computing company," makes this an excellent opportunity for those passionate about advancing the field of deep learning and AI technology.