NVIDIA is seeking an experienced Senior Deep Learning Engineer focused on analyzing and improving LLM inference performance. As a key member of the team building GPU-accelerated Deep Learning software, you'll work on cutting-edge technologies like TensorRT, DL benchmarking software, and performant solutions for model deployment and serving. The role involves collaborating with the deep learning community to implement latest algorithms in TensorRT LLM, VLLM, and SGLang while identifying and optimizing performance opportunities across NVIDIA's accelerator spectrum. You'll be working at the intersection of deep learning and high-performance computing, contributing to NVIDIA's position as a leader in AI computing. The role offers competitive compensation, including a base salary range of $184,000-$356,500 USD, plus equity and benefits. This is an excellent opportunity to join a company at the forefront of AI and accelerated computing, working on technology that's transforming industries and society. The hybrid work environment and collaborative culture make this an ideal position for someone passionate about deep learning and performance optimization.