NVIDIA, the world leader in accelerated computing, is seeking a Senior Software Engineer to join its Deep Learning Inference team. This role offers the opportunity to make a significant impact in deep learning by developing state-of-the-art inference frameworks for Large Language Models (LLMs) on NVIDIA GPUs.
The position involves working with TensorRT-LLM, NVIDIA's premier library for optimizing LLM inference performance. You'll be at the forefront of AI technology, collaborating with deep learning experts, GPU architects, and DevOps engineers while engaging with the broader deep learning community through open-source development.
The ideal candidate brings 6+ years of software development experience, strong Python skills, and a deep understanding of Machine Learning concepts, particularly in Large Language Models. Experience with C++, open-source development, and frameworks like vLLM, TensorRT, PyTorch, or JAX would be highly valuable.
NVIDIA offers a competitive compensation package with a base salary range of $184,000 - $287,500 USD, plus equity and benefits. The company is known for its innovative culture and commitment to pushing technological boundaries. Working at NVIDIA means joining one of technology's most desirable employers, where you'll help build the computing platforms driving success in AI and digital twins.
This hybrid role is based in Santa Clara, CA, offering the flexibility of both remote and office work. NVIDIA maintains a strong commitment to diversity and inclusion, fostering an environment where creativity and autonomy are highly valued. If you're passionate about deep learning and high-performance computing and want to work with some of the industry's brightest minds, this role presents an exceptional opportunity to advance your career while contributing to groundbreaking technological developments.