NVIDIA, the world leader in accelerated computing, is seeking a Senior Software Engineer to join their Deep Learning Inference TensorRT software team. This role presents an exciting opportunity to make a significant impact in the field of Deep Learning by developing state-of-the-art inference frameworks for accelerating Deep Learning models, particularly Large Language Models, on NVIDIA GPUs.
The position involves working with cutting-edge technology at the intersection of deep learning and high-performance computing. As a Senior Software Engineer, you'll be responsible for developing components of TensorRT, NVIDIA's SDK for high-performance deep learning inference, using C++ and Python to build graph parsers, optimizers, and deployment tools for trained deep learning models.
The ideal candidate brings at least 3 years of software development experience, with strong expertise in C++11/C++14/C++17 and a solid foundation in Machine Learning concepts. Knowledge of Computer Architecture, Data Structures, and Algorithms is essential. The role requires excellent communication skills and the ability to collaborate effectively with diverse teams of deep learning experts, GPU architects, and DevOps engineers.
What makes this opportunity particularly compelling is the chance to work at NVIDIA, widely recognized as one of the technology world's most desirable employers. The company offers competitive compensation, including a base salary range of $148,000 - $287,500 USD, equity, and comprehensive benefits. The position is hybrid, based in Santa Clara, CA, allowing for both collaborative in-person work and flexible remote options.
Additional valuable skills include experience with system software development, GPU kernel programming using CUDA or OpenCL, software performance optimization, compiler development, and familiarity with ML frameworks like TensorRT, PyTorch, TensorFlow, and ONNX Runtime. This role offers the opportunity to work on real-world applications of deep learning technology while contributing to NVIDIA's mission of transforming industries through AI and accelerated computing.