NVIDIA, a pioneer in computer graphics and accelerated computing for over 25 years, is seeking a Senior On-Device Model Inference Optimization Engineer to drive innovation in autonomous vehicles technology. This role combines deep technical expertise in AI model optimization with practical implementation skills for on-device deployment. The position offers an opportunity to work at the intersection of AI and automotive technology, optimizing critical systems that power the future of autonomous vehicles.
The role demands expertise in advanced optimization techniques like pruning, quantization, and knowledge distillation, along with strong programming skills in CUDA, Python, and C++. You'll be working with cutting-edge frameworks including PyTorch, ONNX, and TensorRT, while collaborating with cross-functional teams to align optimization efforts with hardware capabilities.
As an NVIDIAN, you'll join a diverse and supportive environment where innovation is celebrated. The position offers competitive compensation with a base salary range of $184,000 - $356,500 USD, plus equity benefits. This is an excellent opportunity for experienced engineers passionate about pushing the boundaries of AI optimization and autonomous vehicle technology.
The ideal candidate brings 10+ years of relevant experience, with at least 5 years specifically in model inference and optimization. You should have an advanced degree in Computer Science or related field, and a proven track record of deploying optimized AI models at scale. The role requires both technical excellence and strong collaborative skills, as you'll be working across teams to deliver efficient, production-ready solutions for safety-critical systems.