NVIDIA is seeking a Senior Deep Learning Performance Engineer to join their team focused on optimizing Deep Learning training, inference, and NVIDIA AI Services. This role offers a unique opportunity to work across the entire hardware/software stack, from GPU architecture to Deep Learning Frameworks, to achieve maximum performance. As a senior engineer, you'll be implementing and optimizing deep learning models across various domains including Computer Vision, NLP/LLMs, ASR, and RecSys using multiple frameworks. You'll work with cutting-edge technology, including enterprise-grade GPU clusters capable of hundreds of Peta FLOPS, and have early access to unreleased hardware.
The position requires strong expertise in deep learning implementation and software development, with a focus on performance optimization. You'll be collaborating with world-class engineers and researchers, providing guidance on improving workload design, usability, and performance. The ideal candidate should have excellent Python programming skills, strong analytical capabilities, and a solid understanding of deep learning fundamentals.
Working at NVIDIA, one of technology's most desirable employers, you'll be at the forefront of the AI revolution, directly impacting hardware and software roadmaps. The company offers a diverse and inclusive work environment, with the opportunity to work remotely from various European locations. This role is perfect for someone passionate about high-performance computing, deep learning, and pushing the boundaries of what's possible in AI technology.