NVIDIA, the pioneer in accelerated computing and AI innovation, is seeking a Senior Software Engineer specializing in Deep Learning Inference. This role sits at the intersection of cutting-edge AI technology and performance optimization, focusing on building software solutions for efficient inference on state-of-the-art generative AI models. The position involves working with NVIDIA's hardware and software stack, from server-level request batching to GPU kernel fusion.
The ideal candidate will join a team that tackles challenges across all levels of the technology stack, collaborating with diverse teams worldwide. They'll be responsible for optimizing inference workloads, implementing low-level GPU code, and delivering production-grade solutions. The role requires expertise in software engineering, machine learning concepts, and performance optimization.
NVIDIA offers the opportunity to work with industry-leading technology and contribute to transformative AI solutions. The company is known for its innovative culture and commitment to pushing technological boundaries. As part of NVIDIA's team, you'll be at the forefront of the deep learning revolution, working with the latest hardware and software technologies in AI computing.
The position is based in Ramat Gan, Israel, offering the chance to work with a global team while contributing to projects that impact the future of AI and computing. NVIDIA provides a diverse and inclusive work environment, valuing creativity and autonomous thinking in its employees.