NVIDIA's Deep Learning Libraries Group is seeking a Senior Infrastructure Software Engineer to drive the development and maintenance of their high-performance deep learning libraries infrastructure. This role is crucial in enabling NVIDIA's next generation of AI platforms, working across multiple products including cuDNN, TensorRT, and CUDA kernel libraries. The position focuses on designing and developing scalable, modular infrastructure that streamlines development, builds, and tests across NVIDIA's diverse platforms, from Drive AGX for autonomous vehicles to DGX servers for datacenters.
The ideal candidate will be responsible for building and maintaining the infrastructure that supports NVIDIA's open-source-first strategy, implementing scalable automation for build, test, integration, and release processes. They will work throughout the software stack, from user interfaces down to cluster and database layers, while configuring and maintaining industry-standard tools like Kubernetes, Jenkins, Docker, and CMake.
This is an opportunity to have a significant impact at NVIDIA by improving development velocity across their AI/DL/Compute Software projects. The role requires a combination of technical expertise in software engineering, infrastructure automation, and deep learning technologies, along with the ability to work autonomously on challenging problems. The position offers the chance to work with a technically diverse team of software engineers and infrastructure experts, contributing to NVIDIA's mission of delivering the world's fastest deep learning platforms.