NVIDIA, the world leader in accelerated computing, is seeking a Senior Software Engineer to join their Enterprise AI Software team. This role focuses on developing components for NVIDIA Inference Microservices (NIMs) and its deployed services. The position involves working with cutting-edge AI technology, specifically in optimizing and scaling LLM applications through containerized solutions.
The role requires expertise in distributed systems, containerization, and AI model deployment. You'll be responsible for designing and implementing high-performance inference solutions that leverage NVIDIA's GPU infrastructure. The work involves close collaboration with various teams, including software engineers, researchers, SREs, and product management.
This is an excellent opportunity for an experienced software engineer who wants to work at the intersection of AI and enterprise software. The ideal candidate will have strong technical skills in containerization (Docker, Kubernetes), distributed systems, and experience with LLM deployment. The position offers the chance to work on cutting-edge AI technology while contributing to NVIDIA's mission of transforming industries through accelerated computing.
Working at NVIDIA means joining one of technology's most desirable employers, known for innovation and forward-thinking approaches. The company's work in AI and digital twins is revolutionizing major industries, making this an exciting opportunity to be at the forefront of technological advancement.