NVIDIA, a pioneer in accelerated computing and AI technology for over 25 years, is seeking a Senior LLM Systems Engineer for its NeMo Microservice Platform team. The team builds primitives that enable software engineers to train and deploy AI at scale. The position involves developing distributed cloud applications and microservices capable of handling massive language models, implementing core infrastructure for cloud-native AI evaluation, and optimizing service performance.
The ideal candidate will have 5+ years of experience and strong expertise in Python programming, microservices architecture, and distributed systems. They'll work on creating task-specific AI cloud services and improving service stability, observability, and reliability. The role requires a deep understanding of performance optimization, security, and reliability in complex distributed environments.
NVIDIA offers a competitive compensation package with a base salary range of $148,000 to $287,500 USD, plus equity and benefits. The company provides a diverse, supportive environment where innovation is encouraged and employees can make lasting impacts on the world. The position is available in multiple locations, including the Santa Clara, CA headquarters and remote options in several US states.
The role presents an opportunity to work with cutting-edge AI technology and contribute to all aspects of the machine learning lifecycle, from conceptualization to deployment. Experience with Rust or Golang and a background in deep learning research, particularly in model evaluation techniques, would be valuable assets for this position.