NVIDIA is seeking a Senior Applied Machine Learning Engineer to help build the NeMo Microservices Suite Platform. The role involves developing next-generation AI services and interfaces for training and fine-tuning machine learning models and deploying AI at scale. The team focuses on speech, vision, and NLP technologies, contributing to all steps of the machine learning lifecycle.
Key responsibilities include:
- Developing a new generation of Compound AI Systems platform with reasoning capabilities across multiple modalities
- Creating distributed cloud applications, microservices, and MLOps platforms for large-scale models
- Implementing core infrastructure for cloud-native AI training and inference
- Optimizing performance under high load
Requirements:
- BS, Masters, or equivalent experience in computer science, computer architecture, or related field
- 5+ years of experience
- Exceptional coding skills and ability to create high-quality software
- Experience with microservices, cloud-native applications, and related technologies
- Proficiency in deploying applications on Kubernetes
- Strong understanding of performance, security, and reliability in complex distributed infrastructure
- Excellent Python or Golang programming skills
Preferred qualifications include experience with machine learning frameworks, MLOps platforms, and production NLP systems.
NVIDIA offers a competitive base salary range of $148,000 - $276,000 USD, along with equity and comprehensive benefits. The company values diversity and is an equal opportunity employer, committed to fostering an inclusive work environment.