Netflix's Machine Learning Platform (MLP) team is seeking a Staff Software Engineer to lead the Model Observability & Lifecycle Management initiatives. This role is crucial in developing a comprehensive MLOps platform that enhances the productivity of ML practitioners across Netflix. The position focuses on building systems for managing ML models, including visualization, observability, and performance benchmarking capabilities.
The role involves creating and expanding model observability workflows supporting various ML applications, from bandits to Large Language Models (LLMs). You'll be working on business-critical models across personalization, growth and commerce, ads, and studio algorithms, supporting hundreds of ML practitioners throughout the company.
As a Staff Engineer, you'll be responsible for developing sophisticated systems including observability dashboards, model registries, anomaly detection systems, and cost monitoring solutions. The position requires expertise in distributed systems, full-stack development, and cloud technologies, with a strong foundation in MLOps practices.
The team's mission is to maintain the reliability of ML applications through proactive issue detection and diagnosis. You'll work in a highly collaborative environment, partnering with engineers, product managers, and data scientists to drive innovation in Netflix's ML/AI initiatives. The role offers the opportunity to impact Netflix's ML infrastructure significantly while working with cutting-edge technologies and frameworks.
Netflix offers a unique compensation structure where you can choose your preferred mix of salary and stock options annually. The position provides competitive compensation ranging from $100,000 to $619,000, based on experience and expertise.