Software Engineer L5, Model Observability & Lifecycle Management, Machine Learning Platform

Global streaming entertainment service pioneering content delivery and technology innovation
United States
$100,000 - $619,000
Machine Learning
Staff Software Engineer
Remote
5,000+ Employees
5+ years of experience
AI · Enterprise SaaS

Description For Software Engineer L5, Model Observability & Lifecycle Management, Machine Learning Platform

Netflix's Machine Learning Platform (MLP) team is seeking a Staff Software Engineer to lead the Model Observability & Lifecycle Management initiatives. This role is crucial in developing a comprehensive MLOps platform that enhances the productivity of ML practitioners across Netflix. The position focuses on building systems for managing ML models, including visualization, observability, and performance benchmarking capabilities.

The role involves creating and expanding model observability workflows supporting various ML applications, from bandits to Large Language Models (LLMs). You'll be working on business-critical models across personalization, growth and commerce, ads, and studio algorithms, supporting hundreds of ML practitioners throughout the company.

As a Staff Engineer, you'll be responsible for developing sophisticated systems including observability dashboards, model registries, anomaly detection systems, and cost monitoring solutions. The position requires expertise in distributed systems, full-stack development, and cloud technologies, with a strong foundation in MLOps practices.

The team's mission is to maintain the reliability of ML applications through proactive issue detection and diagnosis. You'll work in a highly collaborative environment, partnering with engineers, product managers, and data scientists to drive innovation in Netflix's ML/AI initiatives. The role offers the opportunity to impact Netflix's ML infrastructure significantly while working with cutting-edge technologies and frameworks.

Netflix offers a unique compensation structure where you can choose your preferred mix of salary and stock options annually. The position provides competitive compensation ranging from $100,000 to $619,000, based on experience and expertise.

Last updated 2 days ago

Responsibilities For Software Engineer L5, Model Observability & Lifecycle Management, Machine Learning Platform

  • Develop and maintain observability dashboard and backend system for ML entities
  • Build and manage model registry for ML models cataloging and versioning
  • Implement anomaly and drift detection for models and features
  • Create cost monitoring and chargeback dashboards
  • Enhance user interfaces for ML practitioners

Requirements For Software Engineer L5, Model Observability & Lifecycle Management, Machine Learning Platform

Java
React
  • Experience building backend distributed systems and full-stack systems using object-oriented programming
  • Experience with web API frameworks (preferably Spring Boot) and UI frameworks like React
  • Experience working with public cloud platforms (AWS, Azure, or GCP)
  • Knowledge of ML model lifecycle management and MLOps best practices
  • Proactive communication skills with cross-functional teams
  • BS/MS in Computer Science, Applied Math, Engineering, or related field

Benefits For Software Engineer L5, Model Observability & Lifecycle Management, Machine Learning Platform

Equity
  • Flexible compensation structure (choice between salary and stock options)

Interested in this job?

Jobs Related To Netflix Software Engineer L5, Model Observability & Lifecycle Management, Machine Learning Platform

Machine Learning Software Engineer (L5) - Content and Studio

Senior Machine Learning Software Engineer position at Netflix, focusing on algorithm development and implementation for content localization, offering competitive compensation and comprehensive benefits.

Machine Learning Software Engineer L4/L5

Machine Learning Software Engineer position at Netflix focusing on developing and scaling ML algorithms for personalization systems.

Software Engineer L5 - Data and Feature Infrastructure, Machine Learning Platform

Staff Software Engineer position at Netflix focusing on building ML data and feature infrastructure to power machine learning models across various domains.

Research Scientist L4/L5, Algorithms Engineering

Senior Research Scientist position at Netflix focusing on machine learning and algorithms engineering, offering competitive compensation and remote work opportunities.

Software Engineer L5, Machine Learning Platform

Staff Software Engineer position at Netflix focusing on building and scaling machine learning infrastructure, offering competitive compensation and comprehensive benefits.