Taro Logo

Software Engineer L5, Offline Inference, Machine Learning Platform

World's leading entertainment service with over 300 million paid memberships in 190+ countries offering TV series, films and games.
United States
$100,000 - $720,000
Machine Learning
Staff Software Engineer
Remote
5,000+ Employees
5+ years of experience
AI · Enterprise SaaS · Entertainment

Description For Software Engineer L5, Offline Inference, Machine Learning Platform

Netflix, the world's leading entertainment service with 300M+ members across 190+ countries, is seeking a Staff Software Engineer (L5) for their Machine Learning Platform team. This role focuses on the Offline Inference team, which is crucial for batch-prediction capabilities supporting various ML models including LLMs and computer-vision systems. The position offers a unique opportunity to build next-generation systems for large-scale batch inference workloads while creating seamless experiences for ML practitioners.

The role combines deep technical expertise in distributed systems with ML infrastructure knowledge, requiring the ability to design and operate robust services that handle everything from minutes-long to multi-day jobs. You'll be instrumental in building developer-friendly tools and APIs that democratize ML capabilities across Netflix, particularly in content and media domains.

As part of Netflix's innovative culture, you'll work with cutting-edge technologies and contribute to systems that directly impact how Netflix creates and produces the content millions enjoy. The position offers competitive compensation ($100K-$720K), comprehensive benefits, and the flexibility of remote work within the USA. You'll join a team that values engineering excellence, operational reliability, and effective collaboration across distributed teams.

The ideal candidate will bring strong experience in ML engineering, scalable infrastructure, and modern backend development, with expertise in containerization and cloud services. This role presents an exceptional opportunity to shape the future of ML infrastructure at one of the world's most influential entertainment companies.

Last updated 3 hours ago

Responsibilities For Software Engineer L5, Offline Inference, Machine Learning Platform

  • Build developer-friendly APIs, SDKs, and CLIs for batch inference jobs
  • Design, implement, and operate distributed services for batch inference workflows at massive scale
  • Instrument the platform for reliability, debuggability, observability, and cost control
  • Define SLOs and share an equitable on-call rotation
  • Foster a culture of engineering excellence through design reviews, mentorship, and feedback

Requirements For Software Engineer L5, Offline Inference, Machine Learning Platform

Python
Java
Scala
Kubernetes
  • Hands-on experience with ML engineering or production systems involving training or inference of deep-learning models
  • Proven track record of operating scalable infrastructure for ML workloads
  • Proficiency in one or more modern backend languages (e.g. Python, Java, Scala)
  • Production experience with containerization & orchestration and at least one major cloud provider (AWS preferred)
  • Comfortable with ambiguity and working across multiple layers of the tech stack
  • Commitment to operational best practices
  • Excellent written and verbal communication skills
  • Comfortable working in a team with peers and partners distributed across US geographies & time zones

Benefits For Software Engineer L5, Offline Inference, Machine Learning Platform

Medical Insurance
Mental Health Assistance
401k
Vision Insurance
Dental Insurance
Parental Leave
  • Health Plans
  • Mental Health support
  • 401(k) Retirement Plan with employer match
  • Stock Option Program
  • Disability Programs
  • Health Savings and Flexible Spending Accounts
  • Family-forming benefits
  • Life and Serious Injury Benefits
  • 35 days annually for paid time off (hourly employees)
  • Flexible time off (salaried employees)

Interested in this job?

Jobs Related To Netflix Software Engineer L5, Offline Inference, Machine Learning Platform