Taro Logo

Staff Software Engineer, ML Training Platform

Pinterest is a visual discovery platform where millions of people find inspiration and plan for what matters most in their lives.
$166,694 - $342,844
Machine Learning
Staff Software Engineer
Remote
1,000 - 5,000 Employees
7+ years of experience
This job posting may no longer be active. You may be interested in these related jobs instead:

Description For Staff Software Engineer, ML Training Platform

Pinterest is seeking a highly skilled and experienced Staff Software Engineer to join their ML Training Infrastructure team and lead the technical strategy. The role involves implementing cost-effective and scalable solutions for ML training and inference workloads, leading key projects such as GPU sharing and intelligent resource management, and setting the multi-year roadmap for ML Training Infrastructure. The ideal candidate will have 7+ years of experience in software engineering and machine learning, with a focus on building and maintaining ML infrastructure or Batch Compute infrastructure. They will collaborate with internal clients, mentor engineers, and forge strong partnerships across the organization. The position offers a competitive salary range of $166,694—$342,844 USD and is eligible for equity. Pinterest values diversity and inclusion, offering equal opportunities to all qualified applicants.

Key Responsibilities:

  • Implement scalable solutions for ML workloads on platforms like Kubernetes
  • Lead projects in GPU sharing, intelligent resource management, and fault-tolerant training
  • Set multi-year roadmap for ML Training Infrastructure
  • Collaborate with internal clients and ML engineers
  • Forge partnerships with tech leaders in Data and Infra organizations
  • Mentor engineers and demonstrate technical leadership

Required Skills:

  • 7+ years of experience in software engineering and machine learning
  • Technical leadership experience
  • Strong understanding of High Performance Computing and parallel computing
  • Ability to drive cross-team projects
  • Strong experience in Python and/or other programming languages
  • Experience with GPU programming, containerization, and orchestration technologies (preferred)

Pinterest offers a flexible work model called PinFlex, allowing for remote work options. The company is committed to creating an equitable, inclusive, and inspiring workplace for every employee.

Last updated a year ago

Responsibilities For Staff Software Engineer, ML Training Platform

  • Implement cost effective and scalable solutions for ML workloads
  • Lead key projects in GPU sharing, intelligent resource management, and fault-tolerant training
  • Set multi-year roadmap for ML Training Infrastructure
  • Collaborate with internal clients and ML engineers
  • Forge partnerships with tech leaders in Data and Infra organizations
  • Mentor engineers and demonstrate technical leadership

Requirements For Staff Software Engineer, ML Training Platform

Python
Kubernetes
  • 7+ years of experience in software engineering and machine learning
  • Technical leadership experience
  • Strong understanding of High Performance Computing and parallel computing
  • Ability to drive cross-team projects
  • Strong experience in Python and/or other programming languages
  • Experience with GPU programming, containerization, orchestration technologies (preferred)

Benefits For Staff Software Engineer, ML Training Platform

Equity
  • Equity

Interested in this job?