Staff Software Engineer, Capacity Engineering

A platform where millions of people find creative ideas and inspiration to create a life they love.
$170,371 - $350,763
Machine Learning
Staff Software Engineer
Hybrid
5,000+ Employees
8+ years of experience
AI · Enterprise SaaS

Description For Staff Software Engineer, Capacity Engineering

Pinterest is seeking a Staff Software Engineer to join their Capacity Engineering team, focusing on managing and optimizing ML infrastructure. This is a highly impactful role at one of the world's largest cloud-native infrastructures, where efficiency is a strategic priority. The position offers direct visibility across Pinterest Engineering and company leadership.

The role involves managing ML hardware capacity, improving infrastructure efficiency, and building profiling capabilities at Pinterest's scale. The ideal candidate should have deep expertise in GPU architectures, ML software stacks, and cloud platforms like AWS. They should be proficient in languages like Python, Java, and C++, with experience in distributed systems and Kubernetes.

Pinterest provides a flexible working environment through their PinFlex program, requiring only 1-2 office visits per quarter. The compensation is highly competitive, ranging from $170,371 to $350,763 USD, plus equity. The company offers comprehensive benefits and emphasizes an inclusive workplace culture.

This position offers an opportunity to work on cutting-edge ML infrastructure, solving complex technical challenges at scale. The role combines technical leadership with hands-on engineering, requiring both deep technical knowledge and excellent communication skills. Pinterest's platform serves millions of users worldwide, making this role crucial for maintaining and optimizing the infrastructure that powers their ML capabilities.

As a Staff Engineer, you'll collaborate with ML Platform, Infrastructure Engineering, and SRE teams, contributing to Pinterest's mission of bringing inspiration to users worldwide. The role requires a balance of technical expertise, leadership skills, and strategic thinking to drive efficiency improvements across Pinterest's ML infrastructure.

Last updated 9 hours ago

Responsibilities For Staff Software Engineer, Capacity Engineering

  • Manage the ML hardware capacity that powers the models running at Pinterest
  • Improve the efficiency of ML Infrastructure at Pinterest
  • Build develop and mature profiling and optimization capabilities for ML Infrastructure at Pinterest scale
  • Collaborate with ML Platform, Infrastructure Engineering and SRE teams

Requirements For Staff Software Engineer, Capacity Engineering

Python
Java
Kubernetes
  • Deep understanding of GPU Architectures, Pytorch, etc.
  • Deep understanding of supporting parts of ML software stack like Scheduling, Data and Storage
  • Hands on experience with shared platforms like Kubernetes
  • Strong technical and performance engineering skills
  • Experience building and managing highly available distributed applications at scale
  • Proficiency in software development languages such as Java, Python and C++
  • Excellent skills in communicating complex technical issues
  • Understanding of ML Models, Kernels and optimization opportunities
  • Hands-on experience with large, cloud-native multi-tenant platforms at Internet scale
  • Experience with AWS or similar cloud environments
  • Deep understanding of infrastructure capacity and performance
  • Bachelor's degree in Computer Science, Engineering, or related field, or equivalent experience

Benefits For Staff Software Engineer, Capacity Engineering

Medical Insurance
Equity
  • Equity compensation
  • Comprehensive benefits package
  • Flexible working model (PinFlex)

Interested in this job?

Jobs Related To Pinterest Staff Software Engineer, Capacity Engineering

Staff Machine Learning Engineer - Responsible AI

Lead responsible AI initiatives at Pinterest as a Staff Machine Learning Engineer, focusing on fairness, ethics, and bias mitigation in ML systems.

Staff Software Engineer, Ads ML Training Infrastructure

Staff Software Engineer position at Pinterest focusing on Ads ML Training Infrastructure, building and optimizing machine learning systems for advertising models.

Manager II, Machine Learning - Search Retrieval

Lead Pinterest's ML Search Retrieval team, managing large-scale search systems and ML engineers to enhance search experience for 500M+ users.

Staff GenAI Evaluation Engineer - Qualcomm Research

Staff GenAI Evaluation Engineer position at Qualcomm Research focusing on machine learning algorithm evaluation and optimization for embedded GenAI systems.

Staff Software Engineer, Machine Learning, JAX Third Party

Staff Software Engineer position at Google focusing on Machine Learning and JAX Third Party development, offering competitive compensation and the opportunity to work on cutting-edge ML infrastructure.