Capacity Efficiency & Performance Engineer

Anthropic creates reliable, interpretable, and steerable AI systems, focusing on safe and beneficial AI development through research and engineering.
$320,000 - $405,000
Cloud
Staff Software Engineer
Hybrid
5+ years of experience
AI · Enterprise SaaS
This job posting may no longer be active. You may be interested in these related jobs instead:
Staff Software Engineer, Google Cloud

Lead software development and technical direction for Google Cloud's infrastructure and AI systems, combining hands-on engineering with technical leadership to build scalable solutions.

OCI Multicloud SW Engineering Roles

Senior cloud engineering role at Oracle building distributed cloud infrastructure and services, requiring 6+ years of experience in systems development.

Oracle Cloud Operation Engineer

Oracle Cloud Operation Engineer position focusing on OCI cloud operations, database administration, and 24/7 production support for banking applications.

Staff Software Engineer - Cloud

Remote Staff Software Engineer position specializing in cloud technologies, requiring 7+ years of backend development experience and expertise in AWS/Azure/GCP.

Staff Cloud Platform Engineer - Core Infra

Staff Cloud Platform Engineer position at Sift, focusing on core infrastructure, distributed systems, and cloud platforms, offering competitive compensation and remote work flexibility.

Description For Capacity Efficiency & Performance Engineer

Anthropic is seeking a Capacity Efficiency & Performance Engineer to join their mission of creating reliable, interpretable, and steerable AI systems. This role is crucial in shaping the company's cloud infrastructure strategy and efficiency initiatives.

As a member of the Capacity Efficiency & Performance team, you'll work at the intersection of infrastructure, machine learning, and business operations. Your responsibilities will span from developing self-service tools for capacity monitoring to implementing sophisticated ML-informed forecasting models.

The ideal candidate brings 5+ years of experience in capacity efficiency or performance engineering, with strong knowledge of public cloud providers and Kubernetes-based ML infrastructure. You'll collaborate with research teams, engineering leadership, and finance stakeholders to optimize infrastructure costs and performance.

Anthropic offers a unique work environment focused on big science and collaborative research. Based in San Francisco, the company provides competitive compensation ($320,000-$405,000), comprehensive benefits, and a flexible hybrid work arrangement requiring minimum 25% office presence.

The role involves working with cutting-edge AI technology, including LLM training and inference workloads, while contributing to Anthropic's mission of developing safe and beneficial AI systems. You'll be part of a growing team of researchers, engineers, and policy experts working on some of the most impactful challenges in AI development.

This position offers the opportunity to work on large-scale distributed systems, implement advanced observability solutions, and partner with leading cloud providers and hardware vendors. The company values diversity of perspective and encourages applications from candidates with varied backgrounds and experiences.

Join Anthropic to help shape the future of AI infrastructure while working in an environment that prioritizes impact, scientific rigor, and ethical AI development. The company's commitment to public benefit and comprehensive approach to AI safety makes this an unique opportunity for those passionate about both technical excellence and responsible AI advancement.

Last updated 2 months ago

Responsibilities For Capacity Efficiency & Performance Engineer

  • Develop self-service tools and dashboards for capacity, efficiency, and cost monitoring
  • Design ML-informed forecasting models for capacity planning
  • Institute governance workflows for cloud resource management
  • Investigate capacity requests and recommend right-sizing strategies
  • Build comprehensive cost-to-serve analytics programs
  • Lead technical partnerships with cloud providers and hardware vendors
  • Design and implement observability solutions for infrastructure efficiency
  • Collaborate with engineering teams on Kubernetes-based ML infrastructure
  • Partner with research teams on computational requirements planning

Requirements For Capacity Efficiency & Performance Engineer

Python
Kubernetes
  • 5+ years experience in capacity efficiency or performance engineering
  • 5+ years experience in a technical role
  • Intermediate knowledge of various public cloud providers
  • Experience with data modeling for public cloud
  • Experience with budgeting and capacity planning
  • Experience in scripting and building automation tools
  • Self-disciplined and thrive in fast-paced environments
  • Excellent communication skills
  • Bachelor's degree in a related field or equivalent experience
  • Attention to detail and passion for correctness

Benefits For Capacity Efficiency & Performance Engineer

Visa Sponsorship
Parental Leave
  • Competitive compensation
  • Optional equity donation matching
  • Generous vacation
  • Parental leave
  • Flexible working hours
  • Office space in San Francisco
  • Visa sponsorship available

Interested in this job?