Taro Logo

Software Engineer, Inference

Anthropic's mission is to create reliable, interpretable, and steerable AI systems. We want AI to be safe and beneficial for our users and for society as a whole.
$280,000 - $510,000
Backend
Senior Software Engineer
Hybrid
5+ years of experience
This job posting may no longer be active. You may be interested in these related jobs instead:
Senior Engineer - Backend (Java)

Senior Backend Engineer position at PayPal focusing on Java development and production system reliability, offering competitive benefits and hybrid work arrangement in Chennai.

Senior Full Stack Engineer

Senior Full Stack Engineer role at Fidelity Investments focusing on building scalable web applications using Angular, NodeJS, and Java, with 5+ years of experience required.

Systems Engineer

Senior Systems Engineer position at Dell Technologies in Riyadh, focusing on pre-sales technical support and solution architecture for enterprise customers.

Software Senior Engineer

Senior Software Engineer position at Dell Technologies in Bangalore, focusing on Java enterprise applications and cloud infrastructure, requiring 5-8 years of experience.

Performance & Scale Engineer

Senior Performance & Scale Engineer position at PayPal's Venmo, focusing on platform reliability, scalability, and efficiency optimization through advanced performance engineering and testing.

Description For Software Engineer, Inference

Anthropic is seeking a Software Engineer for their Inference team to build the service that generates outputs from their models in production. This role is crucial in driving efficiency, latency, and reliability. As an engineer on this team, you'll work on improving these metrics by solving complex distributed-systems problems across all layers of the stack.

The ideal candidate should have significant software engineering experience and be results-oriented with a bias towards flexibility and impact. You should be willing to pick up slack even if it goes outside your job description, enjoy pair programming, want to learn more about machine learning research, and care about the societal impacts of your work.

Strong candidates may also have experience with high performance, large-scale distributed systems, Kubernetes, Python, and machine learning. The role involves working on projects such as improving inference request routing, building performance models, implementing new model architectures, analyzing observability data, and optimizing accelerator kernels.

Anthropic offers a competitive compensation package including a salary range of $280,000 to $510,000 USD, equity, and comprehensive benefits. The company has a hybrid work policy, expecting staff to be in one of their offices at least 25% of the time.

If you're passionate about creating reliable, interpretable, and steerable AI systems that are safe and beneficial for users and society, this role at Anthropic could be an excellent opportunity to make a significant impact in the field of AI development and safety.

Last updated 9 months ago

Responsibilities For Software Engineer, Inference

  • Improving how inference requests are routed to model servers to maximize compute efficiency
  • Building a performance model to predict the impact of future architecture and hardware improvements
  • Implementing inference for a new model architecture
  • Analyzing observability data to tune performance based on production workloads
  • Implementing inference on a new hardware platform
  • Building instrumentation to detect and eliminate Python GIL contention
  • Optimizing the efficiency of our accelerator kernels
  • Ensuring smooth and regular deployment of inference services

Requirements For Software Engineer, Inference

Python
Kubernetes
  • Significant software engineering experience
  • Results-oriented, with a bias towards flexibility and impact
  • Pick up slack, even if it goes outside your job description
  • Enjoy pair programming
  • Want to learn more about machine learning research
  • Care about the societal impacts of your work

Benefits For Software Engineer, Inference

Medical Insurance
Dental Insurance
Vision Insurance
401k
Parental Leave
  • Comprehensive health, dental, and vision insurance for you and all your dependents
  • 401(k) plan with 4% matching
  • 22 weeks of paid parental leave
  • Unlimited PTO
  • Stipends for education, home office improvements, commuting, and wellness
  • Fertility benefits via Carrot
  • Daily lunches and snacks in our office
  • Relocation support for those moving to the Bay Area

Interested in this job?