Software Engineer, Inference

Anthropic

Anthropic's mission is to create reliable, interpretable, and steerable AI systems. We want AI to be safe and beneficial for our users and for society as a whole.

San Francisco Bay Area, CA, USA • New York, NY, USA • Seattle, WA, USA

$280,000 - $510,000

Backend

Senior Software Engineer

Hybrid

5+ years of experience

This job posting may no longer be active. You may be interested in these related jobs instead:

Senior Engineer - Backend (Java)

PayPal

Senior Backend Engineer position at PayPal focusing on Java development and production system reliability, offering competitive benefits and hybrid work arrangement in Chennai.

Senior Full Stack Engineer

Fidelity Investments

Senior Full Stack Engineer role at Fidelity Investments focusing on building scalable web applications using Angular, NodeJS, and Java, with 5+ years of experience required.

Systems Engineer

Dell Technologies

Senior Systems Engineer position at Dell Technologies in Riyadh, focusing on pre-sales technical support and solution architecture for enterprise customers.

Software Senior Engineer

Dell Technologies

Senior Software Engineer position at Dell Technologies in Bangalore, focusing on Java enterprise applications and cloud infrastructure, requiring 5-8 years of experience.

Performance & Scale Engineer

PayPal (Venmo)

Senior Performance & Scale Engineer position at PayPal's Venmo, focusing on platform reliability, scalability, and efficiency optimization through advanced performance engineering and testing.

Description For Software Engineer, Inference

Anthropic is seeking a Software Engineer for their Inference team to build the service that generates outputs from their models in production. This role is crucial in driving efficiency, latency, and reliability. As an engineer on this team, you'll work on improving these metrics by solving complex distributed-systems problems across all layers of the stack.

The ideal candidate should have significant software engineering experience and be results-oriented with a bias towards flexibility and impact. You should be willing to pick up slack even if it goes outside your job description, enjoy pair programming, want to learn more about machine learning research, and care about the societal impacts of your work.

Strong candidates may also have experience with high performance, large-scale distributed systems, Kubernetes, Python, and machine learning. The role involves working on projects such as improving inference request routing, building performance models, implementing new model architectures, analyzing observability data, and optimizing accelerator kernels.

Anthropic offers a competitive compensation package including a salary range of $280,000 to $510,000 USD, equity, and comprehensive benefits. The company has a hybrid work policy, expecting staff to be in one of their offices at least 25% of the time.

If you're passionate about creating reliable, interpretable, and steerable AI systems that are safe and beneficial for users and society, this role at Anthropic could be an excellent opportunity to make a significant impact in the field of AI development and safety.

Last updated 9 months ago

Responsibilities For Software Engineer, Inference

Improving how inference requests are routed to model servers to maximize compute efficiency
Building a performance model to predict the impact of future architecture and hardware improvements
Implementing inference for a new model architecture
Analyzing observability data to tune performance based on production workloads
Implementing inference on a new hardware platform
Building instrumentation to detect and eliminate Python GIL contention
Optimizing the efficiency of our accelerator kernels
Ensuring smooth and regular deployment of inference services

Requirements For Software Engineer, Inference

Python

Kubernetes

Significant software engineering experience
Results-oriented, with a bias towards flexibility and impact
Pick up slack, even if it goes outside your job description
Enjoy pair programming
Want to learn more about machine learning research
Care about the societal impacts of your work

Benefits For Software Engineer, Inference

Medical Insurance

Dental Insurance

Vision Insurance

401k

Parental Leave

Comprehensive health, dental, and vision insurance for you and all your dependents
401(k) plan with 4% matching
22 weeks of paid parental leave
Unlimited PTO
Stipends for education, home office improvements, commuting, and wellness
Fertility benefits via Carrot
Daily lunches and snacks in our office
Relocation support for those moving to the Bay Area