Software Engineer, Model Inference

OpenAI

AI research and deployment company dedicated to ensuring general-purpose artificial intelligence benefits humanity

San Francisco, CA, USA

$325,000 - $490,000

Backend

Senior Software Engineer

In-Person

1,000 - 5,000 Employees

5+ years of experience

Description For Software Engineer, Model Inference

OpenAI is seeking a Senior Software Engineer for their Model Inference team, focusing on optimizing the world's largest AI models for production use. This role sits at the intersection of cutting-edge AI research and practical implementation, working within the Scaling department to make OpenAI's most capable models accessible through their products.

The position offers a competitive compensation package of $325K-$490K plus equity and comprehensive benefits, including medical coverage and parental leave. Based in San Francisco, you'll be part of a team that brings state-of-the-art AI models to consumers, enterprises, and developers.

The ideal candidate brings 5+ years of software engineering experience, with expertise in distributed systems and ML architecture optimization. You'll work on high-stakes challenges, optimizing performance, latency, and efficiency of model inference systems. Key responsibilities include collaborating with researchers and product managers, implementing new architectural improvements, and ensuring optimal hardware utilization.

This role is perfect for someone who combines technical depth with practical problem-solving abilities. You'll need familiarity with PyTorch, GPU optimization, and HPC technologies. The position offers unique opportunities to impact AI development at scale, working with cutting-edge technology while ensuring it benefits humanity. OpenAI's commitment to safety and human-centric AI development makes this an opportunity to contribute to responsible AI advancement.

Last updated 3 hours ago

Responsibilities For Software Engineer, Model Inference

Work alongside machine learning researchers, engineers, and product managers to bring the latest technologies into production
Enable advanced research through engineering
Introduce new techniques, tools, and architectures to improve the performance, latency, throughput, and efficiency of the model inference stack
Build tools for visibility into bottlenecks and sources of instability
Optimize code and the fleet of Azure VMs to utilize GPU hardware efficiently

Requirements For Software Engineer, Model Inference

Python

Linux

Understanding of modern ML architectures and how to optimize them for inference
Ability to own problems end-to-end
At least 5 years of professional software engineering experience
Familiarity with PyTorch, NVIDIA GPUs and their software stacks (NCCL, CUDA), and HPC technologies
Experience with architecting, building, observing, and debugging production distributed systems
Experience with rebuilding/refactoring production systems for scale
Self-directed with ability to prioritize important problems
Humble attitude and eagerness to help colleagues

Benefits For Software Engineer, Model Inference

Medical, dental, and vision insurance for you and your family
Mental health and wellness support
401(k) plan with 50% matching
Generous time off and company holidays
24 weeks of paid birth-parent leave and 20 weeks of paid parental leave
Annual learning & development stipend ($1,500 per year)
Equity compensation
