Software Engineer, Model Inference

OpenAI

OpenAI is an AI research and deployment company dedicated to ensuring that general-purpose artificial intelligence benefits all of humanity.

San Francisco Bay Area, CA, USA

$200,000 - $370,000

Backend

Senior Software Engineer

In-Person

501 - 1,000 Employees

3+ years of experience

This job posting may no longer be active. You may be interested in these related jobs instead:

Description For Software Engineer, Model Inference

OpenAI is seeking a Software Engineer for Model Inference to join their Applied AI team. This role focuses on scaling up critical inference infrastructure to efficiently service customer requests for state-of-the-art AI models like GPT-4 and DALL-E.

Key responsibilities include:

Collaborating with ML researchers, engineers, and product managers to productionize latest technologies
Improving performance, latency, throughput, and efficiency of deployed models
Building tools for visibility into bottlenecks and addressing high-priority issues
Optimizing code and Azure VMs to maximize hardware utilization

The ideal candidate should have:

Understanding of modern ML architectures and optimization for inference
At least 3 years of professional software engineering experience
Expertise in HPC technologies (InfiniBand, MPI, CUDA)
Experience with production distributed systems
Ability to work on end-to-end problems and learn new skills as needed

OpenAI offers a competitive compensation package, including equity and comprehensive benefits such as medical insurance, 401(k) matching, unlimited time off, and parental leave. The company values diversity and is committed to creating an inclusive environment for all employees.

Join OpenAI in shaping the future of AI technology and ensuring its benefits are widely shared.

Last updated a year ago

Responsibilities For Software Engineer, Model Inference

Work with ML researchers, engineers, and product managers to bring latest technologies into production
Improve performance, latency, throughput, and efficiency of deployed models
Build tools for visibility into bottlenecks and address high-priority issues
Optimize code and Azure VMs to maximize hardware utilization

Requirements For Software Engineer, Model Inference

Python

At least 3 years of professional software engineering experience
Expert in core HPC technologies: InfiniBand, MPI, CUDA
Experience architecting, observing, and debugging production distributed systems
Understanding of modern ML architectures and optimization for inference
Ability to own problems end-to-end and learn new skills as needed

Benefits For Software Engineer, Model Inference

Medical Insurance

Dental Insurance

Vision Insurance

401k

Education Budget

Parental Leave

Mental Health Assistance

Medical, dental, and vision insurance for you and your family
Mental health and wellness support
401(k) plan with 50% matching
Unlimited time off and 13 company holidays per year
Paid parental leave (20 weeks) and family-planning support
Annual learning & development stipend ($1,500 per year)