Taro Logo

Software Engineer, Model Inference

AI research and deployment company dedicated to ensuring general-purpose artificial intelligence benefits humanity
$325,000 - $490,000
Backend
Senior Software Engineer
In-Person
1,000 - 5,000 Employees
5+ years of experience
AI

Description For Software Engineer, Model Inference

OpenAI is seeking a Senior Software Engineer for their Model Inference team, focusing on optimizing the world's largest AI models for production use. This role sits at the intersection of cutting-edge AI research and practical implementation, working within the Scaling department to make OpenAI's most capable models accessible through their products.

The position offers a competitive compensation package of $325K-$490K plus equity and comprehensive benefits, including medical coverage and parental leave. Based in San Francisco, you'll be part of a team that brings state-of-the-art AI models to consumers, enterprises, and developers.

The ideal candidate brings 5+ years of software engineering experience, with expertise in distributed systems and ML architecture optimization. You'll work on high-stakes challenges, optimizing performance, latency, and efficiency of model inference systems. Key responsibilities include collaborating with researchers and product managers, implementing new architectural improvements, and ensuring optimal hardware utilization.

This role is perfect for someone who combines technical depth with practical problem-solving abilities. You'll need familiarity with PyTorch, GPU optimization, and HPC technologies. The position offers unique opportunities to impact AI development at scale, working with cutting-edge technology while ensuring it benefits humanity. OpenAI's commitment to safety and human-centric AI development makes this an opportunity to contribute to responsible AI advancement.

Last updated 3 hours ago

Responsibilities For Software Engineer, Model Inference

  • Work alongside machine learning researchers, engineers, and product managers to bring latest technologies into production
  • Enable advanced research through engineering
  • Introduce new techniques, tools, and architecture to improve performance, latency, throughput, and efficiency of model inference stack
  • Build tools for visibility into bottlenecks and sources of instability
  • Optimize code and fleet of Azure VMs to utilize GPU hardware efficiently

Requirements For Software Engineer, Model Inference

Python
Linux
  • Understanding of modern ML architectures and optimization for inference
  • Ability to own problems end-to-end
  • At least 5 years of professional software engineering experience
  • Familiarity with PyTorch, NVidia GPUs and software stacks (NCCL, CUDA), HPC technologies
  • Experience with architecting, building, observing, and debugging production distributed systems
  • Experience with rebuilding/refactoring production systems for scale
  • Self-directed with ability to prioritize important problems
  • Humble attitude and eagerness to help colleagues

Benefits For Software Engineer, Model Inference

Medical Insurance
Dental Insurance
Vision Insurance
Mental Health Assistance
401k
Parental Leave
Education Budget
  • Medical, dental, and vision insurance for you and your family
  • Mental health and wellness support
  • 401(k) plan with 50% matching
  • Generous time off and company holidays
  • 24 weeks paid birth-parent leave & 20-week paid parental leave
  • Annual learning & development stipend ($1,500 per year)
  • Equity compensation

Interested in this job?

Jobs Related To OpenAI Software Engineer, Model Inference

Forward Deployed Engineer - Tokyo

Senior Forward Deployed Engineer position at OpenAI Tokyo, focusing on implementing AI solutions for strategic customers, requiring bilingual Japanese-English fluency and 4+ years of experience.

Full-Stack Engineer, Public Sector

Senior Full-Stack Engineer role at OpenAI focusing on public sector implementations, offering $255K-$405K plus equity, requiring 5+ YOE and active US security clearance.

Software Engineer, Backend (Knowledge Innovation)

Senior Backend Software Engineer role at OpenAI, building scalable knowledge systems with Python and PostgreSQL, 4+ years experience required, $325K + equity.

Software Engineer, Online Storage

Senior Software Engineer role at OpenAI focusing on building scalable database systems for ChatGPT and other AI products, offering $255K-$405K plus equity and benefits.

Sr Software Engineer

Senior Software Engineer position at Uber focusing on Spark and distributed computing infrastructure, requiring 5+ years of experience in building large-scale systems.