Software Engineer, Model Inference

OpenAI is an AI research and deployment company dedicated to ensuring that general-purpose artificial intelligence benefits all of humanity.
$200,000 - $370,000
Backend
Senior Software Engineer
In-Person
501 - 1,000 Employees
3+ years of experience
This job posting may no longer be active.

Description For Software Engineer, Model Inference

OpenAI is seeking a Software Engineer for Model Inference to join their Applied AI team. This role focuses on scaling up critical inference infrastructure to efficiently service customer requests for state-of-the-art AI models like GPT-4 and DALL-E.

Key responsibilities include:

  • Collaborating with ML researchers, engineers, and product managers to productionize the latest technologies
  • Improving performance, latency, throughput, and efficiency of deployed models
  • Building tools for visibility into bottlenecks and addressing high-priority issues
  • Optimizing code and Azure VMs to maximize hardware utilization

The ideal candidate should have:

  • Understanding of modern ML architectures and optimization for inference
  • At least 3 years of professional software engineering experience
  • Expertise in HPC technologies (InfiniBand, MPI, CUDA)
  • Experience with production distributed systems
  • Ability to work on end-to-end problems and learn new skills as needed

OpenAI offers a competitive compensation package, including equity and comprehensive benefits such as medical insurance, 401(k) matching, unlimited time off, and parental leave. The company values diversity and is committed to creating an inclusive environment for all employees.

Join OpenAI in shaping the future of AI technology and ensuring its benefits are widely shared.

Last updated a year ago

Responsibilities For Software Engineer, Model Inference

  • Work with ML researchers, engineers, and product managers to bring the latest technologies into production
  • Improve performance, latency, throughput, and efficiency of deployed models
  • Build tools for visibility into bottlenecks and address high-priority issues
  • Optimize code and Azure VMs to maximize hardware utilization

Requirements For Software Engineer, Model Inference

Python
  • At least 3 years of professional software engineering experience
  • Expert in core HPC technologies: InfiniBand, MPI, CUDA
  • Experience architecting, observing, and debugging production distributed systems
  • Understanding of modern ML architectures and optimization for inference
  • Ability to own problems end-to-end and learn new skills as needed

Benefits For Software Engineer, Model Inference

  • Medical, dental, and vision insurance for you and your family
  • Mental health and wellness support
  • 401(k) plan with 50% matching
  • Unlimited time off and 13 company holidays per year
  • Paid parental leave (20 weeks) and family-planning support
  • Annual learning & development stipend ($1,500 per year)
