OpenAI is seeking a Research Engineer/Scientist to join their ChatGPT RLHF team, a specialized subteam within the Post-Training organization. This role combines cutting-edge research with engineering, focusing on aligning ChatGPT models with user needs through Reinforcement Learning with Human Feedback (RLHF). The position offers a competitive salary range of $295K-$530K plus equity and comprehensive benefits.
The ideal candidate should have 2+ years of experience in reinforcement learning or large-scale ML systems, along with a Ph.D. or equivalent research experience. You'll work at the intersection of research and product development, contributing to the advancement of AI alignment and model optimization. The role involves developing advanced reward models, building robust evaluations, and collaborating with cross-functional teams to deploy models in production.
Based in San Francisco with a hybrid work model (3 days in office), this position offers the opportunity to directly impact millions of users globally. The team's mission is to make ChatGPT more helpful and personalized through large-scale feedback learning. Benefits include comprehensive health insurance, mental health support, generous parental leave (24 weeks for birth parents), and an annual learning stipend.
OpenAI provides an inclusive work environment, committed to equal opportunity and reasonable accommodations. The company's mission focuses on ensuring AI benefits all of humanity, making this role ideal for those passionate about developing safe, user-focused AI systems with real-world impact.