Taro Logo

Member of Technical Staff: Research Engineer, Post-Training

Essential AI's mission is to deepen the partnership between humans and computers, unlocking collaborative capabilities that far exceed what could be achieved today.
Machine Learning
Senior Software Engineer
In-Person
This job posting may no longer be active. You may be interested in these related jobs instead:

Description For Member of Technical Staff: Research Engineer, Post-Training

Essential AI's mission is to deepen the partnership between humans and computers, unlocking collaborative capabilities that far exceed what could be achieved today. We believe that building delightful end-user experiences requires innovating across the stack - from the UX all the way down to models that achieve the best user value per FLOP.

The Research Engineer, Post-Training will be responsible for developing and implementing techniques to optimize and fine-tune models after the initial training process, with the goal of improving performance, robustness, and efficiency. You will work cross-functionally to identify areas for post-training optimization and measure the impact on model performance. You will also be responsible for benchmarking and evaluating post-training techniques on a variety of datasets and model architectures.

Key responsibilities include:

  • Leading or being a core contributor to research bets that advance the real-world capabilities of our models
  • Collaborating closely with product teams to close the loop between research and product
  • Developing and implementing novel post-training techniques to optimize machine learning models
  • Benchmarking and evaluating post-training techniques on various datasets and model architectures
  • Implementing post-training models and algorithms, optimizing for performance and scalability

We are looking for candidates with:

  • Research experience focusing on post-training and optimizing large language models
  • Strong ML fundamentals and first principles thinking
  • Experience in developing new methods or improving existing techniques in ML
  • Proficiency in programming languages such as Python, C++, or Java
  • Familiarity with ML deployment and orchestration

We are based in-person in SF and fully onsite 5 days a week, offering relocation assistance to new employees. Join us in building a world-class multi-disciplinary team excited to solve hard real-world AI problems.

Last updated a year ago

Responsibilities For Member of Technical Staff: Research Engineer, Post-Training

  • Lead or be a core contributor to research bets that advance the real-world capabilities of our models
  • Collaborate closely with product teams to close the loop between research and product, identify capability gaps and evaluate progress
  • Develop and implement novel post-training techniques to optimize machine learning models
  • Benchmark and evaluate post-training techniques on a variety of datasets and model architectures
  • Analyze experimental results to gain insights into model behavior and identify areas for improvement
  • Implement post-training models and algorithms; Optimize model performance and scalability for deployment in production environments

Requirements For Member of Technical Staff: Research Engineer, Post-Training

Python
Java
  • Research experience with a focus on post-training and optimizing large language models using frameworks such as Megatron, DeepSpeed, MaxText, etc.
  • Strong ML fundamentals and first principles thinking
  • Experience of coming up with new methods or improving existing techniques in ML or related fields
  • Proficiency in programming languages such as Python, C++, or Java
  • Familiarity with ML deployment and orchestration
  • Strong problem solving, analytical, communication, and collaboration skills

Interested in this job?