Taro Logo

Senior Research Engineer, Model Evaluation

AI company training and deploying frontier models for developers and enterprises to power AI systems for content generation, semantic search, RAG, and agents.
Machine Learning
Senior Software Engineer
Hybrid
501 - 1,000 Employees
5+ years of experience
AI
This job posting may no longer be active. You may be interested in these related jobs instead:

Description For Senior Research Engineer, Model Evaluation

Cohere is an AI company with a mission to scale intelligence to serve humanity. They're seeking a Senior Research Engineer specializing in Model Evaluation to join their team. This role is crucial for advancing the field of AI evaluation as models become increasingly sophisticated. The position involves developing cutting-edge evaluation methods and infrastructure to measure LLM performance accurately. The ideal candidate will work on creating evaluation benchmarks, conducting research on LLM evaluation methods, and building scalable analysis tools. They'll collaborate with top researchers and engineers in the field. The company offers a strong benefits package including health coverage, parental leave, and flexible work arrangements. Cohere values diversity and maintains offices in major tech hubs while supporting hybrid work. This role represents an opportunity to shape the future of AI evaluation and work with frontier models in a fast-paced, customer-focused environment. The company emphasizes both technical excellence and practical impact, making it an ideal place for those passionate about advancing AI capabilities while ensuring their accurate measurement and evaluation.

Last updated 8 days ago

Responsibilities For Senior Research Engineer, Model Evaluation

  • Develop evaluation benchmarks, datasets, and environments for measuring model capabilities
  • Conduct research to push the state-of-the-art in LLM evaluation methods
  • Build scalable tools for investigating and understanding evaluation results
  • Work with researchers and engineers in the field

Requirements For Senior Research Engineer, Model Evaluation

  • Experience in building high-quality evaluation resources (datasets, simulators, environments)
  • Track record of developing new methods/data to evaluate LLMs
  • Deep experience building with and around LLMs
  • Strong software engineering skills

Benefits For Senior Research Engineer, Model Evaluation

Dental Insurance
Medical Insurance
Mental Health Assistance
Parental Leave
  • Weekly lunch stipend, in-office lunches & snacks
  • Full health and dental benefits
  • Mental health budget
  • 100% Parental Leave top-up for 6 months
  • Personal enrichment benefits
  • Remote-flexible work
  • Co-working stipend
  • 6 weeks of vacation

Interested in this job?