Senior Research Engineer, Model Evaluation

Cohere

AI company training and deploying frontier models for developers and enterprises to power AI systems for content generation, semantic search, RAG, and agents.

Toronto, ON, Canada • New York, NY, USA • Seattle, WA, USA…

Machine Learning

Senior Software Engineer

Hybrid

501 - 1,000 Employees

5+ years of experience

Description For Senior Research Engineer, Model Evaluation

Cohere is an AI company with a mission to scale intelligence to serve humanity. They're seeking a Senior Research Engineer specializing in Model Evaluation to join their team. This role is crucial for advancing the field of AI evaluation as models become increasingly sophisticated.

The position involves developing cutting-edge evaluation methods and infrastructure to measure LLM performance accurately. The ideal candidate will work on creating evaluation benchmarks, conducting research on LLM evaluation methods, and building scalable analysis tools. They'll collaborate with top researchers and engineers in the field.

The company offers a strong benefits package including health coverage, parental leave, and flexible work arrangements. Cohere values diversity and maintains offices in major tech hubs while supporting hybrid work. This role represents an opportunity to shape the future of AI evaluation and work with frontier models in a fast-paced, customer-focused environment. The company emphasizes both technical excellence and practical impact, making it an ideal place for those passionate about advancing AI capabilities while ensuring their accurate measurement and evaluation.

Last updated 8 days ago

Responsibilities For Senior Research Engineer, Model Evaluation

Develop evaluation benchmarks, datasets, and environments for measuring model capabilities
Conduct research to push the state-of-the-art in LLM evaluation methods
Build scalable tools for investigating and understanding evaluation results
Work with researchers and engineers in the field

Requirements For Senior Research Engineer, Model Evaluation

Experience in building high-quality evaluation resources (datasets, simulators, environments)
Track record of developing new methods/data to evaluate LLMs
Deep experience building with and around LLMs
Strong software engineering skills

Benefits For Senior Research Engineer, Model Evaluation

Dental Insurance

Medical Insurance

Mental Health Assistance

Parental Leave

Weekly lunch stipend, in-office lunches & snacks
Full health and dental benefits
Mental health budget
100% Parental Leave top-up for 6 months
Personal enrichment benefits
Remote-flexible work
Co-working stipend
6 weeks of vacation