Senior Software Engineer, GenAI & ML Evaluation Frameworks - Grafana Ops, AI/ML

Grafana Labs

Developer of open source visualization and observability tools used by over 20M users globally, including major companies like Bloomberg, JPMorgan Chase, and eBay.

United States

$148,505 - $178,206

Machine Learning

Senior Software Engineer

Remote

1,000 - 5,000 Employees

5+ years of experience

AI · Enterprise SaaS

Description For Senior Software Engineer, GenAI & ML Evaluation Frameworks - Grafana Ops, AI/ML

Grafana Labs is seeking a Senior Software Engineer specializing in GenAI & ML Evaluation Frameworks to join their AI teams. This role is crucial in helping users understand and improve their systems through AI-driven features. The position focuses on building and evolving internal evaluation frameworks for Generative AI systems, particularly Large Language Models (LLMs).

The role involves designing and scaling automated evaluation pipelines, integrating them into CI/CD workflows, and defining metrics that align with both product goals and model behavior. You'll implement robust evaluation frameworks, develop tooling for automated assessment of model outputs, and lead dataset management processes.

Grafana Labs is a leader in observability tools, with their open-source visualization tool used by over 20M users globally. Their tools help users monitor everything from beehives to climate change, and their technology stack is used by major companies including Bloomberg, JPMorgan Chase, and eBay.

The ideal candidate should have strong experience in evaluating AI/ML systems, familiarity with prompt engineering, and the ability to work autonomously. You'll be joining a company that values pragmatic approaches, reproducibility, and thoughtful trade-offs when scaling GenAI systems.

This is a remote position based in the United States, offering competitive compensation between $148,505 and $178,206, along with equity and comprehensive benefits. The role provides an opportunity to shape the future of AI-driven observability tools while working with a team passionate about reducing human toil and building supportive AI systems.

Last updated 2 months ago

Responsibilities For Senior Software Engineer, GenAI & ML Evaluation Frameworks - Grafana Ops, AI/ML

Design and implement robust evaluation frameworks for GenAI and LLM-based systems
Develop tooling for automated evaluation of model outputs, prompts, and agent behaviors
Define and refine metrics that assess both the structure and the semantics of model outputs
Lead dataset management processes and guide teams in GenAI evaluation best practices

Requirements For Senior Software Engineer, GenAI & ML Evaluation Frameworks - Grafana Ops, AI/ML

Experience designing and implementing evaluation frameworks for AI/ML systems
Familiarity with prompt engineering, structured output evaluation, and context-window management in LLM systems
Ability to work with high autonomy while collaborating with teams to translate their goals into clear, testable criteria
Experience working in environments with rapid iteration and experimental development
Pragmatic mindset valuing reproducibility and developer experience

Benefits For Senior Software Engineer, GenAI & ML Evaluation Frameworks - Grafana Ops, AI/ML

Equity
Bonus (if applicable)
