Senior Machine Learning Engineer, RAG

Klue is a VC-backed, capital-efficient growing SaaS company creating the category of competitive enablement, helping companies understand their market and outmaneuver their competition.
$144,000 - $160,000
Machine Learning
Senior Software Engineer
Hybrid
2+ years of experience
AI · Enterprise SaaS
This job posting may no longer be active. You may be interested in these related jobs instead:
Senior Backend Engineer, AI / ML

Senior Backend Engineer position at Klue focusing on AI/ML, building LLM-powered agents and ML platforms in Toronto, offering competitive compensation and hybrid work environment.

Senior Software Engineer, AWS Neuron Inference

Senior Software Engineer position at AWS Neuron team working on machine learning inference optimization and implementation.

Senior Software Development Engineer, ML Ops, AWS Infrastructure Science Engineering

Senior SDE role at AWS focusing on MLOps and infrastructure optimization, leading the development of ML platforms for data center management with competitive compensation and benefits.

Sr. SDE - Scanless Tech, Scanless Tech

Senior Software Development Engineer role at Amazon Robotics, focusing on computer vision and machine learning applications for automated fulfillment systems.

Senior Software Development Engineer, Sponsored Products

Senior Software Development Engineer position at Amazon Ads, focusing on ML and large-scale systems for Sponsored Products, offering competitive compensation and growth opportunities.

Description For Senior Machine Learning Engineer, RAG

Klue is seeking a Senior Machine Learning Engineer to join their ML team in Toronto, focusing on building and optimizing state-of-the-art RAG (Retrieval Augmented Generation) systems. This role offers an exciting opportunity to reinvent RAG systems, ideal for someone with strong ML and IR fundamentals who wants to dive deep into practical LLM applications.

Key Responsibilities:

  • Optimize RAG systems with scientific rigor and reproducible results
  • Measure and improve retrieval systems using comprehensive evaluation metrics
  • Develop optimal chunking and enrichment strategies for diverse data sources
  • Work on prompt engineering to effectively utilize retrieved context
  • Train and fine-tune smaller, more efficient models
  • Deploy and monitor models in production, optimize latency, and implement metrics
  • Apply latest research breakthroughs to improve systems
  • Collaborate with backend engineers to build scalable, production-ready systems

Required Experience:

  • Masters or PhD in Machine Learning, NLP, or related field
  • 2+ years building and optimizing retrieval systems
  • 2+ years training/fine-tuning transformer models
  • Strong foundation in evaluating RAG systems
  • Deep understanding of retrieval metrics and their trade-offs
  • Expertise in automated LLM evaluation and prompt engineering
  • Experience deploying models to production and monitoring system health

Technologies Used:

  • LLM platforms: OpenAI, Anthropic, open-source models
  • ML frameworks: PyTorch, Transformers, spaCy
  • Search/Vector DBs: Elasticsearch, Pinecone, PostgreSQL
  • MLOps tools: Weights & Biases, MLflow, Langfuse
  • Infrastructure: Docker, Kubernetes, GCP
  • Development: Python, Git, CI/CD

Klue offers a hybrid work environment with main Canadian hubs in Vancouver and Toronto. The role is ideally located in Toronto, with at least 2 days per week in the office. Join a company recognized as one of Canada's Most Admired Corporate Cultures, a Deloitte Technology Fast 50 & Fast 500 winner, and recipient of Startup of the Year and Tech Culture of the Year awards.

Last updated 6 months ago

Responsibilities For Senior Machine Learning Engineer, RAG

  • Optimize RAG systems with scientific rigor and reproducible results
  • Measure and improve retrieval systems across the spectrum from BM25 to semantic search
  • Develop optimal chunking and enrichment strategies for diverse data sources
  • Work on prompt engineering to effectively utilize the retrieved context
  • Train and fine-tune smaller, more efficient models
  • Deploy and monitor models in production, optimize latency, and implement comprehensive metrics
  • Apply deep understanding of latest breakthroughs to improve systems
  • Collaborate with backend engineers to build scalable, production-ready systems

Requirements For Senior Machine Learning Engineer, RAG

Python
PostgreSQL
  • Masters or PhD in Machine Learning, NLP, or related field
  • 2+ years building and optimizing retrieval systems
  • 2+ years training/fine-tuning transformer models
  • Strong foundation in evaluating RAG systems
  • Deep understanding of retrieval metrics and their trade-offs
  • Strong grasp of embedding models, semantic similarity techniques, and clustering similar content
  • Knowledge of query augmentation and content enrichment strategies
  • Expertise in automated LLM evaluation, including LLM-as-judge methodologies
  • Skilled at prompt engineering - including zero-shot, few-shot, and chain-of-thought
  • Experience deploying models to production and monitoring the health of the system and the predictions
  • Knowledge of ML infrastructure, model serving, and observability best practices
  • Proven ability to balance scientific rigor with driving business impact
  • Track record of staying current with ML research and breakthrough papers

Benefits For Senior Machine Learning Engineer, RAG

Equity
  • Equity

Interested in this job?