Taro Logo

Research Engineer - Reinforcement Learning Fundamentals

Anthropic creates reliable, interpretable, and steerable AI systems for safe and beneficial use.
$250,000 - $340,000
Machine Learning
Senior Software Engineer
Hybrid
5+ years of experience
AI
This job posting may no longer be active. You may be interested in these related jobs instead:
Machine Learning Systems Engineer, RL Engineering

Senior ML Systems Engineer role at Anthropic focused on building and improving reinforcement learning systems for AI model training

Research Engineer, Knowledge Team

Senior Research Engineer position at Anthropic focused on redesigning how AI systems interact with external data sources through innovative information architectures and LLM training.

Research Engineer, Frontier Red Team (RSP Evaluations)

Senior Research Engineer position at Anthropic focusing on AI safety evaluations and implementing responsible scaling policies for frontier AI models.

Software Engineer

Senior Software Engineering role at Anthropic focusing on building and optimizing large-scale ML systems, with emphasis on AI safety and interpretability.

Applied AI ML - Senior Associate - Machine Learning Engineer

Senior Machine Learning Engineer role at JPMorgan focusing on applied AI/ML solutions for financial services, requiring PhD and expertise in NLP or Computer Vision.

Description For Research Engineer - Reinforcement Learning Fundamentals

Anthropic is seeking a Research Engineer for their Reinforcement Learning Fundamentals team. This role involves collaborating with researchers and engineers to advance large language models through reinforcement learning research. Key responsibilities include developing novel RL techniques, creating tools for complex tasks, and enhancing reasoning capabilities in areas like code generation and mathematics. The ideal candidate has 5+ years of industry experience, proficiency in Python and deep learning frameworks, strong software engineering skills, and a passion for AI safety. The role offers competitive compensation, including a salary range of £250,000 - £340,000 GBP, equity, and comprehensive benefits. Anthropic values diversity and encourages applications from all backgrounds. The company operates on a hybrid work model, with at least 25% office presence required, and offers visa sponsorship. Anthropic is committed to big science AI research, working as a cohesive team on large-scale efforts to create steerable, trustworthy AI systems.

Last updated 7 months ago

Responsibilities For Research Engineer - Reinforcement Learning Fundamentals

  • Develop and implement novel reinforcement learning techniques
  • Create tools and environments for models to perform complex tasks
  • Design and run experiments to enhance models' reasoning capabilities
  • Collaborate with researchers and engineers

Requirements For Research Engineer - Reinforcement Learning Fundamentals

Python
Kubernetes
  • 5+ years of industry-related experience
  • Proficiency in Python
  • Experience with deep learning frameworks (PyTorch or Jax)
  • Strong software engineering background
  • Passion for AI safety and beneficial systems

Benefits For Research Engineer - Reinforcement Learning Fundamentals

Medical Insurance
Dental Insurance
Vision Insurance
401k
Parental Leave
Education Budget
Commuter Benefits
Equity
  • Health insurance
  • Dental insurance
  • Vision insurance
  • 401(k) with 4% matching
  • 22 weeks paid parental leave
  • Unlimited PTO
  • Education stipend
  • Home office improvement stipend
  • Commuting stipend
  • Wellness stipend
  • Fertility benefits
  • Daily lunches and snacks
  • Relocation support
  • Equity donation matching

Interested in this job?