Taro Logo

Machine Learning Researcher / Engineer (Foundational Models)

Pathway is building Live AI™ systems that think and learn in real time as humans do, focusing on deeply understanding how and why LLMs work.
Machine Learning
Senior Software Engineer
Hybrid
11 - 50 Employees
AI · Enterprise SaaS

Job Description

Pathway, a cutting-edge AI company backed by over $15M in funding, is seeking a Machine Learning Researcher/Engineer specializing in Foundational Models. This R&D position focuses on attention-based models with a guaranteed 7-digit GPU budget. The role combines deep research in Language Models and RL with practical implementation, offering a unique opportunity to work on "Live AI" systems that think and learn in real-time.

The company is led by distinguished AI experts, including CTO Jan Chorowski, who has collaborated with Geoff Hinton and Yoshua Bengio, and CSO Adrian Kosowski, a prodigy who earned his PhD in Theoretical Computer Science at age 20. The team comprises world-class scientists and competitive programmers, alongside experienced Silicon Valley executives.

This position offers an intellectually stimulating environment where you'll pioneer new approaches to AI, particularly around long sequences and changing data. You'll be responsible for distributed model training, architecture improvements, and experimental design. The role requires either significant research publications, contributions to noteworthy LLM projects, experience at leading ML research centers, or exceptional competitive programming achievements.

The position offers competitive compensation including a six-digit salary and equity, with flexible work locations in Palo Alto, Paris, or Wroclaw. This is an opportunity to join an early-stage AI startup focused on making foundational changes in how AI systems think and learn.

Last updated 2 months ago

Responsibilities For Machine Learning Researcher / Engineer (Foundational Models)

  • Perform distributed model training
  • Improve/adapt model architectures based on experiment results
  • Design new tasks and experiments
  • Oversee activities of team members involved in data preparation

Requirements For Machine Learning Researcher / Engineer (Foundational Models)

Python
  • Published at least one paper at NeurIPS, ICLR, or ICML as lead author or significant contributor
  • Contributed to a newsworthy LLM training effort
  • 6 months experience in leading Machine Learning research center
  • ICPC World Finalist, or IOI, IMO, or IPhO medalist
  • Deep learning research background
  • Experience with PyTorch, Jax, or Tensorflow
  • Understanding of GPU architecture, memory design, and communication
  • Understanding of graph algorithms
  • Familiarity with model monitoring, git, build systems, and CI/CD
  • Fluent in English

Benefits For Machine Learning Researcher / Engineer (Foundational Models)

Equity
  • Six-digit annual salary
  • Employee Stock Option Plan
  • Remote work options
  • Flexible work location between Palo Alto, Paris, or Wroclaw offices

Related Jobs

Machine Learning Researcher / Engineer (Foundational Models)

Senior Machine Learning Researcher/Engineer position at Pathway, focusing on foundational models and requiring expertise in both ML research and engineering implementation.

Machine Learning Researcher / Engineer (Foundational Models)

Senior ML Research Engineer position at Pathway, focusing on foundational model development with significant GPU resources and competitive compensation.

AIML - Sr. Software Engineer - AIML Observability

Senior Software Engineer role at Apple focusing on AI/ML observability, building cloud-native solutions for monitoring and visualization of AI infrastructure at scale.

Cellular Machine Learning Engineer - Embedded Software

Senior Machine Learning Engineer role at Apple focusing on developing AI/ML solutions for cellular technologies in iPhone, iPad, and Watch products.

AIML - Sr. Software Engineer - AIML Observability

Senior Software Engineer role focused on building AI-powered observability solutions at Apple, working on cloud-native systems and AIML infrastructure.