Taro Logo

Machine Learning Engineer (Vision)

Building Maia, a cutting-edge multimodal agent system combining advanced research in neural network architectures, long-term memory, and reinforcement learning.
Machine Learning
Senior Software Engineer
In-Person
AI
This job posting may no longer be active. You may be interested in these related jobs instead:

Description For Machine Learning Engineer (Vision)

Zyphra, a Palo Alto-based AI company, is seeking a Machine Learning Engineer specializing in vision to join their team building Maia, a cutting-edge multimodal agent system. The role focuses on developing next-generation vision-language models for understanding natural scenes, particularly in web, desktop, and mobile UIs. The ideal candidate will contribute to large-scale vision encoder training, performance optimization, and dataset management.

The company brings together talent from leading AI organizations including Google DeepMind, Anthropic, StabilityAI, and others. They value both deep research and engineering excellence, embracing new ideas and maintaining a fast-paced, impact-driven culture. The team strongly emphasizes making grounded, methodical steps toward ambitious goals.

The position requires strong research intuition, implementation skills, and the ability to work effectively in a collaborative environment. Experience with vision language models, large-scale datasets, and deep learning frameworks like PyTorch is highly valued. The role offers comprehensive benefits including competitive compensation, healthcare, and equity, plus unique perks like on-site meals and regular team events.

This is an in-person role at their Palo Alto headquarters, ideal for candidates passionate about AI who can contribute to both research and engineering implementation at scale. The company offers visa sponsorship for exceptional candidates and maintains a culture where both crazy ideas and methodical execution are celebrated.

Last updated 17 days ago

Responsibilities For Machine Learning Engineer (Vision)

  • Core contributor on Vision Team building next generation vision-language models
  • Large-scale vision encoder and vision language training runs
  • Performance optimization of training stack
  • Image and video dataset collection, processing, and evaluation
  • Architecture and training methodology improvements

Requirements For Machine Learning Engineer (Vision)

Python
  • Strong research taste and intuition with ability to work through projects from conception to execution
  • Strong implementation and prototyping ability
  • Ability to work well in a high-paced research setting
  • Willing to be in-person in Palo Alto office
  • US work authorization required
  • Highly proficient with Pytorch and Python
  • Excellent communication and collaboration skills

Benefits For Machine Learning Engineer (Vision)

Medical Insurance
Dental Insurance
Vision Insurance
401k
Relocation Benefits
Visa Sponsorship
  • Medical, dental, vision and FSA plans
  • Competitive salary and equity
  • 401(k)
  • Relocation and immigration support (case-by-case)
  • On-site meals prepared by dedicated culinary team
  • Thursday Happy Hours

Interested in this job?