Taro Logo

Multimodal Generative Modeling Research Engineer - SIML, ISE

A leading technology company that creates innovative products and experiences in computing, mobile devices, and software.
$175,800 - $312,200
Machine Learning
Senior Software Engineer
In-Person
5,000+ Employees
5+ years of experience
AI
This job posting may no longer be active. You may be interested in these related jobs instead:

Description For Multimodal Generative Modeling Research Engineer - SIML, ISE

Apple's Scene Understanding team is seeking a senior technical leader for their Intelligence System Experience (ISE) team to work on cutting-edge multimodal machine learning projects. This role focuses on transforming creative workflows and smart assistants through generative models, working on technologies like Image Playground, Genmoji, Generative Memories, and Semantic Search. The position involves leading cross-functional efforts in ML modeling, prototyping, validation, and private learning, with a focus on training and adapting large language models. The team works at the intersection of multimodal machine learning and system experiences, delivering features for Spotlight Search, Photos Memories, Generative Playgrounds, and Smart wallpapers. The role requires expertise in distributed training, on-device optimization, and privacy-preserving personalization. The successful candidate will work closely with ML researchers, software engineers, hardware & design teams to enhance multimodal capabilities of large language models and create compelling user experiences that align image/video content with language models for visual actions and multi-turn interactions.

Last updated 4 months ago

Responsibilities For Multimodal Generative Modeling Research Engineer - SIML, ISE

  • Training large scale multimodal (2D/3D vision-language) models on distributed backends
  • Deployment of compact neural architectures efficiently on device
  • Learning policies that can be personalized to the user in a privacy preserving manner
  • Ensuring quality in the wild, with emphasis on fairness and model robustness
  • Enriching multimodal capabilities of large language models
  • Aligning image/video content to the space of LMs for visual actions & multi-turn interactions

Requirements For Multimodal Generative Modeling Research Engineer - SIML, ISE

Python
  • M.S. or PhD in Computer Science or related field (Electrical Engineering, Robotics, Statistics, Applied Mathematics) or equivalent experience
  • Hands on experience training LLMs/adapting pre-trained LLMs for downstream tasks & alignment
  • Modeling experience at the intersection of NLP and vision
  • Proficiency in ML toolkit of choice, e.g., PyTorch
  • Strong programming skills in Python

Benefits For Multimodal Generative Modeling Research Engineer - SIML, ISE

Medical Insurance
Dental Insurance
Education Budget
Equity
Relocation Benefits
  • Comprehensive medical and dental coverage
  • Retirement benefits
  • Discounted products and free services
  • Education reimbursement for career advancement
  • Discretionary restricted stock unit awards
  • Employee Stock Purchase Plan
  • Discretionary bonuses
  • Relocation benefits