Multimodal Generative Modeling Research Engineer - SIML, ISE

Apple

A leading technology company that creates innovative products and experiences in computing, mobile devices, and software.

San Francisco, CA, USA

$175,800 - $312,200

Machine Learning

Senior Software Engineer

In-Person

5,000+ Employees

5+ years of experience

This job posting may no longer be active. You may be interested in these related jobs instead:

Description For Multimodal Generative Modeling Research Engineer - SIML, ISE

Apple's Scene Understanding team is seeking a senior technical leader for their Intelligence System Experience (ISE) team to work on cutting-edge multimodal machine learning projects. This role focuses on transforming creative workflows and smart assistants through generative models, working on technologies like Image Playground, Genmoji, Generative Memories, and Semantic Search. The position involves leading cross-functional efforts in ML modeling, prototyping, validation, and private learning, with a focus on training and adapting large language models. The team works at the intersection of multimodal machine learning and system experiences, delivering features for Spotlight Search, Photos Memories, Generative Playgrounds, and Smart wallpapers. The role requires expertise in distributed training, on-device optimization, and privacy-preserving personalization. The successful candidate will work closely with ML researchers, software engineers, hardware & design teams to enhance multimodal capabilities of large language models and create compelling user experiences that align image/video content with language models for visual actions and multi-turn interactions.

Last updated 4 months ago

Responsibilities For Multimodal Generative Modeling Research Engineer - SIML, ISE

Training large scale multimodal (2D/3D vision-language) models on distributed backends
Deployment of compact neural architectures efficiently on device
Learning policies that can be personalized to the user in a privacy preserving manner
Ensuring quality in the wild, with emphasis on fairness and model robustness
Enriching multimodal capabilities of large language models
Aligning image/video content to the space of LMs for visual actions & multi-turn interactions

Requirements For Multimodal Generative Modeling Research Engineer - SIML, ISE

Python

M.S. or PhD in Computer Science or related field (Electrical Engineering, Robotics, Statistics, Applied Mathematics) or equivalent experience
Hands on experience training LLMs/adapting pre-trained LLMs for downstream tasks & alignment
Modeling experience at the intersection of NLP and vision
Proficiency in ML toolkit of choice, e.g., PyTorch
Strong programming skills in Python

Benefits For Multimodal Generative Modeling Research Engineer - SIML, ISE

Medical Insurance

Dental Insurance

Education Budget

Equity

Relocation Benefits

Comprehensive medical and dental coverage
Retirement benefits
Discounted products and free services
Education reimbursement for career advancement
Discretionary restricted stock unit awards
Employee Stock Purchase Plan
Discretionary bonuses
Relocation benefits