Taro Logo

AIML - Machine Learning Engineer, Foundation Model Services

Apple is a technology company that designs, develops, and sells consumer electronics, software, and services.
$171,600 - $302,200
Machine Learning
Senior Software Engineer
In-Person
5,000+ Employees
5+ years of experience
AI
This job posting may no longer be active. You may be interested in these related jobs instead:

Description For AIML - Machine Learning Engineer, Foundation Model Services

Apple's Foundation Model Infrastructure team, within Machine Learning Platform Technologies organization, is seeking a Machine Learning Engineer to join their team. This role is at the heart of Apple Intelligence, building frameworks, services, and tools that power Apple's largest foundation models on servers.

The infrastructure you'll work with powers a wide range of Apple services including Apple Search, Apple Music, AppleTV, AppStore, iMessages, Photos & Camera, Spotlight, Safari, Siri, and upcoming products, serving millions of queries daily with incredibly low latencies. You'll be optimizing billions of parameter language, vision, and speech models using state-of-the-art technologies at Apple's scale.

As a Machine Learning Engineer, you'll collaborate with the Foundation Model Research team to optimize inference for cutting-edge model architectures and work closely with product teams to build production-grade solutions. You'll be responsible for building tools to understand inference bottlenecks across different hardware configurations and use cases, while also mentoring and guiding other engineers.

The role requires expertise in high-throughput services at supercomputing scale, proficiency with cloud platforms and containerization, and strong knowledge of GPU programming and machine learning frameworks. You'll work with modern languages like Go and Python, and should be familiar with fundamental deep learning architectures and tools like NVIDIA TensorRT-LLM, vLLM, and DeepSpeed.

This position offers a competitive compensation package ranging from $171,600 to $302,200, along with comprehensive benefits including medical coverage, stock options, and education reimbursement. Join Apple in pushing the boundaries of computing and intelligence, making a direct impact on billions of users worldwide.

Last updated 22 days ago

Responsibilities For AIML - Machine Learning Engineer, Foundation Model Services

  • Work alongside Foundation Model Research team to optimize inference for cutting edge model architectures
  • Work closely with product teams to build Production grade solutions to launch models serving millions of customers in real time
  • Build tools to understand bottlenecks in Inference for different hardwares and use cases
  • Mentor and guide engineers in the organization

Requirements For AIML - Machine Learning Engineer, Foundation Model Services

Python
Go
Kubernetes
  • Demonstrated experience in leading and driving complex, ambiguous projects
  • Experience with high throughput services particularly at supercomputing scale
  • Proficient in running applications on Cloud (AWS, Azure, or equivalent) using Kubernetes and Docker
  • Familiar with GPU programming concepts using CUDA and with popular machine learning frameworks like PyTorch or TensorFlow
  • Proficient in building and maintaining systems written in modern languages (e.g. Go, Python)
  • Familiar with fundamental deep learning architectures such as Transformer models and encoder/decoder models
  • Familiar with NVIDIA TensorRT-LLM, vLLM, DeepSpeed, NVIDIA Triton Inference Server
  • Experience in writing custom CUDA kernels using CUDA or OpenAI Triton

Benefits For AIML - Machine Learning Engineer, Foundation Model Services

Medical Insurance
Dental Insurance
Vision Insurance
401k
Equity
Education Budget
  • Comprehensive medical and dental coverage
  • Vision insurance
  • Retirement benefits
  • Employee stock programs
  • Education reimbursement
  • Discretionary bonuses
  • Relocation benefits