Apple's Foundation Model Infrastructure team, within Machine Learning Platform Technologies organization, is seeking a Machine Learning Engineer to join their team. This role is at the heart of Apple Intelligence, building frameworks, services, and tools that power Apple's largest foundation models on servers.
The infrastructure you'll work with powers a wide range of Apple services including Apple Search, Apple Music, AppleTV, AppStore, iMessages, Photos & Camera, Spotlight, Safari, Siri, and upcoming products, serving millions of queries daily with incredibly low latencies. You'll be optimizing billions of parameter language, vision, and speech models using state-of-the-art technologies at Apple's scale.
As a Machine Learning Engineer, you'll collaborate with the Foundation Model Research team to optimize inference for cutting-edge model architectures and work closely with product teams to build production-grade solutions. You'll be responsible for building tools to understand inference bottlenecks across different hardware configurations and use cases, while also mentoring and guiding other engineers.
The role requires expertise in high-throughput services at supercomputing scale, proficiency with cloud platforms and containerization, and strong knowledge of GPU programming and machine learning frameworks. You'll work with modern languages like Go and Python, and should be familiar with fundamental deep learning architectures and tools like NVIDIA TensorRT-LLM, vLLM, and DeepSpeed.
This position offers a competitive compensation package ranging from $171,600 to $302,200, along with comprehensive benefits including medical coverage, stock options, and education reimbursement. Join Apple in pushing the boundaries of computing and intelligence, making a direct impact on billions of users worldwide.