Production Systems Engineer, AI Systems

Meta builds technologies that help people connect, find communities, and grow businesses, including social platforms like Facebook, Instagram, WhatsApp, and developing AR/VR experiences.
$104,000 - $155,000
Backend
Mid-Level Software Engineer
In-Person
5,000+ Employees
2+ years of experience
AI · Hardware

Description For Production Systems Engineer, AI Systems

Meta is seeking a Production Systems Engineer to join their AI Systems team, focusing on critical AI System Platforms and server infrastructure. This role combines hands-on technical work with strategic platform development, offering a unique opportunity to impact Meta's AI infrastructure at scale.

The position involves working with cutting-edge AI platforms, managing server scale-up and scale-out networking connectivity, and maintaining the complex infrastructure that powers Meta's AI initiatives. You'll be at the forefront of implementing and optimizing AI system platforms, ensuring robust networking connectivity, and solving complex technical challenges.

As a Production Systems Engineer, you'll be responsible for supporting new AI platform introductions, creating diagnostic tools, and developing deep understanding of AI workload traffic patterns. The role requires expertise in system architecture, networking protocols, and hardware integration, combined with strong troubleshooting abilities.

The ideal candidate will have a background in Computer Science or related field, with experience in network platform development, Linux systems, and TCP/IP protocols. You'll work with various teams to improve product quality, implement solutions, and drive innovation in AI infrastructure.

Meta offers a competitive compensation package ranging from $104,000 to $155,000 annually, plus bonus and equity opportunities. The position is based in Menlo Park, CA, where you'll work with world-class engineers and have access to cutting-edge technology and resources.

This role presents an excellent opportunity for someone passionate about AI infrastructure, system optimization, and large-scale platform development. You'll be part of Meta's mission to advance AI technology while working on some of the most sophisticated systems in the industry.

Last updated a few seconds ago

Responsibilities For Production Systems Engineer, AI Systems

  • Support new AI platform introduction into Meta fleet by driving scale up and scale out interface integration
  • Create experiments and tooling to detect and diagnose hardware/firmware/software health issues
  • Develop understanding of AI workload traffic and incorporate as part of NPI
  • Contribute to enabling hacks for future technology explorations in AI space
  • Troubleshoot, diagnose and root cause system failures
  • Develop visibility through data visualization
  • Implement systemic solutions to hardware health issues
  • Drive continuous product quality improvement

Requirements For Production Systems Engineer, AI Systems

Linux
Python
  • Bachelor's degree in Computer Science, Computer Engineering, or relevant technical field
  • 2+ years of work experience in Network ASIC/Platform development, network product deployment, or Interconnect Technologies
  • Knowledge of server architecture and components
  • Experience working with Linux
  • Knowledge of TCP/IP and experience using iperf
  • Hands on troubleshooting and debug experience

Benefits For Production Systems Engineer, AI Systems

Medical Insurance
Equity
401k
  • Bonus
  • Equity
  • Medical benefits
  • 401k

Interested in this job?

Jobs Related To Meta Production Systems Engineer, AI Systems

Network Production Engineer - Core Networking, Backbone

Network Production Engineer role at Meta focusing on designing and implementing global core IP networks, requiring expertise in both networking protocols and software engineering.

Optical Network Engineer

Meta is seeking an Optical Network Engineer to design, build, and operate their global optical network infrastructure, combining software engineering with network expertise.

Software Engineer, Infrastructure

Software Engineer position at Meta focusing on infrastructure development, building core backend systems that power Meta's products used by billions globally.

Network Production Engineer, Infrastructure

Network Production Engineer role at Meta, combining networking expertise with software engineering to manage and scale one of the world's largest network infrastructures.

Developer Support Engineer

Meta is seeking a Developer Support Engineer to research, review, and solve technical issues related to developer-reported problems in the AR/VR space.