Meta is seeking an experienced Systems Engineer to join their Release to Production (RTP) team working on the Meta Training and Inference Accelerator (MTIA) program. This role is crucial for Meta's AI/ML initiatives, supporting large-scale AI Training and Inference operations. The position focuses on the end-to-end Hardware Lifecycle of Meta's servers, including prototyping, debugging, and system monitoring.
The ideal candidate will work on scale up and scale out network technologies, particularly RDMA NIC, for MTIA systems that power Meta's AI advancements. This role requires deep knowledge of network protocols (TCP/IP, RDMA) and hands-on experience with post-Silicon validation for networking platforms.
As a Production Systems Engineer, you'll collaborate with various teams including hardware designers, networking teams, system manufacturers, and data center operations teams. You'll be responsible for system validation, troubleshooting, and ensuring the successful deployment of new platforms into Meta's fleet.
The position offers competitive compensation ranging from $163,000 to $225,000 per year, plus bonus and equity opportunities. This is an excellent opportunity for experienced engineers who want to work at the intersection of hardware systems and AI technology at one of the world's leading tech companies.
Meta provides a comprehensive benefits package and promotes an inclusive work environment, being an Equal Employment Opportunity employer. The role is based in Austin, TX, and offers the chance to work on cutting-edge AI infrastructure that powers Meta's innovative services.