OpenAI's Hardware Health team is dedicated to ensuring the optimal performance and reliability of our custom-built hyperscale supercomputers. As the Engineering Manager for Networking on this team, you will lead a group of engineers responsible for the reliability, performance, and scalability of the networking components within our supercomputing infrastructure. Your team will focus on monitoring, diagnosing, and resolving networking issues that could affect the overall health and efficiency of our systems.
Key responsibilities include:
The ideal candidate will have:
This role is based in San Francisco, CA, with a hybrid work model of 3 days in the office per week. OpenAI offers relocation assistance to new employees and is committed to creating an inclusive environment that values diverse perspectives and experiences.