SpaceX is seeking a Site Reliability Engineer to join their Guidance, Navigation, and Control (GNC) team, focusing on mission-critical systems for the Falcon program. This role combines DevOps expertise with specialized aerospace applications, working with cutting-edge technology in space exploration.
The position involves managing and scaling custom-built mission-critical products for GNC, including Monte Carlo simulations on high-performance computing clusters, automated data analysis systems, and vehicle configuration verification tools. The role requires a unique blend of site reliability engineering skills and software development expertise, working in a fast-paced environment that directly impacts space missions.
As an SRE at SpaceX, you'll be responsible for maintaining a 4000+ thread HPC cluster, deploying and scaling mission-critical applications, and collaborating closely with GNC software engineers to ensure robust and maintainable systems. The role combines traditional SRE responsibilities with aerospace-specific challenges, requiring expertise in Python, Linux systems, and modern DevOps tools.
The compensation package is competitive, ranging from $120,000 to $170,000 based on experience level, plus comprehensive benefits including medical coverage, 401(k), stock options, and various other perks. This is an excellent opportunity for someone passionate about both infrastructure engineering and space technology to contribute to SpaceX's mission of enabling human life on Mars.
The ideal candidate will thrive in a challenging environment where they can apply their technical skills to solve complex problems in space exploration. This role offers unique exposure to aerospace engineering while leveraging modern DevOps practices and tools. You'll be working with cutting-edge technology and contributing directly to SpaceX's mission of making humanity a multi-planetary species.