SpaceX is seeking an experienced Site Reliability Engineer to join their Information Technology Linux Infrastructure team. This role is crucial in supporting SpaceX's mission to make life multiplanetary through the development and maintenance of large-scale, distributed, fault-tolerant systems.
The position focuses on Kubernetes design, maintenance, scaling, and optimization in support of critical business functions. The ideal candidate will thrive in a fast-paced environment while managing a fleet of Kubernetes clusters at scale. They will be responsible for infusing SRE culture and practices across teams while tackling complex scaling challenges on the journey to Mars.
The role offers a comprehensive benefits package including medical, dental, and vision coverage, 401(k) retirement plan, equity opportunities through stock options and ESPP, and various other perks. Compensation ranges from $120,000 to $170,000 based on experience level, with additional long-term incentives available.
Key responsibilities include managing production Kubernetes installations, collaborating on deployment automation, and ensuring cluster reliability and security. The position requires strong experience with Linux systems, Kubernetes architecture, and Infrastructure as Code practices. Candidates should be prepared for on-call rotations and occasional extended hours.
This is an exciting opportunity to work at the forefront of space technology while building and maintaining critical infrastructure. The role combines technical expertise in site reliability engineering with SpaceX's ambitious goal of enabling human life on Mars. Successful candidates will join a team of passionate engineers working on cutting-edge technology in the space industry.
The position requires ITAR compliance, meaning candidates must be U.S. citizens, permanent residents, or eligible for required authorizations. SpaceX offers a dynamic work environment where engineers can directly impact the company's mission of making humanity multiplanetary.