Site Reliability Engineer, GNC (Falcon)

SpaceX develops technologies to enable human life on Mars and to make space exploration more accessible.
Hawthorne, CA, USA
$120,000 - $170,000
Site Reliability
Mid-Level Software Engineer
In-Person
5,000+ Employees
2+ years of experience
Space

Job Description

SpaceX is seeking a Site Reliability Engineer to join its Guidance, Navigation, and Control (GNC) team, focusing on mission-critical systems for the Falcon program. The role combines DevOps expertise with specialized aerospace applications.

The position involves managing and scaling custom-built mission-critical products for GNC, including Monte Carlo simulations on high-performance computing clusters, automated data analysis systems, and vehicle configuration verification tools. It calls for both site reliability engineering and software development skills, in a fast-paced environment where the work directly affects space missions.

As an SRE at SpaceX, you'll be responsible for maintaining a 4000+ thread HPC cluster, deploying and scaling mission-critical applications, and collaborating closely with GNC software engineers to ensure robust and maintainable systems. The role combines traditional SRE responsibilities with aerospace-specific challenges, requiring expertise in Python, Linux systems, and modern DevOps tools.

The compensation package is competitive, ranging from $120,000 to $170,000 based on experience level, plus comprehensive benefits including medical coverage, 401(k), stock options, and various other perks. This is an excellent opportunity for someone passionate about both infrastructure engineering and space technology to contribute to SpaceX's mission of enabling human life on Mars.

The ideal candidate thrives in a challenging environment and wants to apply their technical skills to complex problems in space exploration. The role offers exposure to aerospace engineering alongside modern DevOps practices and tools, and contributes directly to SpaceX's goal of making humanity a multi-planetary species.

Last updated 3 months ago

Responsibilities For Site Reliability Engineer, GNC (Falcon)

  • Deploy, upgrade, operate/maintain, and scale mission-critical GNC products and services
  • Provision and maintain virtual and physical servers
  • Work with SpaceX HPC team to monitor and maintain a 4000+ thread HPC cluster
  • Collaborate with GNC software engineers to create highly operable products
  • Add monitoring for web apps and respond to outages
  • Manage computational infrastructure of GNC in collaboration with IT
  • Practice sustainable incident response and postmortems
  • Configure automated deployment pipelines for web apps
  • Develop or improve GNC web apps and tools
  • Focus on performance bottlenecks and improvements

Requirements For Site Reliability Engineer, GNC (Falcon)

Skills: Python, Linux, Kubernetes
  • Bachelor's degree in computer science, information systems/IT, engineering, math, or scientific discipline and 2+ years of software development experience OR 4+ years of professional experience
  • Experience with Linux operating systems
  • Experience with Python and Python based development frameworks
  • Must be willing to work extended hours and weekends when needed
  • Must meet ITAR requirements (US citizen, permanent resident, refugee, or asylee)

Benefits For Site Reliability Engineer, GNC (Falcon)

  • Medical, vision, and dental coverage
  • 401(k) retirement plan
  • Short- and long-term disability insurance
  • Life insurance
  • Paid parental leave
  • 3 weeks paid vacation
  • 10+ paid holidays per year
  • Paid sick leave
  • Stock options/equity
  • Employee Stock Purchase Plan
  • Long-term incentives
  • Potential discretionary bonuses