Taro Logo

Site Reliability Engineer, Kubernetes Platform (Starshield)

SpaceX is a space exploration company developing technologies to enable human life on Mars and deploying the world's largest US government satellite constellation.
Hawthorne, CA, USA
$120,000 - $170,000
Site Reliability
Mid-Level Software Engineer
In-Person
5,000+ Employees
1+ year of experience
Space · Enterprise SaaS

Description For Site Reliability Engineer, Kubernetes Platform (Starshield)

SpaceX, a pioneering space exploration company, is seeking a Site Reliability Engineer to join their Starshield program - the world's largest US government satellite constellation. This role combines cutting-edge space technology with modern infrastructure engineering, focusing on Kubernetes platform management and site reliability engineering. As part of the team, you'll be responsible for designing, operating, and scaling the infrastructure that powers critical intelligence and national security data delivery across the globe.

The position offers an exciting opportunity to work with SpaceX's innovative satellite technology while building and maintaining robust infrastructure systems. You'll be developing automation for on-premise Kubernetes clusters, managing core infrastructure components, and ensuring high system availability. The role requires expertise in Linux systems, containerization technologies, and infrastructure automation tools.

This is an ideal opportunity for an experienced SRE or DevOps engineer who wants to make a direct impact on space technology and national security. You'll work with cutting-edge technology while contributing to SpaceX's mission of making humanity a multi-planetary species. The position offers competitive compensation, comprehensive benefits, and the chance to work with some of the industry's best engineers.

The role comes with significant responsibility, as you'll be managing critical infrastructure that supports national security operations. You'll need to be detail-oriented, capable of handling complex systems, and able to work in a fast-paced environment. The position requires strong technical skills in Kubernetes, Python, Go, and Linux, combined with excellent problem-solving abilities and communication skills.

Working at SpaceX means being part of a team that's pushing the boundaries of what's possible in space technology. You'll have the opportunity to work on meaningful projects that have real-world impact, while enjoying comprehensive benefits including medical coverage, stock options, and various other perks. This is a chance to be part of a company that's actively working to shape humanity's future in space.

Last updated a few seconds ago

Responsibilities For Site Reliability Engineer, Kubernetes Platform (Starshield)

  • Develop automation to deploy and manage on-premise Kubernetes clusters
  • Deploy and manage core infrastructure such as databases, monitoring and distributed storage
  • Closely collaborate with software engineers to create highly scalable, operable, and maintainable products
  • Engage in and improve the whole lifecycle of services -- from inception and design, through deployment, operation and refinement
  • Monitoring and alerting supporting systems to have high availability
  • Hands-on integration and troubleshooting across the entire Starshield stack
  • Identify areas for improvement and create innovative solutions that enable high system availability

Requirements For Site Reliability Engineer, Kubernetes Platform (Starshield)

Kubernetes
Python
Go
Linux
  • Bachelor's degree in computer science, information systems/IT, or an engineering discipline and 1+ years of professional experience in site reliability engineering or DevOps; OR 3+ years of professional experience in site reliability engineering or DevOps in lieu of a degree
  • 1+ years of professional experience with Linux operating systems
  • Experience with Terraform, Ansible, or other infrastructure tools
  • Experience with containerization technologies (i.e. OCI containers, Kubernetes)
  • Experience scripting in Bash, Python, or other similar languages
  • Development experience in Python, C++, or Go

Benefits For Site Reliability Engineer, Kubernetes Platform (Starshield)

401k
Medical Insurance
Dental Insurance
Vision Insurance
Equity
Parental Leave
  • Medical, vision, and dental coverage
  • 401(k) retirement plan
  • Short and long-term disability insurance
  • Life insurance
  • Paid parental leave
  • 3 weeks paid vacation
  • 10+ paid holidays per year
  • Paid sick leave
  • Employee Stock Purchase Plan
  • Company stock options
  • Long-term cash awards

Interested in this job?

Jobs Related To SpaceX Site Reliability Engineer, Kubernetes Platform (Starshield)