Taro Logo

Linux Site Reliability Engineer

SpaceX is developing technologies to enable human life on Mars, founded under the belief that exploring the stars is fundamentally more exciting than not.
$140,000 - $170,000
DevOps
Mid-Level Software Engineer
In-Person
5,000+ Employees
3+ years of experience
Space

Description For Linux Site Reliability Engineer

SpaceX is seeking an experienced Linux Site Reliability Engineer to join their Information Technology Linux Infrastructure team in Redmond, WA. This role combines DevOps expertise with SpaceX's ambitious mission of enabling human life on Mars.

The position offers a competitive salary range of $140,000-$170,000 per year, along with comprehensive benefits including medical, dental, and vision coverage, 401(k), stock options, and an Employee Stock Purchase Plan. The company provides 3 weeks of paid vacation, paid holidays, and shuttle service from select Seattle locations.

As an SRE, you'll be responsible for managing and optimizing Kubernetes clusters, working with cutting-edge containerization technologies, and building resilient, scalable systems. The role requires deep expertise in Linux systems, containerization, and automation tools like Ansible and Terraform. You'll collaborate with engineering teams to design and implement solutions that support SpaceX's critical business functions.

Key technical areas include Kubernetes, Linux, container runtime environments, infrastructure as code, and programming in Python and Go. The ideal candidate should have experience with monitoring tools like Prometheus and Grafana, and be comfortable with Git-based workflows and CI/CD practices.

This is an excellent opportunity for a skilled DevOps engineer who wants to contribute to space exploration while working with modern infrastructure technologies. The role offers both technical challenges and the chance to make a meaningful impact on humanity's future in space. The position requires participation in on-call rotations and occasional extended hours, reflecting the dynamic nature of SpaceX's mission-critical operations.

Last updated 3 days ago

Responsibilities For Linux Site Reliability Engineer

  • Install, manage, scale and optimize Kubernetes and RKE clusters using Ansible, Terraform
  • Work with engineers to gather requirements, design, deploy, and support software platforms
  • Build highly resilient, high-performance, scalable systems
  • Make recommendations and implement improvements using change control methodology
  • Define, document and follow standards and best practices
  • Foster collaboration and cross-training in Kubernetes expertise
  • Drive scripting, self-service and automation to reduce administrative overhead
  • Participate in on-call rotation for urgent after-hours work

Requirements For Linux Site Reliability Engineer

Kubernetes
Linux
Python
Go
  • Bachelor's degree in Computer Science or STEM discipline and 3+ years of systems engineering experience; OR 5+ years of systems engineering experience
  • Experience deploying and supporting Linux servers in physical and virtualized environments
  • Experience with Linux shell configuration and system extensions
  • Experience supporting and scaling containerized applications
  • Experience using automation frameworks for infrastructure management
  • Must be a U.S. citizen, permanent resident, refugee, or asylee (ITAR requirement)

Benefits For Linux Site Reliability Engineer

Medical Insurance
Vision Insurance
Dental Insurance
401k
Parental Leave
Equity
  • Medical Insurance
  • Vision Insurance
  • Dental Insurance
  • 401k
  • Parental Leave
  • Stock Options
  • Employee Stock Purchase Plan

Jobs Related To SpaceX Linux Site Reliability Engineer