Staff Software Engineer - Site Reliability and Observability

A global automotive company with a vision of Zero Crashes, Zero Emissions and Zero Congestion.
Austin, TX, USA; Roswell, GA, USA; Warren, MI, USA
Site Reliability
Staff Software Engineer
Hybrid
5,000+ Employees
7+ years of experience
Automotive

Description For Staff Software Engineer - Site Reliability and Observability

General Motors is seeking a Staff Software Engineer specializing in Site Reliability and Observability. This role is critical to ensuring the reliability, scalability, and performance of GM's software systems. The position requires expertise in cloud platforms (preferably Azure), monitoring tools, and infrastructure automation. The ideal candidate will have 7+ years of SRE experience and strong programming skills in languages such as Python, Java, and Go.

The role involves implementing and maintaining observability platforms, collaborating with engineering teams on architecture decisions, and ensuring high availability of production systems. You'll be responsible for monitoring system health, conducting root cause analysis, and implementing best practices for reliability and performance optimization.

GM offers a comprehensive benefits package including healthcare, 401k matching, and educational assistance. The company's mission focuses on achieving Zero Crashes, Zero Emissions, and Zero Congestion, making this an opportunity to work on transformative automotive technology. The position is hybrid, based in either Austin, TX or Atlanta, GA, with in-office presence required three days per week.

This is an excellent opportunity for an experienced SRE professional looking to make an impact in the automotive industry while working with cutting-edge technology and contributing to GM's vision of future mobility.

Last updated 4 hours ago

Responsibilities For Staff Software Engineer - Site Reliability and Observability

  • Implement a scalable, reliable, and secure SRE and observability platform
  • Deliver tools/software to improve reliability, scalability and operability
  • Collaborate with engineering teams on architecture and infrastructure
  • Conduct production readiness reviews and deployments
  • Monitor system availability, latency and service health
  • Participate in on-call engineering duty
  • Perform incident root cause analysis
  • Build runbooks and tooling for production support
  • Participate in technical discussions with the Architecture group

Requirements For Staff Software Engineer - Site Reliability and Observability

Kubernetes
Python
Java
Go
  • 7+ years of hands-on SRE experience with cloud providers (Azure preferred)
  • Experience with high-availability, fault-tolerant distributed systems
  • Experience with monitoring frameworks like Azure Monitor, Datadog, Dynatrace
  • Strong working knowledge of Docker, Kubernetes, Terraform
  • Experience troubleshooting JVM-based applications
  • Strong experience in Python, Java, Go, PowerShell, Bash
  • Knowledge of CI/CD automation frameworks
  • Strong understanding of public cloud networking
  • Experience with GitHub and Azure DevOps
  • BS/MS in Computer Science/Engineering preferred

Benefits For Staff Software Engineer - Site Reliability and Observability

Medical Insurance
Dental Insurance
Vision Insurance
401k
Parental Leave
Education Budget
  • Paid time off including vacation and holidays
  • Healthcare coverage
  • Dental and vision insurance
  • Life insurance
  • 401k with company match
  • Education assistance
  • Student loan refinancing
  • Vehicle purchase discounts
  • Parental leave
