SRE (Site Reliability Engineer) - Production Support LMTS

Global leader in CRM software and enterprise cloud computing solutions
$184,000 - $276,100
Site Reliability
Staff Software Engineer
In-Person
5,000+ Employees
8+ years of experience
Enterprise SaaS

Description For SRE (Site Reliability Engineer) - Production Support LMTS

Join MuleSoft, a Salesforce company, as a Site Reliability Engineer (SRE) in a role crucial to maintaining the reliability and performance of its cloud infrastructure. This position offers the opportunity to work with cutting-edge technology on a team responsible for full-stack observability, event response, and reliability engineering. The role combines technical expertise in cloud infrastructure, automation, and system reliability with the challenge of maintaining high-availability services for a leading enterprise software company.

The position requires deep technical knowledge across multiple domains, including cloud platforms (AWS), monitoring tools, and infrastructure automation. You'll be working on critical systems that directly impact customer experience, implementing best practices in reliability engineering, and ensuring system uptime meets or exceeds industry standards. The role offers exposure to complex, large-scale distributed systems and the opportunity to work with modern technologies like Kubernetes, Terraform, and various monitoring solutions.

This is a unique opportunity for experienced SREs who want to make a significant impact at scale. The position requires U.S. citizenship and the ability to obtain security clearances, indicating work with sensitive systems and government clients. The compensation is highly competitive, with salary ranges varying by location (up to $276,100 in California), reflecting the senior nature of the role and the high level of expertise required.

The role combines technical challenges with strategic thinking, requiring both hands-on engineering skills and the ability to drive reliability improvements across the organization. You'll be part of a team that values proactive problem-solving and innovative approaches to maintaining system reliability. This position offers the chance to work with enterprise-scale infrastructure while contributing to the success of a leading technology company.

Last updated 2 days ago

Responsibilities For SRE (Site Reliability Engineer) - Production Support LMTS

  • Maintain and improve service reliability, availability, and performance across distributed systems
  • Design and maintain monitoring, logging, and alerting systems
  • Respond to production incidents and perform root cause analysis
  • Automate repetitive tasks using scripts and infrastructure-as-code
  • Monitor usage trends and perform capacity planning
  • Maintain and improve CI/CD pipelines
  • Collaborate with security teams for compliance requirements
  • Work with development teams to design resilient systems
  • Create and maintain documentation for runbooks and systems

Requirements For SRE (Site Reliability Engineer) - Production Support LMTS

Python
Go
Kubernetes
  • 8+ years of experience in an SRE role or related field
  • Experience in Public Cloud environments, specifically with AWS
  • Experience with New Relic, collectd, Splunk, Sumo Logic, Grafana, Terraform, Jenkins, Kubernetes, Spinnaker
  • Excellent knowledge of Internet technologies and protocols
  • Strong experience with API fundamentals
  • Ability to root cause issues in high-traffic distributed systems
  • Experience with development in Python, Go, Bash
  • Experience with FedRAMP environments
  • A related technical degree required
  • Must be a U.S. citizen operating on U.S. soil

Jobs Related To Salesforce SRE (Site Reliability Engineer) - Production Support LMTS

Production Support Engineering LMTS

Senior SRE position at Salesforce focusing on cloud infrastructure reliability, requiring U.S. citizenship and extensive experience with AWS, Kubernetes, and monitoring tools.

Site Reliability Engineer 4, Games Engineering

Senior Site Reliability Engineering role at Netflix Games, focusing on platform reliability, incident management, and developer experience enhancement with competitive compensation.

Staff Site Reliability Engineer

Staff Site Reliability Engineer position at Fivetran, focusing on infrastructure reliability, monitoring, and system evolution with hybrid work in Denver.

Site Reliability Developer 3

Site Reliability Developer role at Oracle focusing on cloud infrastructure, automation, and system reliability with emphasis on security and scalability.