Taro Logo

Lead Site Reliability Engineer

Salesforce is the Customer Company, inspiring the future of business with AI+ Data +CRM, helping companies connect with customers in new ways.
$200,800 - $276,100
Site Reliability
Staff Software Engineer
Hybrid
5,000+ Employees
8+ years of experience
Enterprise SaaS

Description For Lead Site Reliability Engineer

Salesforce is seeking a Lead Site Reliability Engineer to join their Marketing Automation Platform & Data Operations team within the Marketing Technology organization. This role is crucial in ensuring the reliability and operational efficiency of their critical Marketing Technology ecosystem. The position combines software engineering and system administration, with a focus on monitoring, visualization, and alerting tools.

The ideal candidate will have extensive experience with cloud platforms and monitoring tools like Datadog, Splunk, and Grafana. They'll need deep expertise in the Salesforce ecosystem, including Salesforce Platform, Slack, Data Cloud, Tableau, and Heroku. The role requires advanced proficiency in languages like Python, Go, and Java, along with strong knowledge of Infrastructure as Code tools.

Key responsibilities include leading incident investigations, managing SLOs, implementing automation solutions, and serving as a technical leader bridging various teams. The role offers competitive compensation ($200,800 - $276,100) and comprehensive benefits including medical insurance, 401(k), and equity opportunities.

This position is perfect for an experienced SRE who excels in both technical leadership and cross-team collaboration, passionate about system reliability and operational excellence. The role offers the opportunity to work with cutting-edge technologies while contributing to Salesforce's mission of helping companies connect with customers in innovative ways.

Working at Salesforce means joining a company that values both business success and positive social impact, with a strong emphasis on equality and inclusion. The position offers professional growth opportunities and the chance to work with industry-leading technologies in a dynamic, fast-paced environment.

Last updated 12 hours ago

Responsibilities For Lead Site Reliability Engineer

  • Ensure reliability, performance, and scalability of critical software systems
  • Lead incident investigations and drive automation initiatives
  • Manage service level objectives (SLOs) and SLAs
  • Conduct root cause analysis using monitoring tools
  • Collaborate across teams including developers, platform engineers, and operations
  • Act as primary point of contact for escalations
  • Participate in developing and executing disaster recovery plans
  • Mentor junior engineers
  • Contribute to internal training and documentation

Requirements For Lead Site Reliability Engineer

Python
Go
Java
Kubernetes
  • 8+ years of relevant industry experience in monitoring, alerting, and visualization systems
  • Advanced expertise with Datadog, Splunk, Grafana, Tableau, New Relic, and PagerDuty
  • Deep knowledge of cloud infrastructures (AWS, Azure, GCP)
  • Experience managing reliability within the Salesforce ecosystem
  • Proven ability in incident escalation and disaster recovery management
  • Strong relationship-building skills across technical and business teams
  • Excellent verbal, written, and interpersonal skills

Benefits For Lead Site Reliability Engineer

Medical Insurance
Dental Insurance
Vision Insurance
401k
Equity
  • Time off programs
  • Medical insurance
  • Dental insurance
  • Vision insurance
  • Mental health support
  • Paid parental leave
  • Life insurance
  • Disability insurance
  • 401(k)
  • Employee stock purchasing program

Interested in this job?

Jobs Related To Salesforce Lead Site Reliability Engineer