Taro Logo

Lead Site Reliability Engineer

A leading technology company providing AI + Data + CRM solutions to help companies connect with customers in new ways.
$200,800 - $276,100
Site Reliability
Staff Software Engineer
In-Person
5,000+ Employees
8+ years of experience
Enterprise SaaS

Description For Lead Site Reliability Engineer

Salesforce, the Customer Company, is seeking a Lead Site Reliability Engineer to join their Marketing Automation Platform & Data Operations team. This role is crucial in ensuring the reliability and operational efficiency of Salesforce's critical Marketing Technology ecosystem. The position requires an experienced engineer to bridge software engineering and system administration, with emphasis on monitoring, visualization, and alerting tools. The ideal candidate will lead incident investigations, drive automation initiatives, and ensure system stability.

The role involves working with cutting-edge technologies including Datadog, Splunk, Grafana, and cloud platforms. You'll be responsible for maintaining high availability of services, implementing monitoring solutions, and driving continuous improvement in system reliability. The position offers the opportunity to work with enterprise-scale systems and contribute to Salesforce's mission of helping companies connect with customers in innovative ways.

As a technical leader, you'll collaborate with cross-functional teams, mentor junior engineers, and drive best practices in site reliability engineering. The role combines technical expertise with leadership responsibilities, requiring both deep technical knowledge and strong communication skills. You'll be part of a company that values innovation, equality, and making a positive impact on the world through technology.

Working at Salesforce means joining a company that believes in doing well while doing good, with comprehensive benefits, career growth opportunities, and a culture focused on equality and inclusion. The position offers competitive compensation and the chance to work with leading-edge technologies while solving complex challenges at enterprise scale.

Last updated a day ago

Responsibilities For Lead Site Reliability Engineer

  • Ensure reliability, performance, and scalability of critical software systems
  • Lead incident investigations and drive automation initiatives
  • Manage service level objectives (SLOs) and SLAs
  • Conduct root cause analysis using monitoring tools
  • Collaborate across teams including developers, platform engineers, and operations
  • Mentor junior engineers and foster high-performance culture
  • Develop and execute disaster recovery plans

Requirements For Lead Site Reliability Engineer

Python
Go
Java
Kubernetes
  • 8+ years of relevant industry experience in monitoring, alerting, and visualization systems
  • Advanced expertise with Datadog, Splunk, Grafana, Tableau, New Relic, and PagerDuty
  • Deep knowledge of cloud infrastructures (AWS, Azure, GCP)
  • Experience managing reliability within the Salesforce ecosystem
  • Proven ability in incident escalation and disaster recovery management
  • Strong relationship-building skills across technical and business teams
  • Excellent verbal, written, and interpersonal skills

Benefits For Lead Site Reliability Engineer

Medical Insurance
401k
Parental Leave
  • Comprehensive benefits package
  • Career growth opportunities
  • Inclusive work environment
  • Equal opportunity employer

Interested in this job?

Jobs Related To Salesforce Lead Site Reliability Engineer