Taro Logo

Staff Site Reliability Engineer

AI-powered mobile marketing platform transforming brand-consumer engagement through personalized messaging
United States
$156,000 - $240,000
Site Reliability
Staff Software Engineer
Remote
1,000 - 5,000 Employees
7+ years of experience
AI · Enterprise SaaS

Description For Staff Site Reliability Engineer

Attentive, a leading AI-powered mobile marketing platform, is seeking a Staff Site Reliability Engineer to join their Platform Infrastructure team. This role is crucial in maintaining and enhancing the platform that handles billions of events from over 100 million customers daily. As a Staff SRE, you'll be responsible for designing and implementing solutions that improve system reliability and scalability while mentoring others and influencing technical roadmaps.

The position offers an opportunity to work with cutting-edge technologies including Kubernetes, AWS EKS, Istio, and various AWS services. You'll be part of the Infrastructure and Platform organization, specifically the Production Engineering Team, focusing on delivering a fast and reliable platform that empowers Attentive engineers to deliver solutions quickly and safely.

The role combines technical leadership with hands-on engineering, requiring expertise in reliability concepts, strong coding abilities, and excellent communication skills. You'll collaborate across multiple teams including AI/ML, Data, Platform, and Product teams to develop best-in-class services. The position offers competitive compensation ($156,000 - $240,000 annually) plus equity and benefits.

Attentive has been recognized by Deloitte's Fast 500, LinkedIn's Top Startups, and Forbes Cloud 100, making this an excellent opportunity to join a rapidly growing, successful company. The remote work environment and strong company values focusing on action, teamwork, customer success, and ownership create an ideal setting for professional growth and impact.

Last updated 8 days ago

Responsibilities For Staff Site Reliability Engineer

  • Design and implement systems that enhance reliability, observability, traceability, and incident management
  • Take ownership of cross-team collaborations and drive impactful projects
  • Collaborate with engineers from AI/ML, Data, Platform, and Product teams
  • Define and enforce production standards, processes, and tools
  • Advocate for and implement SLIs, SLOs, and other reliability-focused metrics
  • Guide and mentor team members
  • Drive continuous improvement and innovation

Requirements For Staff Site Reliability Engineer

Go
Java
JavaScript
Kubernetes
Python
PostgreSQL
React
Redis
TypeScript
  • 7+ years of experience in Production Engineering, Backend Engineering, SRE, DevOps or similar role
  • Strong coding ability in at least one language (e.g., Golang, Python, Java, Typescript)
  • Demonstrated experience delivering medium to large-scale projects
  • Deep understanding of production reliability concepts, including SLIs, SLOs, and incident management
  • Excellent verbal and written communication skills
  • Familiarity with working in dynamic, reliability-focused production environments

Benefits For Staff Site Reliability Engineer

Medical Insurance
Equity
  • Competitive salary
  • Equity compensation
  • Health & wellness benefits

Interested in this job?

Jobs Related To Attentive Staff Site Reliability Engineer