Taro Logo

Senior Site Reliability Engineer

Enterprise software platform provider focused on delivering reliable and scalable solutions for large enterprises.
Pleasanton, CA, USA
Site Reliability
Staff Software Engineer
In-Person
8+ years of experience
Enterprise SaaS

Description For Senior Site Reliability Engineer

XperiencOps Inc is seeking a Senior Site Reliability Engineer to join their team in maintaining and improving their enterprise software platform. This role is crucial in ensuring 24/7 platform availability and performance for their largest enterprise customers worldwide. The position requires extensive experience with AWS cloud technologies and serverless architectures, particularly AWS Lambda. The SRE will be responsible for designing and implementing highly available systems, managing incident response, developing automation, and providing technical leadership. The role combines hands-on technical work with mentorship opportunities, requiring both deep technical expertise and strong communication skills. The ideal candidate will have 8+ years of experience in SRE/DevOps, strong cloud service knowledge, and a proven track record of maintaining enterprise-grade platforms. This is an opportunity to make a significant impact in a growing startup while working with cutting-edge technologies and solving complex technical challenges. The position offers competitive compensation and comprehensive benefits, including health, dental, and vision insurance, along with paid time off. The role requires participation in an on-call rotation and the ability to thrive in a fast-paced environment.

Last updated 2 months ago

Responsibilities For Senior Site Reliability Engineer

  • Design, implement, and manage highly available and scalable systems
  • Monitor, troubleshoot, and resolve platform incidents
  • Lead post-incident reviews and root cause analysis
  • Develop and maintain automation for infrastructure management
  • Optimize platform performance and scalability
  • Contribute to CI/CD pipelines
  • Partner with L2 engineers to resolve complex customer issues
  • Mentor junior engineers and provide technical leadership
  • Drive cross-functional initiatives to improve platform stability

Requirements For Senior Site Reliability Engineer

Python
Linux
Kubernetes
  • Bachelor's degree in Computer Science or related discipline
  • 8+ years in Site Reliability Engineering or DevOps role
  • 3+ years of experience in cloud services, particularly AWS
  • Experience with New Relic, Cloudwatch or similar observability systems
  • Experience with rate-limiting, API gateways, and load balancing
  • Knowledge of security best practices and compliance frameworks
  • Proficient in infrastructure as code using Terraform or CloudFormation
  • Experience with Python, Go, or Bash
  • Strong troubleshooting and debugging skills
  • Excellent communication and collaboration skills
  • Available for 24/7 on-call rotation

Benefits For Senior Site Reliability Engineer

Medical Insurance
Dental Insurance
Vision Insurance
  • Opportunity to work on cutting-edge products
  • Collaborative and fast-paced work environment
  • Chance to be part of a rapidly growing startup
  • Competitive salary and benefits package
  • Health insurance
  • Dental insurance
  • Vision insurance
  • Paid time off

Interested in this job?

Jobs Related To XperiencOps Inc Senior Site Reliability Engineer