Taro Logo

Senior Site Reliability Engineer

BrainGu is a technology company that builds developer platforms to enable organizations to build high quality software faster and at lower cost.
$150,000 - $170,000
Site Reliability
Senior Software Engineer
In-Person
6+ years of experience
Enterprise SaaS
This job posting may no longer be active. You may be interested in these related jobs instead:
Senior Site Reliability Engineer

Senior Site Reliability Engineer role at BrainGu focusing on designing and implementing scalable systems and infrastructure automation.

Site Reliability Engineer III - Corporate Oversight and Governance Technology

Senior Site Reliability Engineer role at JPMorgan Chase focusing on implementing and maintaining reliable, scalable systems for Corporate Oversight and Governance Technology division.

Software Engineer III - Site Reliability Engineer

Senior SRE position at JPMorgan Chase focusing on system reliability, observability, and performance optimization using Python/Java and modern DevOps tools.

Sr. Site Reliability Engineer

Senior Site Reliability Engineer role at Adobe focusing on platform reliability, scalability, and DevOps practices, offering competitive compensation and opportunity to work with cutting-edge technology.

Site Reliability Engineer

Senior Site Reliability Engineer role at Dark Wolf Solutions supporting NBIS with cloud operations and monitoring.

Description For Senior Site Reliability Engineer

BrainGu, a technology company specializing in developer platforms, is seeking a Senior Site Reliability Engineer to join their Engineering Operations Value Stream (EngOps) team. This role focuses on supporting their flagship Developer Experience Platform, SmoothGlue, and advancing their SRE strategy and operating model.

The position offers a competitive salary range of $150,000 - $170,000 and is based in Boston, Massachusetts. The role requires a willingness to travel up to 50% and the ability to obtain and maintain a Top Secret Clearance.

As an SRE at BrainGu, you'll be responsible for designing and implementing highly available, scalable systems, working with infrastructure as code, and managing comprehensive monitoring solutions. The role requires expertise in container technologies, cloud platforms (AWS/GCP/Azure), and modern DevOps tools. You'll work closely with the EngOps CTO and Platform Product team to drive roadmaps and organizational maturity.

The ideal candidate brings 6+ years of relevant experience, strong communication skills, and extensive knowledge of SRE practices. Experience with Kubernetes, container technologies, and cloud environments is essential. The role offers significant opportunity for technical leadership and mentorship of junior team members.

BrainGu offers an impressive benefits package including 12 weeks of paid parental leave, comprehensive health coverage, 401(k) matching, and unique perks like a $10k "BrainBudget" for professional development and a $1,500 "Battle Station Budget" for home office setup. The company maintains a strong focus on supporting veterans with additional benefits including a Supplemental Tricare plan and monthly stipends.

This role is perfect for an experienced SRE who wants to make a significant impact on developer platforms while working with cutting-edge technologies in a collaborative environment. The position offers both technical challenges and leadership opportunities, making it ideal for someone looking to advance their career in site reliability engineering while contributing to meaningful projects.

Last updated 25 days ago

Responsibilities For Senior Site Reliability Engineer

  • Design, implement, and manage highly available, scalable, and fault-tolerant systems
  • Collaborate with software engineering teams to optimize application performance
  • Develop and maintain infrastructure as code using tools like Terraform, Ansible
  • Implement CI/CD pipelines
  • Establish and maintain monitoring, alerting, and logging systems
  • Respond to incidents and participate in on-call rotations
  • Analyze system performance and implement optimizations
  • Conduct capacity planning
  • Collaborate with security teams on system and data protection
  • Provide mentorship and technical leadership to junior SREs

Requirements For Senior Site Reliability Engineer

Kubernetes
Go
Python
TypeScript
  • Bachelor's degree or equivalent work experience
  • 6+ years of relevant work experience
  • Highly motivated self-starter with excellent interpersonal and communication skills
  • Highly developed documentation skills
  • Experience working in customer facing role
  • Certification or formal training in site reliability engineering
  • Experience with SLIs, SLOs and observability capabilities at large scale
  • Experience with k8s and container technologies
  • Experience troubleshooting routing and networking in cloud environment
  • Experience with Secrets products such as HashiCorp Vault or CyberArk
  • AWS Solutions Architect - Associate certification preferred
  • Willing to obtain and maintain a Top Secret Clearance
  • Willing to travel up to 50%

Benefits For Senior Site Reliability Engineer

401k
Medical Insurance
Dental Insurance
Vision Insurance
Parental Leave
Education Budget
  • 12 weeks of fully paid parental leave for birth or adoption
  • 31 days of PTO, including federal holidays
  • 100% employer-paid insurance plans (employee-only)
  • 401(k) matching up to 5%
  • $10k BrainBudget for personal and professional growth
  • $1,500 Battle Station Budget for home office
  • 85% paid healthcare premiums for family
  • Monthly cell phone and internet stipend
  • Supplemental Tricare plan for Veterans
  • Monthly stipend for Veterans

Interested in this job?