Senior Site Reliability Engineer

BrainGu

BrainGu is a technology company that builds developer platforms to enable organizations to build high quality software faster and at lower cost.

Boston, MA, USA

$150,000 - $170,000

Site Reliability

Senior Software Engineer

In-Person

6+ years of experience

Enterprise SaaS

This job posting may no longer be active. You may be interested in these related jobs instead:

Senior Site Reliability Engineer

BrainGu

Senior Site Reliability Engineer role at BrainGu focusing on designing and implementing scalable systems and infrastructure automation.

Site Reliability Engineer III - Corporate Oversight and Governance Technology

JPMorgan Chase

Senior Site Reliability Engineer role at JPMorgan Chase focusing on implementing and maintaining reliable, scalable systems for Corporate Oversight and Governance Technology division.

Software Engineer III - Site Reliability Engineer

JPMorgan Chase

Senior SRE position at JPMorgan Chase focusing on system reliability, observability, and performance optimization using Python/Java and modern DevOps tools.

Sr. Site Reliability Engineer

Adobe

Senior Site Reliability Engineer role at Adobe focusing on platform reliability, scalability, and DevOps practices, offering competitive compensation and opportunity to work with cutting-edge technology.

Site Reliability Engineer

Dark Wolf Solutions

Senior Site Reliability Engineer role at Dark Wolf Solutions supporting NBIS with cloud operations and monitoring.

Description For Senior Site Reliability Engineer

BrainGu, a technology company specializing in developer platforms, is seeking a Senior Site Reliability Engineer to join their Engineering Operations Value Stream (EngOps) team. This role focuses on supporting their flagship Developer Experience Platform, SmoothGlue, and advancing their SRE strategy and operating model.

The position offers a competitive salary range of $150,000 - $170,000 and is based in Boston, Massachusetts. The role requires a willingness to travel up to 50% and the ability to obtain and maintain a Top Secret Clearance.

As an SRE at BrainGu, you'll be responsible for designing and implementing highly available, scalable systems, working with infrastructure as code, and managing comprehensive monitoring solutions. The role requires expertise in container technologies, cloud platforms (AWS/GCP/Azure), and modern DevOps tools. You'll work closely with the EngOps CTO and Platform Product team to drive roadmaps and organizational maturity.

The ideal candidate brings 6+ years of relevant experience, strong communication skills, and extensive knowledge of SRE practices. Experience with Kubernetes, container technologies, and cloud environments is essential. The role offers significant opportunity for technical leadership and mentorship of junior team members.

BrainGu offers an impressive benefits package including 12 weeks of paid parental leave, comprehensive health coverage, 401(k) matching, and unique perks like a $10k "BrainBudget" for professional development and a $1,500 "Battle Station Budget" for home office setup. The company maintains a strong focus on supporting veterans with additional benefits including a Supplemental Tricare plan and monthly stipends.

This role is perfect for an experienced SRE who wants to make a significant impact on developer platforms while working with cutting-edge technologies in a collaborative environment. The position offers both technical challenges and leadership opportunities, making it ideal for someone looking to advance their career in site reliability engineering while contributing to meaningful projects.

Last updated 25 days ago

Responsibilities For Senior Site Reliability Engineer

Design, implement, and manage highly available, scalable, and fault-tolerant systems
Collaborate with software engineering teams to optimize application performance
Develop and maintain infrastructure as code using tools like Terraform, Ansible
Implement CI/CD pipelines
Establish and maintain monitoring, alerting, and logging systems
Respond to incidents and participate in on-call rotations
Analyze system performance and implement optimizations
Conduct capacity planning
Collaborate with security teams on system and data protection
Provide mentorship and technical leadership to junior SREs

Requirements For Senior Site Reliability Engineer

Kubernetes

Python

TypeScript

Bachelor's degree or equivalent work experience
6+ years of relevant work experience
Highly motivated self-starter with excellent interpersonal and communication skills
Highly developed documentation skills
Experience working in customer facing role
Certification or formal training in site reliability engineering
Experience with SLIs, SLOs and observability capabilities at large scale
Experience with k8s and container technologies
Experience troubleshooting routing and networking in cloud environment
Experience with Secrets products such as HashiCorp Vault or CyberArk
AWS Solutions Architect - Associate certification preferred
Willing to obtain and maintain a Top Secret Clearance
Willing to travel up to 50%

Benefits For Senior Site Reliability Engineer

401k

Medical Insurance

Dental Insurance

Vision Insurance

Parental Leave

Education Budget

12 weeks of fully paid parental leave for birth or adoption
31 days of PTO, including federal holidays
100% employer-paid insurance plans (employee-only)
401(k) matching up to 5%
$10k BrainBudget for personal and professional growth
$1,500 Battle Station Budget for home office
85% paid healthcare premiums for family
Monthly cell phone and internet stipend
Supplemental Tricare plan for Veterans
Monthly stipend for Veterans

BrainGu

BrainGu is a technology company that builds developer platforms to enable organizations to build high quality software faster and at lower cost.

Boston, MA, USA

$150,000 - $170,000

Site Reliability

Senior Software Engineer

In-Person

6+ years of experience

Enterprise SaaS

Interested in this job?