Taro Logo

Senior Software Engineer, Site Reliability Engineering (SRE)

World's leading Open Payments Platform processing over $50b of GMV annually, providing payments orchestration and secure payment data storage.
Site Reliability
Senior Software Engineer
Remote
5+ years of experience
Finance · Enterprise SaaS
This job posting may no longer be active. You may be interested in these related jobs instead:

Description For Senior Software Engineer, Site Reliability Engineering (SRE)

Spreedly, the world's leading Open Payments Platform, is seeking a Senior SRE Engineer to join their team. This role is crucial in maintaining and enhancing their platform that processes over $50b in annual transactions. The position offers a unique opportunity to work on large-scale payment systems while focusing on reliability, performance, and scalability.

The ideal candidate will bring 5+ years of SRE experience and strong programming skills, particularly in Ruby and Elixir. You'll be responsible for implementing observability systems, managing incidents, optimizing performance, and serving as a reliability partner across engineering teams. The role combines hands-on technical work with strategic thinking about system reliability.

Spreedly offers an excellent compensation package including competitive salary, equity, comprehensive healthcare, and flexible PTO. The company maintains a strong focus on work-life balance and professional development, providing stipends for home office setup and continuing education. As a remote position, you'll have the flexibility to work from anywhere in the US while collaborating with a distributed team.

The company's culture emphasizes autonomy, transparency, and collaboration, making it an ideal environment for engineers who want to make a significant impact on a platform that's central to global commerce. This role offers the opportunity to work with modern technologies like AWS, Datadog, PostgreSQL, and Kafka while helping shape the future of payment processing infrastructure.

Last updated 2 months ago

Responsibilities For Senior Software Engineer, Site Reliability Engineering (SRE)

  • Design, implement, and improve observability systems using Datadog, OpenTelemetry, and other tools
  • Lead root cause analysis, incident resolution, and response rotation
  • Diagnose and resolve application-level bottlenecks in Ruby on Rails and Elixir codebases
  • Identify and fix query and indexing inefficiencies in PostgreSQL and CockroachDB
  • Serve as a reliability partner to product and infrastructure teams
  • Build developer tools to automate deployment, monitoring, and diagnostics

Requirements For Senior Software Engineer, Site Reliability Engineering (SRE)

Ruby
PostgreSQL
Kafka
  • 5+ years in SRE or related software engineering roles
  • Proficiency in a modern programming language (Ruby, Rails, and Elixir experience preferred)
  • Hands-on experience with observability tooling
  • Experience with AWS services
  • Knowledge of relational databases
  • Experience supporting incident response and postmortems
  • Prior work developing and improving SLIs/SLOs
  • Understanding of software design patterns
  • Experience mentoring other engineers
  • Application-focused SRE experience

Benefits For Senior Software Engineer, Site Reliability Engineering (SRE)

Medical Insurance
Dental Insurance
Vision Insurance
401k
Parental Leave
Education Budget
Equity
  • Competitive salary + Equity
  • 100% employer-paid Medical and Dental benefits
  • Company-paid Life and Disability insurance
  • Optional vision and supplemental insurance
  • Open Paid Time Off policy
  • 12 weeks of paid leave for new parents
  • Matching 401(k) plan (5% up to $5,000 yearly)
  • $1,000 annual professional development stipend
  • Monthly home working/digital lifestyle stipend
  • New MacBook and accessory reimbursement
  • Access to company-paid professional coaching service
  • Visits to HQ in Durham, North Carolina for remote employees