Taro Logo

Senior Software Engineer, Site Reliability Tooling

Leading AI lending marketplace partnering with banks and credit unions to expand access to affordable credit using AI technology.
San Mateo, CA, USAColumbus, OH, USAAustin, TX, USA
$163,600 - $226,400
Site Reliability
Senior Software Engineer
Remote
1,000 - 5,000 Employees
6+ years of experience
AI · Finance

Job Description

Upstart is the leading AI lending marketplace partnering with banks and credit unions to expand access to affordable credit. By leveraging Upstart's AI marketplace, Upstart-powered banks and credit unions can have higher approval rates and lower loss rates across races, ages, and genders, while simultaneously delivering the exceptional digital-first lending experience their customers demand. More than 80% of borrowers are approved instantly, with zero documentation to upload.

The Site Reliability Engineering (SRE) team owns the reliability, resiliency, and observability of Upstart's production systems. As a Senior Software Engineer focused on Site Reliability Tooling, you'll build tooling and automation to monitor infrastructure health and create a fast, reliable environment for engineers and customers. You'll define strategy for technology operations risk mitigation, including disaster planning and on-call procedures.

Key Responsibilities:

  • Implement standards for monitoring microservices, web apps, mobile apps, databases, Kubernetes clusters, and ML platforms
  • Improve incident response practices across the company
  • Automate away toil where beneficial
  • Exercise state-of-the-art SRE practices throughout the company
  • Uphold culture of visibility, ownership, and responsibility around service reliability

Requirements:

  • 6+ years of combined Software Engineering, Site Reliability, or DevOps Engineering experience
  • Proficiency in Python, Go, JavaScript/TypeScript
  • Strong infrastructure as code experience (Terraform, CDK, CloudFormation)
  • Software engineering background with internal tooling experience
  • Strong software design & architecture skills
  • Experience with observability tools like Datadog, Prometheus
  • Experience supporting SaaS in microservice cloud environments
  • Data-driven mindset and metrics focus

Benefits include competitive compensation, comprehensive health coverage, 401(k) matching, ESPP, generous PTO, parental leave, wellness programs, and regular team collaboration opportunities. As a digital-first company, most employees work remotely with offices in San Mateo, Columbus, and Austin available for hybrid work.

Last updated a day ago

Responsibilities For Senior Software Engineer, Site Reliability Tooling

  • Implement monitoring standards for microservices, web apps, and ML platforms
  • Improve incident response practices
  • Automate away toil where beneficial
  • Exercise SRE practices throughout the company
  • Uphold culture of visibility and reliability

Requirements For Senior Software Engineer, Site Reliability Tooling

Python
Go
JavaScript
TypeScript
Kubernetes
  • 6+ years of combined Software Engineering, Site Reliability, or DevOps Engineering experience
  • Proficiency in Python, Go, JavaScript/TypeScript
  • Infrastructure as Code experience (Terraform, CDK, CloudFormation)
  • Strong software design & architecture skills
  • Experience with observability tools (Datadog, Prometheus)
  • Experience supporting SaaS in microservice cloud environments
  • Data-driven mindset

Benefits For Senior Software Engineer, Site Reliability Tooling

401k
Medical Insurance
Dental Insurance
Vision Insurance
Parental Leave
Equity
  • Competitive Compensation (base + bonus & equity)
  • Medical, dental, and vision coverage
  • 401(k) with 100% company match up to $4,500
  • Employee Stock Purchase Plan (ESPP)
  • Life and disability insurance
  • Generous PTO and holidays
  • Parental and family care leave
  • Wellness and technology reimbursement
  • Team events and social activities