Lead Site Reliability Engineer – Cloud Platform (AWS)

Toyota Financial Services (TFS) is the finance and insurance brand for Toyota and Lexus in North America, delivering best-in-class customer experience.
Plano, TX, USA
Site Reliability
Staff Software Engineer
In-Person
5,000+ Employees
7+ years of experience
Finance · Automotive

Job Description

Toyota Financial Services (TFS) is seeking a Lead Site Reliability Engineer to spearhead its cloud platform operations on AWS. This role sits at the intersection of infrastructure management and engineering excellence, focusing on building resilient, self-healing systems that power TFS's critical financial operations. As part of Toyota, one of the world's most admired brands, you'll work in a collaborative environment that values innovation and continuous improvement.

The position requires deep expertise in cloud infrastructure and SRE best practices, with responsibilities ranging from operating cloud-native infrastructure to implementing advanced observability solutions. You'll work with AWS services including EKS, Lambda, and Cloud WAN, while building automation workflows that enhance system reliability and reduce manual operations.

The ideal candidate brings 7+ years of relevant experience and a strong foundation in SRE principles. You'll be responsible for defining and tracking service level objectives, managing infrastructure as code with Terraform, and participating in incident management processes. The role offers an opportunity to work with a diverse tech stack including Python, Kubernetes, and various AWS services.

TFS offers a comprehensive benefits package including healthcare, 401(k) with company match, vehicle purchase discounts, and professional development opportunities. The company culture emphasizes teamwork, respect, and innovation, making it an ideal environment for those who want to make a significant impact while working with enterprise-scale cloud infrastructure.

This position represents an excellent opportunity for an experienced SRE professional to join a leading financial services organization and help shape the future of its cloud infrastructure while enjoying the stability and benefits of working for a Fortune 500 company.

Last updated 19 days ago

Responsibilities For Lead Site Reliability Engineer – Cloud Platform (AWS)

  • Operate and optimize cloud-native infrastructure in AWS, focusing on EKS, Lambda, Cloud WAN, Systems Manager, and ECR
  • Build and maintain self-healing automation workflows
  • Create and manage AWS Systems Manager Automation Documents
  • Define and track SLIs/SLOs and error budgets
  • Implement observability using Dynatrace and AWS-native tools
  • Develop and maintain infrastructure as code using Terraform
  • Enhance and support CI/CD pipelines using GitHub and Harness
  • Participate in incident management and on-call rotations
  • Lead blameless postmortems
  • Collaborate with cloud development teams
  • Troubleshoot cloud infrastructure and networking issues

Requirements For Lead Site Reliability Engineer – Cloud Platform (AWS)

Python · Kubernetes
  • 7+ years of experience in SRE, DevOps, or Cloud Infrastructure roles
  • Solid understanding of SRE principles: SLIs, SLOs, error budgets, incident response
  • Hands-on experience with AWS services
  • Strong knowledge of network architecture and protocols within AWS
  • Experience building automated remediation and self-healing systems
  • Proficiency with Terraform, Python, Bash, and infrastructure as code principles
  • Experience with CI/CD tools and observability platforms
  • Familiarity with ITSM processes and cloud security best practices
  • Excellent troubleshooting, problem-solving, and collaboration skills

Benefits For Lead Site Reliability Engineer – Cloud Platform (AWS)

401(k) · Medical Insurance · Dental Insurance · Vision Insurance · Education Budget · Relocation Benefits
  • Professional growth and development programs
  • Tuition reimbursement
  • Team Member Vehicle Purchase Discount
  • Toyota Team Member Lease Vehicle Program
  • Comprehensive health care and wellness plans
  • Toyota 401(k) Savings Plan with company match
  • Annual retirement contribution
  • Paid holidays and paid time off
  • Referral services for prenatal services, adoption, childcare, and schools
  • Tax-advantaged accounts
  • Relocation assistance