Taro Logo

Lead Site Reliability Engineer – Cloud Platform (AWS)

Toyota Financial Services (TFS) is the finance and insurance brand for Toyota and Lexus in North America, delivering on Toyota's vision to move people beyond what's possible.
Plano, TX, USA
Site Reliability
Staff Software Engineer
In-Person
5,000+ Employees
7+ years of experience
Finance · Enterprise SaaS

Job Description

Toyota Financial Services (TFS), the finance and insurance brand for Toyota and Lexus in North America, is seeking a Lead Site Reliability Engineer to drive cloud infrastructure reliability and automation. This role sits at the intersection of cloud engineering and operational excellence, focusing on building and maintaining robust AWS-based systems.

The position requires a seasoned professional with 7+ years of experience in SRE or DevOps, who will be responsible for operating and optimizing cloud-native infrastructure in AWS. Key technologies include EKS, Lambda, CloudWAN, and various AWS services. The role involves building self-healing automation workflows, implementing observability solutions, and maintaining infrastructure as code using Terraform.

As a Lead SRE, you'll work closely with Cloud Platform Development, Production Engineering, and Incident Management teams. You'll be responsible for defining and tracking SLIs/SLOs, participating in on-call rotations, and leading blameless postmortems. The role requires strong expertise in AWS services, network architecture, and SRE principles.

Toyota Financial Services offers an impressive benefits package including healthcare, 401(k) with company match, annual retirement contributions, and unique perks like vehicle purchase discounts. The company culture emphasizes teamwork, respect, and professional growth, with opportunities for advancement and continuous learning.

This position is ideal for someone who combines technical expertise with leadership capabilities, enjoys solving complex reliability challenges, and is passionate about building robust, scalable cloud infrastructure. You'll be joining a forward-thinking organization that's essential to Toyota's vision of moving people beyond what's possible, while working with cutting-edge cloud technologies and practices.

Last updated 13 days ago

Responsibilities For Lead Site Reliability Engineer – Cloud Platform (AWS)

  • Operate and optimize cloud-native infrastructure in AWS, with focus on EKS, Lambda, CloudWAN, Systems Manager, and ECR
  • Build and maintain self-healing automation workflows
  • Create and manage AWS Systems Manager (SSM) Automation Documents
  • Define and track SLIs/SLOs and error budgets
  • Implement observability using Dynatrace and AWS-native tools
  • Develop and maintain infrastructure as code using Terraform
  • Enhance and support CI/CD pipelines using GitHub and Harness
  • Participate in incident management and on-call rotations
  • Lead blameless postmortems
  • Collaborate with cloud development teams
  • Troubleshoot cloud infrastructure and networking issues

Requirements For Lead Site Reliability Engineer – Cloud Platform (AWS)

Python
Kubernetes
  • 7+ years of experience in SRE, DevOps, or Cloud Infrastructure roles
  • Solid understanding of SRE principles: SLIs, SLOs, error budgets, incident response
  • Hands-on experience with AWS services
  • Strong knowledge of network architecture and protocols within AWS
  • Experience building automated remediation and self-healing systems
  • Proficiency with Terraform, Python, Bash, and infrastructure as code principles
  • Experience with CI/CD tools and observability platforms
  • Familiarity with ITSM processes and cloud security best practices
  • Excellent troubleshooting, problem-solving, and collaboration skills

Benefits For Lead Site Reliability Engineer – Cloud Platform (AWS)

401k
Medical Insurance
Dental Insurance
Vision Insurance
Education Budget
Relocation Benefits
  • Professional growth and development programs
  • Tuition reimbursement
  • Team Member Vehicle Purchase Discount
  • Toyota Team Member Lease Vehicle Program
  • Comprehensive health care and wellness plans
  • 401(k) Savings Plan with company match
  • Annual retirement contribution
  • Paid holidays and paid time off
  • Referral services for prenatal services, adoption, childcare, schools
  • Tax Advantaged Accounts
  • Relocation assistance

Related Jobs