Toyota Financial Services (TFS), the finance and insurance brand for Toyota and Lexus in North America, is seeking a Lead Site Reliability Engineer to drive cloud infrastructure reliability and automation. This role sits at the intersection of cloud engineering and operational excellence, focusing on building and maintaining robust AWS-based systems.
The position requires 7+ years of experience in SRE or DevOps. The engineer will operate and optimize cloud-native infrastructure in AWS, with key technologies including EKS, Lambda, and AWS Cloud WAN, among other AWS services. The role involves building self-healing automation workflows, implementing observability solutions, and maintaining infrastructure as code with Terraform.
As a Lead SRE, you'll work closely with Cloud Platform Development, Production Engineering, and Incident Management teams. You'll be responsible for defining and tracking SLIs/SLOs, participating in on-call rotations, and leading blameless postmortems. The role requires strong expertise in AWS services, network architecture, and SRE principles.
Toyota Financial Services offers a benefits package that includes healthcare, a 401(k) with company match, annual retirement contributions, and perks such as vehicle purchase discounts. The company culture emphasizes teamwork, respect, and professional growth, with opportunities for advancement and continuous learning.
This position is ideal for someone who combines technical expertise with leadership skills, enjoys solving complex reliability challenges, and wants to build scalable, dependable cloud infrastructure. You'll join an organization that is central to Toyota's vision of moving people beyond what's possible, and you'll work with current cloud technologies and practices.