Okta, The World's Identity Company, is seeking a Staff Site Reliability Engineer to join its Workforce Identity Cloud (WIC) team. As a critical member of Technical Operations, you'll embrace the "Always On" motto while building reliable and performant systems through automation. The role involves working with a critical SaaS platform used by millions of customers daily, managing complex containerized deployments, and driving significant replatforming initiatives.
You'll be instrumental in migrating critical components between container orchestration systems while ensuring zero downtime. The position requires deep technical expertise in cloud infrastructure, particularly AWS, along with strong programming skills in languages like Python, Rust, or Go. You'll work with Kubernetes and a range of cloud services as part of a global team supporting 24x7 operations.
The ideal candidate brings 6+ years of SRE experience, strong Linux fundamentals, and expertise in infrastructure as code. You'll have the opportunity to influence architectural decisions, mentor team members, and drive best practices across WIC engineering. Okta offers a dynamic work environment with the best tools and technology, along with comprehensive benefits and opportunities for social impact through Okta for Good.
This role combines technical leadership with hands-on engineering, requiring both depth in systems architecture and breadth across modern cloud technologies. You'll be joining a company at the forefront of identity and access management, serving over 19,300 organizations, including major enterprises like JetBlue, Nordstrom, and T-Mobile. The position offers the chance to work on challenging technical problems at scale while contributing to a product that securely connects millions of users to their essential technologies.