As a senior member of the Site Reliability Engineering (SRE) team at Oracle, you'll play a crucial role in maintaining and improving our cloud infrastructure. This position combines deep technical expertise with leadership responsibilities, requiring both hands-on engineering skills and the ability to guide teams toward operational excellence.
You'll be responsible for designing and implementing high-availability architectures for large-scale distributed systems, while serving as the final escalation point for complex operational issues. The role demands expertise in automation, monitoring, and system optimization, with a focus on defining and meeting SLAs and SLOs.
Oracle offers a compelling environment for SRE professionals, with access to cutting-edge cloud technology and the opportunity to work on systems that power thousands of enterprises worldwide. You'll collaborate with talented engineers across teams, mentor junior staff, and drive technical decision-making that impacts our global infrastructure.
The ideal candidate brings at least 3 to 5 years of experience, with strong Linux administration skills, Python programming expertise, and deep knowledge of distributed systems. You'll need to be comfortable both writing production-grade software and managing complex infrastructure, while maintaining a focus on automation and operational efficiency.
This role offers competitive benefits, including medical coverage, life insurance, and retirement options, along with opportunities for professional growth and development. Join Oracle's SRE team to tackle challenging technical problems while building and maintaining systems that operate at massive scale.