This is a unique opportunity to join Oracle's world-class team engineering cutting-edge cloud technologies and infrastructure. As a Site Reliability Engineer, you'll be part of the Oracle Cloud Infrastructure group, focusing on the Object Storage system, a highly durable and available regional service. The role combines systems engineering, automation, network operations, and data engineering, and requires expertise in Java, Python, and Linux.
You'll write code and build automation to scale deployment and mission-critical operations across multiple global datacenters. Key responsibilities include resiliency engineering, single point of failure (SPOF) elimination, incident response, capacity planning, and monitoring. The position requires participating in 24/7 on-call rotations and managing customer escalations.
The ideal candidate has 6-10+ years of experience, strong software development skills, and deep knowledge of distributed systems. You'll work with state-of-the-art cloud computing technologies, contributing to the reliability, scalability, and security of Oracle Cloud services. The role offers competitive compensation ($97,500-$199,500) and comprehensive benefits, including medical, dental, vision, 401(k), and flexible vacation.
This is an excellent opportunity for experienced engineers who value simplicity and scale, thrive in collaborative environments, and are passionate about building and maintaining large-scale distributed systems. You'll be part of a team ensuring the success of Oracle's cloud infrastructure while tackling interesting technical challenges daily.