Taro Logo

Site Reliability Engineer (SRE)

A world leader in cloud solutions that uses tomorrow's technology to tackle today's challenges, partnering with industry-leaders in almost every sector for over 40+ years.
Site Reliability
Senior Software Engineer
In-Person
5,000+ Employees
3+ years of experience
Enterprise SaaS · Cloud

Job Description

Are you passionate about solving complex distributed systems challenges at scale? Join Oracle as a Site Reliability Engineer and help shape the reliability, scalability, and performance of Oracle Cloud Infrastructure (OCI). As part of the Site Reliability Engineering (SRE) team, you'll contribute to designing, automating, and evolving mission-critical systems that directly impact thousands of customers worldwide.

The role requires advanced Linux systems administration, strong Python coding skills, and experience with CI/CD pipelines. You'll be responsible for ensuring end-to-end reliability across various services, building automation tools, and maintaining system health metrics. Key responsibilities include designing software for enhanced availability, managing SLOs/SLAs, and participating in on-call rotations.

Oracle offers a collaborative environment where you'll work with cutting-edge cloud technology and contribute to large-scale distributed systems. The position combines deep technical expertise with modern software engineering practices, making it ideal for engineers passionate about system reliability and automation.

As an SRE at Oracle, you'll have the opportunity to influence architectural decisions, lead post-incident reviews, and build tools that enhance operational efficiency. The role offers competitive benefits, including medical and life insurance, retirement options, and work-life balance. Oracle is committed to diversity and inclusion, providing equal opportunities for all qualified candidates.

This position requires 3-5+ years of experience and strong English language skills. You'll be based in Zapopan, Mexico, working with global teams to maintain and improve Oracle's cloud infrastructure. The role offers significant growth potential and the chance to work with industry-leading cloud technology while solving complex technical challenges.

Last updated a day ago

Responsibilities For Site Reliability Engineer (SRE)

  • Collaborate with SRE and development teams to ensure end-to-end reliability
  • Design, write, and deploy software and automation tools
  • Own and evolve metrics, SLOs, SLAs, KPIs, and dashboards
  • Build tooling to reduce manual operations
  • Improve CI/CD pipelines and deployment processes
  • Review architectural designs for distributed systems
  • Lead post-incident reviews and capacity planning
  • Provide on-call support on a rotational basis

Requirements For Site Reliability Engineer (SRE)

Python
Linux
  • Advanced Linux systems administration
  • Strong coding skills in Python (automation-focused)
  • Intermediate experience with Bash/Shell scripting
  • Familiarity with networking principles and distributed systems behavior
  • Basic to intermediate knowledge of databases
  • Understanding of unit testing and modern software engineering practices
  • Experience with CI/CD pipelines and deployment automation
  • 3 to 5+ years of experience
  • English language proficiency

Benefits For Site Reliability Engineer (SRE)

Medical Insurance
  • Competitive benefits
  • Medical insurance
  • Life insurance
  • Retirement options
  • Volunteer programs