Taro Logo

Site Reliability Engineer

World leader in cloud solutions, using tomorrow's technology to tackle today's challenges. Partnered with industry-leaders in almost every sector and operating with integrity for 40+ years.
$63,000 - $126,100
Site Reliability
Mid-Level Software Engineer
In-Person
5,000+ Employees
3+ years of experience
Enterprise SaaS · Cloud
This job posting may no longer be active. You may be interested in these related jobs instead:

Description For Site Reliability Engineer

At Oracle Cloud Infrastructure (OCI), we build the more intelligent future of cloud. OCI Sovereign Cloud is a team focused on bringing the world's most important work to OCI. We build and operate government, classified, and sovereign cloud regions to be reliable and high-performance, just like our public cloud. This role involves providing cloud operations for OCI ONSR realms as part of a dynamic team with broad knowledge of Oracle's cloud platform. You'll work closely with customer support, service owners, and engineering teams globally to ensure high-quality service. The position requires 24/7 shift rotation including nights, weekends and holidays, managing complex change management, supporting new services, mentoring team members, and driving automation to improve efficiency. You'll work with cutting-edge technologies including Linux, Docker, Kubernetes, and various programming languages while helping maintain mission-critical infrastructure for government and classified environments. The role offers competitive compensation, comprehensive benefits, and the opportunity to work on sophisticated cloud infrastructure at scale.

Last updated 2 months ago

Responsibilities For Site Reliability Engineer

  • Provide cloud operations for OCI ONSR realms
  • Work 24/7 shift rotation with on-call duties
  • Manage and execute complex manual Change Management tickets
  • Support on-boarding of new services and tools
  • Provide mentorship and training to SOEs
  • Create and maintain documentation for operational processes
  • Identify areas of manual work and drive automation
  • Ensure timely resolution of incidents and service requests
  • Collaborate with global service and engineering teams

Requirements For Site Reliability Engineer

Linux
Python
Go
Java
Kubernetes
  • US Citizenship and TS/SCI w/Poly security clearance
  • Technology related bachelor's degree or equivalent work experience
  • Proficient with Python, Bash, Ruby, Perl, JavaScript, or Java
  • Deep knowledge of Linux internals and host-based networking
  • Experience with configuration management solutions
  • Experience with monitoring solutions for large scale environments
  • Knowledge of cloud computing concepts
  • Experience working in mission-critical environment
  • Strong communication skills

Benefits For Site Reliability Engineer

Medical Insurance
Dental Insurance
Vision Insurance
401k
Parental Leave
  • Medical, dental, and vision insurance
  • Short term and long term disability
  • Life insurance and AD&D
  • Health care and dependent care Flexible Spending Accounts
  • 401(k) with company match
  • Flexible Vacation
  • 11 paid holidays
  • Paid sick leave
  • Paid parental leave
  • Adoption assistance
  • Employee Stock Purchase Plan

Interested in this job?