Taro Logo

Site Reliability Developer 4

A world leader in cloud solutions, using tomorrow's technology to tackle today's challenges. Operating for 40+ years with integrity and partnering with industry leaders across sectors.
United States
$97,500 - $199,500
Site Reliability
Staff Software Engineer
In-Person
5,000+ Employees
10+ years of experience
Enterprise SaaS · Cloud

Job Description

Oracle is seeking an experienced Site Reliability Engineer (SRE) to join their team in supporting Millennium customers with a focus on performance and stability. This role combines deep technical expertise in Windows systems, debugging, and cloud infrastructure with strategic service reliability oversight.

The ideal candidate will be responsible for analyzing and troubleshooting critical production issues, leveraging extensive knowledge of Windows internals and debugging tools. They will work on complex infrastructure cloud services problems and develop automation solutions to prevent recurrences. The position requires expertise in OCI (Oracle Cloud Infrastructure) services and the ability to design and implement software solutions that enhance Oracle's product availability, scalability, and efficiency.

As a Staff-level SRE, you'll be expected to provide technical leadership, partner with development teams on architectural improvements, and serve as the ultimate escalation point for complex issues. The role offers competitive compensation ranging from $97,500 to $199,500 annually, plus comprehensive benefits including medical coverage, 401(k) matching, and flexible vacation time.

This is an excellent opportunity for a seasoned technical professional who combines deep systems knowledge with strategic thinking and enjoys solving complex infrastructure challenges at scale. The position offers the chance to work with cutting-edge cloud technologies while making a significant impact on Oracle's service reliability and customer satisfaction.

Last updated 2 days ago

Responsibilities For Site Reliability Developer 4

  • Work with SRE team on shared full stack ownership of services
  • Design and deliver mission critical stack focusing on security, resiliency, scale, and performance
  • Partner with development teams in defining and implementing service architecture improvements
  • Act as ultimate escalation point for complex issues
  • Troubleshoot issues and define mitigations
  • Understand and explain product architecture decisions impact on distributed systems
  • Support Millennium customers for performance and stability

Requirements For Site Reliability Developer 4

Java
Kubernetes
  • Strong proficiency in C++, C#, Java programming languages and internals
  • Strong knowledge of Windows Internals, Hangs/Freezes, Memory, GDI and Troubleshooting skills
  • Proficiency in COM, DLL and across application tiers
  • Experience working with OCI services and resources
  • Linux knowledge
  • Proficient using WinDBG and other debugging tools
  • Code Review experience
  • Ability to work across teams to resolve stability and performance issues
  • Technical leadership capabilities
  • 10+ years of experience
  • English language proficiency

Benefits For Site Reliability Developer 4

Medical Insurance
Dental Insurance
Vision Insurance
401k
Parental Leave
  • Medical, dental, and vision insurance
  • Short term and long term disability
  • Life insurance and AD&D
  • Health care and dependent care Flexible Spending Accounts
  • Pre-tax commuter and parking benefits
  • 401(k) Savings with company match
  • Flexible Vacation
  • 11 paid holidays
  • 72 hours paid sick leave
  • Paid parental leave
  • Adoption assistance
  • Employee Stock Purchase Plan