Principal Cloud Operation Engineer

A world leader in cloud solutions that uses tomorrow's technology to tackle today's challenges. The company has operated for 40+ years and partners with industry leaders across sectors.
Cloud
Principal Software Engineer
In-Person
5,000+ Employees
8+ years of experience
Cloud · Enterprise SaaS

Description For Principal Cloud Operation Engineer

At Oracle Cloud Infrastructure (OCI), we build the future of the cloud for enterprises as a diverse team of creators and inventors. We act with the speed of a start-up and the scale of the leading enterprise software company globally. The Compute organization is core to OCI, responsible for providing virtual machine (VM) and bare metal (BM) compute power, which is fundamental to cloud infrastructure.

As a Principal Cloud Operation Engineer, you'll work with product teams on shared full stack ownership of services and technology areas. You'll be responsible for understanding end-to-end configuration, technical dependencies, and service characteristics. Key responsibilities include mitigating critical customer incidents, managing deployments, and improving security, performance, availability, and scalability.

You'll serve as an authority for end-to-end performance and operability, partnering with development teams to meet SLAs and unblock customers. The role requires deep technical knowledge to troubleshoot complex issues and define mitigations, along with an understanding of how they affect distributed systems architecture.

This position involves operating production environments, including systems and databases supporting critical business operations. You'll perform administration and analysis across multiple production environments, recommending solutions to improve availability, performance, and supportability. It's an opportunity to apply deep technical expertise in Oracle Cloud Infrastructure to provide escalation support for complex production challenges related to growth, scaling, cloud adoption, high performance, and availability.

The role is classified as IC4 level, requiring significant expertise in cloud operations and infrastructure management. You'll be part of a team that's essential to maintaining and improving Oracle's cloud services, making a direct impact on enterprise-scale cloud operations.

Last updated 8 days ago

Responsibilities For Principal Cloud Operation Engineer

  • Install, monitor, maintain, support, and optimize production server hardware and software
  • Provide escalated technical support for complex technical issues
  • Coordinate support cases and lead internal technical resources
  • Assist with server operating system and application upgrades, bug fixes, and patching
  • Provide on-call support on a rotating basis
  • Manage incidents and troubleshoot production environments
  • Maintain high availability of services
  • Test and deploy solutions and automate manual processes
  • Define and build innovative solution methodologies
  • Ensure production security posture and robust monitoring
  • Perform Root Cause Analysis

Requirements For Principal Cloud Operation Engineer

Python
Linux
Kubernetes
  • 8+ years of overall experience in the IT industry
  • Minimum 4 years of experience in a system administration or support role
  • Strong systems architecture skills
  • Strong Linux administration
  • Experience with virtualization technologies
  • Scripting skills (Python/Bash/Shell)
  • Understanding of Networking, Cloud Computing, Load Balancers
  • Experience with monitoring tools (Prometheus/Grafana, New Relic, Elastic)
  • Experience with high-scale deployments
  • Knowledge of system configuration tools (Chef, Terraform, Git, Jenkins)
  • Experience with Docker, Kubernetes
