OCI is Oracle's next-generation cloud platform, built for the most demanding enterprise workloads. The AI Platform, Services & Solutions organization within OCI is building a robust ecosystem to support the end-to-end lifecycle of AI and machine learning workloads. From GPU infrastructure and training pipelines to model serving and deployment tools, they empower teams across Oracle and their customers to build and deploy AI at scale.
As a Principal Software Engineer, you'll work on critical components of OCI's AI platform, including high-scale GPU cluster management, self-service ML infrastructure, and model serving systems. You'll be part of a team building cloud services that power Oracle's GenAI and ML initiatives, with high visibility across Oracle Cloud.
This role offers the opportunity to:
The position requires deep technical expertise in distributed systems, cloud infrastructure, and software development, with a focus on building scalable and resilient services. You'll work independently while providing technical leadership to the broader organization, balancing feature development with operational excellence.