OCI is Oracle's next-generation cloud platform, built for the most demanding enterprise workloads. The AI Platform, Services & Solutions organization within OCI is building a robust ecosystem to support the end-to-end lifecycle of AI and machine learning workloads. From GPU infrastructure and training pipelines to model serving and deployment tools, they empower teams across Oracle and their customers to build and deploy AI at scale.
As a Principal Software Engineer, you will work on critical components of OCI's AI platform, including high-scale GPU cluster management, self-service ML infrastructure, and model serving systems. You'll be part of a team building cloud services that power Oracle's GenAI and ML initiatives, with opportunities to contribute to high-impact projects visible across Oracle Cloud.
The role offers the chance to work with top engineers and researchers in a fast-paced, innovation-driven environment, while growing your career in a supportive, mission-driven team building the future of enterprise AI. You'll be responsible for architecting broad systems interactions, diving deep into any part of the stack, and leveraging cloud infrastructure knowledge to build scalable solutions.
The position requires strong technical leadership, experience with high-scale services, and the ability to make cloud-scale services resilient. You'll need to balance speed and quality, drive operational excellence, and use data-driven approaches to recommend and justify major changes to products.