Principal Cloud Architect – HPC/GPU & AI Platform Solutions

A world leader in cloud solutions that uses tomorrow's technology to tackle today's challenges and has partnered with industry leaders for over 40 years.
Cloud
Principal Software Engineer
In-Person
5,000+ Employees
6+ years of experience
AI · Enterprise SaaS · Cloud

Job Description

Oracle is seeking a Principal Cloud Architect specializing in HPC/GPU & AI Platform Solutions to join its team in Singapore. This role combines deep technical expertise in cloud architecture with a focus on high-performance computing and artificial intelligence solutions. As a Principal Cloud Architect, you'll design and implement large-scale GPU/HPC infrastructure on Oracle Cloud Infrastructure (OCI) and work with enterprise customers to deliver these solutions.

The position requires extensive experience in cloud architecture, particularly in GPU and HPC environments, with strong skills in automation, containerization, and modern cloud-native technologies. You'll be working with technologies like Terraform, Ansible, Kubernetes, and various cluster management tools while supporting advanced AI/ML platforms and large language models.

This is an excellent opportunity for a seasoned technical professional who combines deep architectural expertise with strong customer-facing skills. The role offers the chance to work with Oracle's largest enterprise customers, influence product roadmaps, and contribute to the evolution of cloud computing and AI infrastructure.

Oracle offers a comprehensive benefits package and promotes a culture of innovation and inclusion. As a world leader in cloud solutions with over 40 years of industry experience, Oracle provides a stable yet dynamic environment for career growth. The company is committed to work-life balance and offers competitive benefits including medical, life insurance, and retirement options.

The ideal candidate will have 6-10+ years of relevant experience, with a proven track record in pre-sales or technical consulting roles. This position requires excellent communication skills, as you'll be interfacing with both technical teams and executive stakeholders. While the role is based in Singapore, you'll be part of a global team working on cutting-edge cloud and AI solutions.

Last updated 2 months ago

Responsibilities For Principal Cloud Architect – HPC/GPU & AI Platform Solutions

  • Architect and deploy large-scale GPU/HPC infrastructure on OCI using tools like Terraform, Ansible, Slurm and Kubernetes
  • Build automated solutions for cluster provisioning, software deployment, and infrastructure as code
  • Collaborate with Oracle's largest enterprise customers to define and tailor solutions
  • Support LLM-based solutions, agentic AI systems, and robotic AI platforms
  • Act as a trusted technical advisor for cloud migration strategies
  • Conduct customer training, workshops, and technical deep dives
  • Collaborate cross-functionally with product, support, and engineering teams
  • Develop technical assets including code samples, demos, blogs, and white papers
  • Identify and work with key AI partners

Requirements For Principal Cloud Architect – HPC/GPU & AI Platform Solutions

Python
Kubernetes
  • Hands-on expertise with GPU and HPC architecture in cloud and on-prem environments
  • Proficiency in scripting and automation: Python, Bash, PowerShell, Terraform, Ansible
  • Experience with cluster managers (Slurm, PBS, Bright), Kubernetes, and container orchestration
  • Knowledge of RDMA, InfiniBand, MPI, and distributed file systems
  • Core cloud-native experience
  • Familiarity with AI/ML platforms, large language models (LLMs), and inference serving stacks
  • 5+ years in pre-sales, technical consulting, or solution architecture
  • Strong communication and presentation skills
  • Experience working with Oracle Cloud Infrastructure (OCI) or similar cloud platforms
