Taro Logo

Software Developer - AI Infra Compute

World leader in cloud solutions, providing tomorrow's technology to tackle today's challenges for over 40+ years.
$79,800 - $178,100
Backend
Senior Software Engineer
In-Person
5,000+ Employees
4+ years of experience
AI · Enterprise SaaS · Cloud

Description For Software Developer - AI Infra Compute

OCI (Oracle Cloud Infrastructure) AI Infrastructure is leading the development of a cutting-edge, ultra-high-performance GPU platform for AI/ML/HPC workloads. This role is part of the team responsible for designing and developing fundamental architectural changes for GPU delivery, health monitoring, triage automation, and diagnostic services.

The position offers the opportunity to work on innovative projects building groundbreaking solutions from the ground up, as part of a young, fast-growing team working on ambitious initiatives. The environment is dynamic and agile, emphasizing learning and adaptability.

Key Aspects:

  • Work on distributed systems supporting AI/ML/HPC workloads across thousands of GPUs
  • Utilize technologies like RoCE and Infiniband
  • Focus on system scalability and performance optimization
  • Collaborate with various teams including Network and Data Center operations

The ideal candidate will be:

  • A self-motivated engineer with quick learning ability
  • Experienced in distributed systems and algorithms
  • Comfortable with software debugging and low-level systems troubleshooting
  • Passionate about simple and scalable solutions
  • Strong in collaborative work and communication

This role offers the chance to be at the forefront of AI technology advancement while working with cutting-edge infrastructure at scale. The position combines deep technical challenges with the opportunity to impact the future of cloud computing and AI infrastructure.

Last updated 2 hours ago

Responsibilities For Software Developer - AI Infra Compute

  • Designing, implementing, and delivering software, firmware for managing GPU based AI servers
  • Working closely with partner teams to deliver high quality software to manage, triage and repair GPU systems
  • Working closely with product teams to debug, resolve customer's issues

Requirements For Software Developer - AI Infra Compute

Python
Java
Linux
MySQL
Redis
  • BS or MS degree in Computer Science or relevant technical field
  • Deep understanding of operating systems, computer networks, and high-performance applications
  • 4+ years experience delivering and operating large-scale production systems
  • Proficient in one programming language(java/python/c/c++/goLang/shell scripting)
  • Strong background in Linux systems
  • Familiarity with system-level architecture, data synchronization, fault tolerance, and state management
  • Good understanding of databases and SQL (MySQL) and caching technologies

Benefits For Software Developer - AI Infra Compute

Medical Insurance
Dental Insurance
Vision Insurance
401k
Parental Leave
Mental Health Assistance
  • Medical, dental, and vision insurance
  • Short term and long term disability
  • Life insurance and AD&D
  • Health care and dependent care Flexible Spending Accounts
  • 401(k) Savings and Investment Plan with company match
  • Paid parental leave
  • Flexible Vacation
  • 11 paid holidays
  • 72 hours of paid sick leave
  • Adoption assistance
  • Employee Stock Purchase Plan

Interested in this job?

Jobs Related To Oracle Software Developer - AI Infra Compute