Software Developer - AI Infra Compute

Oracle is a world leader in cloud solutions, using tomorrow's technology to tackle today's challenges. They've partnered with industry-leaders in almost every sector and have been operating with integrity for over 40 years.
$79,800 - $178,100
Machine Learning
Senior Software Engineer
In-Person
5,000+ Employees
4+ years of experience
AI

Description For Software Developer - AI Infra Compute

OCI (Oracle Cloud Infrastructure) AI Infrastructure is at the forefront of building a cutting-edge, ultra-high-performance GPU platform designed to support AI/ML/HPC workloads. This role offers an opportunity to be part of the AI revolution, creating systems that allow customers to scale from tens to thousands of GPUs without compromising performance.

The team is responsible for designing and developing fundamental architectural changes for GPU delivery, health monitoring, triage automation, and diagnostic services. These are essential for running distributed AI/ML/HPC workloads across thousands of GPUs, leveraging technologies like RoCE and Infiniband.

As a Senior Software Engineer in the AI Infrastructure team, you'll be working on innovative projects building groundbreaking solutions from the ground up. You'll be part of a young, fast-growing team working on ambitious new initiatives in a dynamic, agile environment where learning and adaptability are key.

The ideal candidate is a self-motivated individual with quick learning ability and technical excellence. You should be a rock-solid developer with deep understanding of distributed systems and algorithms, comfortable diving deep into any part of the stack, as well as software debugging and low-level systems troubleshooting.

The role offers competitive compensation ranging from $79,800 to $178,100 per annum, with potential for bonus and equity. Oracle provides comprehensive benefits including medical, dental, vision insurance, 401(k) with company match, flexible vacation, and paid parental leave.

This is an exciting opportunity to join a team that's pushing the boundaries of AI technology while working with cutting-edge GPU infrastructure and distributed systems. You'll be contributing to critical systems that enable large-scale AI/ML workloads, making a direct impact on Oracle's cloud infrastructure capabilities.

Last updated 19 hours ago

Responsibilities For Software Developer - AI Infra Compute

  • Designing, implementing, and delivering software, firmware for managing GPU based AI servers
  • Working closely with partner teams to deliver high quality software to manage, triage and repair GPU systems
  • Working closely with product teams to debug, resolve customer's issues

Requirements For Software Developer - AI Infra Compute

Python
Java
Go
Linux
MySQL
Redis
  • BS or MS degree in Computer Science or relevant technical field
  • Deep understanding of operating systems, computer networks, and high-performance applications
  • 4+ years experience delivering and operating large-scale production systems
  • Proficient in one programming language (java/python/c/c++/goLang/shell scripting)
  • Strong background in Linux systems
  • Familiarity with system-level architecture, data synchronization, fault tolerance, and state management
  • Good understanding of databases and SQL (MySQL) and caching technologies (Redis, Memcache etc)

Benefits For Software Developer - AI Infra Compute

Medical Insurance
Dental Insurance
Vision Insurance
401k
Parental Leave
Education Budget
  • Medical, dental, and vision insurance
  • Short term and long term disability
  • Life insurance and AD&D
  • Health care and dependent care Flexible Spending Accounts
  • 401(k) with company match
  • Flexible Vacation
  • 11 paid holidays
  • Paid sick leave
  • Paid parental leave
  • Adoption assistance
  • Employee Stock Purchase Plan
  • Financial planning and group legal

Interested in this job?

Jobs Related To Oracle Software Developer - AI Infra Compute

Senior ML Engineer

Senior ML Engineer position at Oracle Health & AI, focusing on LLMs and Generative AI for healthcare solutions, requiring 6+ years of experience in machine learning and MLOps.

Senior Software Engineer - NetSuite AI/ML

Senior Software Engineer position at Oracle NetSuite focusing on AI/ML integration, offering competitive benefits and the opportunity to work on cutting-edge AI technologies.

AI Developer - ACS Business Process

Senior AI Developer role at Oracle focusing on implementing AI solutions for Customer Success using GenAI, RAG, and Cohere LLM models.

Senior Detections Developer with ML/AI

Senior Detections Developer role at Oracle focusing on ML/AI and security, requiring 6+ years of experience in detection engineering and security.

Senior Machine Learning Engineer

Senior Machine Learning Engineer position at Oracle focusing on developing AI and ML services for Oracle Cloud Infrastructure (OCI)