Software Engineer II (Azure HPC/AI)

Microsoft is a global technology company empowering people and organizations worldwide through innovative software, cloud services, and hardware.
Backend
Mid-Level Software Engineer
Remote
5,000+ Employees
2+ years of experience
AI · Enterprise SaaS · Cloud

Job Description

Microsoft's Azure HPC/AI Software team is expanding its engineering presence in Dublin to accelerate innovation in Azure HPC/AI Images and Microsoft HPC Pack. This role focuses on developing and maintaining performance-optimized OS images for high-performance computing and AI workloads.

As a Software Engineer II, you'll work with cutting-edge technologies including MPI libraries, GPU computing frameworks (CUDA, NCCL, ROCm, RCCL), high-speed networking (NVLink, InfiniBand, RDMA), and parallel file systems. You'll be responsible for integrating and optimizing these technologies for Azure customers while also supporting Microsoft HPC Pack's job scheduling and cluster management capabilities.

The position involves collaborating with experienced engineers and industry partners to power some of the world's most demanding workloads, from physics simulations and climate modeling to AI training on thousands of GPUs. You'll contribute to the development, testing, and maintenance of Azure HPC/AI Images, ensuring optimal performance and minimal setup time for customers.

Key responsibilities include determining user requirements, contributing to design documents, implementing code, breaking down work items, and participating in on-call rotations as a Designated Responsible Individual (DRI). The role requires staying current with technological developments to improve system availability, reliability, efficiency, and performance.

This is an excellent opportunity to work with enterprise-class cluster management systems and contribute to infrastructure supporting scientific research and AI innovation worldwide. The position offers comprehensive benefits including healthcare, educational resources, investment options, parental leave, and work-life balance.

Last updated 3 days ago

Responsibilities For Software Engineer II (Azure HPC/AI)

  • Determine user requirements for features
  • Contribute to design documents and identify dependencies
  • Create and implement code for products, services, or features
  • Break down and estimate work items
  • Monitor system/product/service as DRI during on-call rotations
  • Stay current with technological developments
  • Improve system availability, reliability, efficiency, and performance

Requirements For Software Engineer II (Azure HPC/AI)

Python · Go · Linux · Kubernetes
  • Bachelor's degree in Computer Science or a related technical field, with technical engineering experience
  • Experience coding in Bash, C, C++, C#, Python, or Go
  • Familiarity with Linux or Windows system programming
  • Experience with distributed systems or cloud services
  • Hands-on experience with HPC/AI clusters and workloads
  • Knowledge of MPI libraries, GPU computing, or large-scale cluster environments
  • Ability to work collaboratively in a global team environment
  • Must pass Microsoft Cloud Background Check

Benefits For Software Engineer II (Azure HPC/AI)

Medical Insurance · Education Budget · Parental Leave
  • Industry-leading healthcare
  • Educational resources
  • Discounts on products and services
  • Savings and investments
  • Maternity and paternity leave
  • Generous time away
  • Giving programs
  • Opportunities to network and connect