Site Reliability Engineer

Microsoft is a global technology company that develops, manufactures, and sells computer software, consumer electronics, and personal computers.
Site Reliability
Senior Software Engineer
Hybrid
5,000+ Employees
5+ years of experience
Enterprise SaaS · Cloud

Description For Site Reliability Engineer

Microsoft's Azure Customer Experience (CXP) Customer Reliability Engineering (CRE) team is seeking a passionate Site Reliability Engineer to join its world-class customer reliability initiatives. This role is part of Azure Engineering's top-level pillar that leads customer-centric experiences at scale.

The position involves working on Azure, one of Microsoft's most exciting products, where you'll be responsible for improving customer experience, diagnosing and troubleshooting mission-critical customer applications, and driving platform reliability improvements. You'll work directly with customers, support teams, and engineering to ensure optimal service delivery and reliability.

As a Site Reliability Engineer, you'll be accountable for various aspects of Azure's platform reliability, including availability, resiliency, and uptime at scale. The role requires participation in on-call rotations, collaboration with engineering teams, and driving continuous improvement through customer feedback and data analysis.

The ideal candidate will bring strong technical expertise in cloud platforms, automation skills, and a deep understanding of high availability and disaster recovery principles. You'll need excellent communication skills to work effectively with diverse stakeholders, from high-profile customers to executive management.

This is a hybrid role based in Sydney, allowing up to 50% work from home, offering a perfect balance between collaborative office work and remote flexibility. Microsoft provides comprehensive benefits including industry-leading healthcare, educational resources, parental leave, and investment opportunities.

The role offers exciting opportunities to work on cutting-edge cloud technology while directly impacting customer success. You'll be part of a team that values customer empathy, technical excellence, and continuous innovation, making it an ideal position for someone passionate about reliability engineering and customer success at scale.

Join Microsoft's Azure team to help shape the future of cloud computing while working with some of the industry's brightest minds in a collaborative, innovation-driven environment.

Responsibilities For Site Reliability Engineer

  • Participate in an on-call coverage rotation (approximately 15% of the time) for platform communications and security
  • Collaborate closely with engineering and product management teams to drive product improvements based on customer feedback
  • Improve the customer experience by analyzing signals from various sources and driving root cause analyses (RCAs)
  • Drive continuous improvement in the Azure platform by incorporating feedback
  • Identify and drive requirements for enhanced customer resiliency and platform reliability
  • Participate in the design of next-generation architecture for cloud infrastructure services
  • Be enthusiastic, self-motivated, and a great team player
  • Demonstrate excellent collaboration, organizational, and time management skills

Requirements For Site Reliability Engineer

  • Must have service engineering experience in a 24/7/365 enterprise environment
  • Technical expertise in Azure services and capabilities or cloud platforms (desired)
  • Fluency in one or more automation languages (e.g., PowerShell, CLI)
  • Strong communication skills
  • Understanding of high availability, disaster recovery, business continuity, and performance tuning
  • Strong knowledge of the Windows platform or Linux
  • BS/BA in computer science, engineering, mathematics, or equivalent experience
  • Must pass Microsoft Cloud Background Check

Benefits For Site Reliability Engineer

Medical Insurance · Parental Leave · Education Budget · Vision Insurance · Dental Insurance
  • Industry-leading healthcare
  • Educational resources
  • Discounts on products and services
  • Savings and investments
  • Maternity and paternity leave
  • Generous time away
  • Giving programs
  • Opportunities to network and connect
