Site Reliability Engineer

Microsoft empowers every person and organization on the planet to achieve more through innovative technology solutions.
$98,300 - $193,200
Site Reliability
Staff Software Engineer
Hybrid
5,000+ Employees
4+ years of experience
Enterprise SaaS · Cloud

Description For Site Reliability Engineer

Microsoft is seeking a Site Reliability Engineer to join their Cloud+AI Silver Team, focusing on deploying and operating Secure Work Areas including infrastructure for collaboration within airgapped environments. This role offers an exciting opportunity to work with engineers enabling Azure services for highly secured and regulated industries.

As an SRE, you'll be responsible for ensuring the reliability and performance of large-scale distributed systems. The position requires a blend of software engineering and operations expertise, with a focus on automation, monitoring, and incident response. You'll work in an environment where billions of users depend on the services you maintain.

The role involves being part of an on-call rotation, where you'll be responsible for responding to and resolving production issues within SLA timeframes. You'll contribute to developing automation solutions, implementing security policies, and ensuring compliance with various regulatory requirements. The position offers exposure to cutting-edge technology and the chance to work on systems at massive scale.

Microsoft offers a comprehensive benefits package including industry-leading healthcare, educational resources, savings and investment options, and generous parental leave. The company culture emphasizes growth mindset, innovation, and collaboration, making it an ideal environment for technical professionals looking to make a significant impact.

The position offers competitive compensation with a base salary range of $98,300 - $193,200 USD (higher in SF Bay Area and NYC), along with additional benefits and compensation opportunities. You'll be working in a hybrid environment with up to 50% work from home flexibility and minimal travel requirements (0-25%).

This is an excellent opportunity for experienced engineers who enjoy solving complex problems, working with distributed systems, and contributing to the reliability of services that impact billions of users worldwide. The role combines technical challenges with the opportunity to work on secure, mission-critical systems for both public and private sector customers.

Last updated 3 days ago

Responsibilities For Site Reliability Engineer

  • Monitor service for degradation, downtime, or interruptions as a Designated Responsible Individual (DRI)
  • Develop automation within production and deployment of complex product features
  • Ensure security, privacy, safety, and accessibility compliance
  • Maintain and improve product availability, reliability, efficiency, observability, and performance
  • Build reliable code following best practices
  • Communicate with key partners across Microsoft ecosystem
  • Maintain operations of live service through rotational on-call duties
  • Implement solutions for complex issues and write postmortems

Requirements For Site Reliability Engineer

  • 4+ years technical experience in software engineering, network engineering, or systems administration OR Bachelor's Degree in Computer Science with 1+ year experience OR Master's Degree in Computer Science
  • 2 years of experience working on large-scale distributed services with on-call responsibilities
  • Must pass Microsoft Cloud Background Check
  • Experience with PowerShell, C#, or C++ (preferred)
  • Project management and communication skills
  • Ability to build and influence broadly towards common goals

Benefits For Site Reliability Engineer

Medical Insurance
Dental Insurance
Vision Insurance
401k
Parental Leave
Education Budget
  • Industry leading healthcare
  • Educational resources
  • Discounts on products and services
  • Savings and investments
  • Maternity and paternity leave
  • Generous time away
  • Giving programs
  • Opportunities to network and connect

Interested in this job?

Jobs Related To Microsoft Site Reliability Engineer

Senior Site Reliability Engineer

Senior Site Reliability Engineer position at Microsoft Security, focusing on building and managing critical infrastructure for red team operations with emphasis on security and automation.

Site Reliability Engineer II/Senior Site Reliability Engineer - CTJ - Top Secret

Senior Site Reliability Engineer position at Microsoft requiring TS/SCI clearance, focusing on cloud infrastructure and government solutions.

Site Reliability Developer 3

Site Reliability Developer role at Oracle focusing on cloud infrastructure, automation, and system reliability with emphasis on security and scalability.

Site Reliability Developer 3

Site Reliability Developer role at Oracle focusing on cloud infrastructure, automation, and system reliability with emphasis on security and scalability.

Site Reliability Developer 3

Oracle is hiring a Site Reliability Developer 3 to design, implement, and maintain secure, scalable infrastructure for cloud services, focusing on automation and system reliability.