Site Reliability Engineer

Microsoft is a leading global technology company that provides computing services and solutions to governments, utilities, schools, and organizations worldwide.
Site Reliability
Mid-Level Software Engineer
Hybrid
5,000+ Employees
1+ year of experience
Enterprise SaaS

Description For Site Reliability Engineer

Microsoft is seeking a Site Reliability Engineer (SRE) to join the OneDrive SharePoint (ODSP) team. As an SRE, you'll be responsible for ensuring the reliability and performance of Microsoft's critical services. The role combines software engineering and operations, focusing on making large-scale systems more efficient and reliable.

SREs at Microsoft take an engineering-based approach to solving operations problems, supporting customers and improving complex systems. The position requires expertise in distributed systems, monitoring, and incident response. You'll build, monitor, and maintain systems that ensure customers can quickly access their data and run workloads whenever needed.

The role involves responding to customer escalations, identifying service problems, and implementing solutions. You'll collaborate closely with product engineering teams, participate in code reviews, and join on-call rotations. The work directly impacts the success of many Microsoft services, making it a critical position within the organization.

Key aspects of the role include:

  • Optimizing code and improving system observability
  • Participating in incident response and on-call rotations
  • Troubleshooting complex distributed systems
  • Implementing automation and tooling improvements
  • Collaborating with engineering teams on design and code reviews

The ideal candidate will have 1-3 years of technical experience or relevant education, with a strong interest in reliability engineering. This position offers the opportunity to work with cutting-edge technology at scale, while contributing to services used by millions of customers worldwide.

Microsoft offers a comprehensive benefits package, including industry-leading healthcare, educational resources, and generous time off. The position is based in Dublin, Ireland, with a hybrid work arrangement allowing up to 50% work from home. This is an excellent opportunity for someone passionate about reliability engineering to join a global technology leader and make a significant impact.

Last updated 12 days ago

Responsibilities For Site Reliability Engineer

  • Develops technical expertise in code, features, and operations of specific products
  • Develops, tests, and implements changes to optimize code and improve observability, reliability, and operability
  • Participates in code/design reviews and regular meetings with engineering teams
  • Responds to incidents during on-call rotations
  • Troubleshoots problems affecting availability, reliability, performance, and efficiency
  • Implements configuration and data changes using code, tooling, and automation

Requirements For Site Reliability Engineer

Kubernetes
Linux
  • 1-3 years of technical experience in software engineering, network engineering, or systems administration, OR a Bachelor's Degree in Computer Science
  • Must be legally authorized to work in Ireland
  • Must pass the Microsoft Cloud background check
  • Experience with infrastructure, scale, performance, and distributed systems (preferred)
  • 4+ years of technical experience (preferred)

Benefits For Site Reliability Engineer

Medical Insurance
Education Budget
Parental Leave
  • Industry-leading healthcare
  • Educational resources
  • Discounts on products and services
  • Savings and investments
  • Maternity and paternity leave
  • Generous time away
  • Giving programs
  • Networking opportunities
