Microsoft is seeking a Site Reliability Engineer (SRE) to join the OneDrive and SharePoint (ODSP) team. As an SRE, you'll be responsible for ensuring the reliability and performance of Microsoft's critical services. The role combines software engineering and operations, focusing on making large-scale systems more efficient and reliable.
SREs at Microsoft take an engineering approach to solving operations problems, supporting customers and improving complex systems. The position requires expertise in distributed systems, monitoring, and incident response. You'll work on building, monitoring, and maintaining systems that ensure customers can quickly access their data and run workloads whenever needed.
The role involves responding to customer escalations, identifying service problems, and implementing solutions. You'll collaborate closely with product engineering teams, participate in code reviews, and join on-call rotations. The work directly impacts the success of many Microsoft services, making it a critical position within the organization.
The ideal candidate will have 1-3 years of technical experience or relevant education, with a strong interest in reliability engineering. This position offers the opportunity to work with cutting-edge technology at scale, while contributing to services used by millions of customers worldwide.
Microsoft offers a comprehensive benefits package, including industry-leading healthcare, educational resources, and generous time off. The position is based in Dublin, Ireland, with a hybrid work arrangement allowing up to 50% work from home. This is an excellent opportunity for someone passionate about reliability engineering to join a global technology leader and make a significant impact.