Senior Site Reliability Engineer

Microsoft is a global technology company that empowers every person and organization on the planet to achieve more.
Site Reliability
Senior Software Engineer
Hybrid
5,000+ Employees
8+ years of experience
Enterprise SaaS

Description For Senior Site Reliability Engineer

Microsoft's Surface Team is seeking a Senior Site Reliability Engineer to enhance enterprise customer experience in managing Surface devices. This role focuses on building and maintaining critical infrastructure for commercial customers, including online portals, backend APIs, and microservices.

The position offers an exciting opportunity to work on cutting-edge devices while creating solutions that leverage AI and Copilots to enhance productivity. You'll be responsible for designing and deploying reliable distributed platforms that empower commercial customers to self-serve, manage, and monitor Surface devices at scale.

As an SRE, you'll champion DevOps practices, ensure system reliability, and drive operational excellence. The role involves owning application uptime, driving incident response, developing automation, and partnering with various teams to design scalable systems. You'll also have the opportunity to mentor junior engineers and shape the team's technical direction.

Microsoft offers comprehensive benefits including industry-leading healthcare, educational resources, investment options, and generous parental leave. The position is hybrid, allowing up to 50% work from home, with 0-25% travel required. The company's mission to empower every person and organization on the planet provides a meaningful context for your work.

The ideal candidate brings 8+ years of technical experience, strong expertise in cloud services and distributed systems, and excellent communication skills. This is a chance to make a significant impact on Microsoft's device management capabilities while working with cutting-edge technology and talented colleagues.

Last updated 4 days ago

Responsibilities For Senior Site Reliability Engineer

  • Champion and implement DevOps and Site Reliability Engineering best practices
  • Own the uptime and performance of applications built on Azure Containers, APIs, and modern UI frameworks
  • Drive incident response, root cause analysis, and postmortem processes
  • Develop and maintain automation for deployment, monitoring, alerting, and self-healing systems
  • Partner closely with software engineering and product owners to design scalable and fault-tolerant systems
  • Monitor system performance and plan for future growth
  • Ensure systems are secure, compliant, and aligned with Microsoft's security standards
  • Guide and mentor junior engineers

Requirements For Senior Site Reliability Engineer

  • Bachelor's or Master's degree in Computer Science or another engineering field
  • 8+ years of technical experience in software engineering and DevOps
  • 4+ years of software development experience with C#, Web APIs, Cosmos, SQL Azure, and Microsoft Fabric
  • Experience developing monitoring and telemetry tools, containers (Azure Kubernetes Service), and CI/CD pipelines
  • Excellent technical design, problem-solving, and debugging skills
  • Excellent leadership, communication, teamwork and collaboration skills
  • Experience building dashboards, performing code analysis, and applying secure development practices

Benefits For Senior Site Reliability Engineer

Medical Insurance
Dental Insurance
Vision Insurance
Parental Leave
Education Budget
401k
  • Industry leading healthcare
  • Educational resources
  • Discounts on products and services
  • Savings and investments
  • Maternity and paternity leave
  • Generous time away
  • Giving programs
  • Opportunities to network and connect
