Taro Logo

Sr. Software Dev Engineer, Infrastructure Reliability Engineering

Global technology and e-commerce company leading in cloud computing, digital streaming, and artificial intelligence.
DevOps
Senior Software Engineer
In-Person
5,000+ Employees
5+ years of experience
Enterprise SaaS
This job posting is no longer active. Check out these related jobs instead:

Job Description

Join Amazon's Infrastructure Reliability Engineering team as a Senior Software Development Engineer to make a significant impact on reducing MTTR and improving service availability across Fulfillment Technology and Robotics. This role focuses on innovating in global network device monitoring, telemetry collection, and pioneering new relational monitoring tools. You'll be leveraging AI to build automatic detection and remediation solutions, reducing human intervention during high-severity events.

The position offers extensive scope to work across organizations and influence the technical direction of multiple teams. You'll be responsible for architecting and developing systems that monitor and maintain service health at scale, spanning thousands of global sites. The role combines cutting-edge technology with practical problem-solving, as you'll work with diverse telemetry sources including software applications, AWS services, network paths, and device fleets.

As part of Amazon's Infrastructure Reliability Engineering team, a global organization with presence in both the USA and Europe, you'll collaborate with talented engineers worldwide. The team is dedicated to building tools that improve the availability of network and service infrastructure across Amazon's global fulfillment network.

This role offers comprehensive benefits including medical, dental, and vision coverage, parental leave options, PTO, and a 401(k) plan. It's an excellent opportunity for experienced engineers who want to make a lasting impact on global infrastructure reliability while working with cutting-edge technology and leading teams.

Last updated a month ago

Responsibilities For Sr. Software Dev Engineer, Infrastructure Reliability Engineering

  • Lead architectural design and development of new and existing applications
  • Design and implement systems that detect, correlate, and visualize service health at scale across thousands of global sites
  • Drive the development of AI-powered solutions for automatic incident detection and remediation
  • Create and enhance automation systems for incident response and management
  • Architect solutions that integrate telemetry from diverse sources
  • Build self-service integration capabilities for service owners
  • Develop APIs and interfaces that enable automated decision-making for deployments and changes
  • Mentor junior engineers and foster a culture of engineering excellence

Requirements For Sr. Software Dev Engineer, Infrastructure Reliability Engineering

Linux
  • 5+ years of non-internship professional software development experience
  • 5+ years of programming with at least one software programming language experience
  • 5+ years of leading design or architecture of new and existing systems experience
  • Experience as a mentor, tech lead or leading an engineering team

Benefits For Sr. Software Dev Engineer, Infrastructure Reliability Engineering

Medical Insurance
Dental Insurance
Vision Insurance
401k
Parental Leave
  • Medical, Dental, and Vision Coverage
  • Maternity and Parental Leave Options
  • Paid Time Off (PTO)
  • 401(k) Plan