Datadog, a leading global SaaS company, is seeking a Software Engineer to join its Incident Management SRE team. This role suits engineers who are passionate about building resilient systems and fostering a culture of continuous learning through incident response and analysis.
The position offers an opportunity to work at significant scale: processing trillions of data points daily while serving tens of thousands of companies. As part of the Incident Management SRE team, you'll play a crucial role in enhancing the company's incident response capabilities and on-call experience. The role combines technical expertise in Go, Python, and distributed systems with the soft skills needed to facilitate cross-team collaboration and learning.
Your responsibilities will range from developing software platforms that support on-call rotations to leading post-mortem processes and training other engineers. The ideal candidate brings at least 3 years of software engineering experience, along with a strong background in incident response and distributed systems. You'll work in a hybrid environment that values both in-office collaboration and flexible work arrangements.
The compensation package is highly competitive, ranging from $130,000 to $300,000 USD, complemented by comprehensive benefits including equity grants, healthcare coverage, and professional development opportunities. Datadog's culture emphasizes pragmatism, honesty, and simplicity in solving complex problems, making it an ideal environment for engineers who want to make a significant impact while growing their careers.
This role offers the chance to work with cutting-edge technologies while helping shape how one of the fastest-growing observability platforms handles incidents and maintains reliability. If you're passionate about both technical excellence and teaching others, and want to be part of a company that values continuous learning and improvement, this position at Datadog could be your next career move.