Taro Logo

Staff Software Engineer - Observability Platform

Databricks is the data and AI company that enables data teams to solve the world's toughest problems by building and running the world's best data and AI infrastructure platform.
$190,900 - $253,750
Backend
Staff Software Engineer
In-Person
1,000 - 5,000 Employees
7+ years of experience
AI · Enterprise SaaS

Description For Staff Software Engineer - Observability Platform

At Databricks, we are at the forefront of enabling data teams to tackle the world's most challenging problems. As a Staff Software Engineer in the Observability Platform team, you'll be working on one of the largest-scale software platforms, managing millions of virtual machines that generate terabytes of logs and process exabytes of data daily.

The role involves developing cutting-edge observability solutions that provide crucial insights into the health and performance of Databricks' products and infrastructure. You'll be responsible for building next-generation platforms handling billions of active time series and processing petabytes of logs daily, while managing infrastructure across nearly a hundred cloud regions.

Your impact will be significant as you develop advanced workflows that accelerate incident diagnosis, leverage Databricks' data intelligence platform, and set industry standards for troubleshooting practices. You'll also play a crucial role in upleveling monitoring and reliability practices across Databricks engineering, developing opinionated tools for managing structured logs, metrics, alerts, dashboards, and oncall rotations.

The ideal candidate brings 7+ years of production-level experience in languages like Go, Python, Java, Scala, or Rust, along with deep expertise in large-scale distributed systems and cloud technologies. You'll work with cutting-edge cloud technologies across AWS, Azure, and GCP, while having the opportunity to mentor and uplevel other engineers.

At Databricks, you'll be part of a team that's passionate about technical excellence and innovation, working on solutions that directly impact the reliability and performance of one of the most sophisticated data and AI platforms in the industry. The role offers competitive compensation, comprehensive benefits, and the opportunity to work on challenging technical problems at scale.

Last updated 10 days ago

Responsibilities For Staff Software Engineer - Observability Platform

  • Build next generation observability platforms supporting billions of active time series and processing petabytes of logs daily
  • Manage infrastructure across nearly a hundred cloud regions
  • Develop advanced workflows for incident diagnosis
  • Uplevel monitoring and reliability practices across Databricks engineering
  • Mentor and uplevel engineers, fostering technical excellence

Requirements For Staff Software Engineer - Observability Platform

Go
Python
Java
Scala
Rust
Kubernetes
  • BS (or higher) in Computer Science, or a related field
  • 7+ years of production-level experience in one of: Go, Python, Java, Scala, Rust, C++, or similar languages
  • Experience in software development, in large-scale distributed systems
  • Experience driving large projects involving multiple teams
  • Experience with cloud technologies, e.g. AWS, Azure, GCP, Docker, or Kubernetes
  • Familiarity with observability infrastructure, monitoring patterns, and reliability practices

Benefits For Staff Software Engineer - Observability Platform

Medical Insurance
Dental Insurance
Vision Insurance
  • Comprehensive benefits package
  • Performance bonus eligibility
  • Equity

Interested in this job?

Jobs Related To Databricks Staff Software Engineer - Observability Platform