Senior Software Engineer - Distributed Data Systems

Databricks

Databricks is the data and AI company that enables organizations to unify and democratize data, analytics, and AI, serving over 10,000 organizations worldwide, including 50% of the Fortune 500.

San Francisco, CA, USA

$166,000 - $225,000

Senior Software Engineer

In-Person

1,000 - 5,000 Employees

5+ years of experience

AI · Enterprise SaaS

Job Description

Databricks is seeking a Senior Software Engineer to join their Runtime team, focusing on distributed data systems. This role is at the forefront of building next-generation distributed data storage and processing systems that combine high performance with versatile programming abstractions.

The position involves working on several critical projects including Apache Spark™, the industry-standard framework for big data processing, Data Plane Storage for managing cloud storage backends, Delta Lake for reliable data management, and Delta Pipelines for orchestrating data engineering workflows. You'll be instrumental in developing systems that outperform traditional SQL query engines while supporting diverse workloads from ETL to data science.

As a senior engineer, you'll be working on sophisticated technical challenges, from optimizing query performance to designing scalable storage solutions. The role requires strong expertise in distributed systems, with hands-on experience in Java, Scala, or C++. You'll be part of a team that's passionate about enabling data teams to solve the world's toughest problems, from revolutionizing transportation to accelerating medical breakthroughs.

The position offers competitive compensation ($166,000 - $225,000) and comprehensive benefits. You'll be working at Databricks' San Francisco office, contributing to a company that serves over 10,000 organizations worldwide, including 50% of Fortune 500 companies. This is an opportunity to shape the future of data infrastructure while working with cutting-edge technologies and solving complex technical challenges at scale.

The ideal candidate should have 5+ years of production-level experience, strong algorithmic foundations, and a track record of delivering customer value. You'll be working towards multi-year visions while delivering incremental value, making this role perfect for someone who combines technical excellence with strategic thinking and customer focus.

Last updated 2 days ago

Responsibilities For Senior Software Engineer - Distributed Data Systems

Build next-generation distributed data storage and processing systems
Develop the Apache Spark™ framework
Work on Data Plane Storage services and client libraries
Contribute to the Delta Lake storage management system
Develop Delta Pipelines for orchestrating data engineering pipelines
Build a next-generation query optimizer and execution engine

Requirements For Senior Software Engineer - Distributed Data Systems

Java

Scala

BS (or higher) in Computer Science or a related technical field, or equivalent practical experience
Comfortable working towards a multi-year vision with incremental deliverables
Motivated by delivering customer value and impact
5+ years of production-level experience in Java, Scala, or C++
Strong foundation in algorithms and data structures and their real-world use cases
Experience with distributed systems, databases, and big data systems (Apache Spark™, Hadoop)

Benefits For Senior Software Engineer - Distributed Data Systems

Medical Insurance

Dental Insurance

Vision Insurance
