Senior Software Engineer - Distributed Data Systems

Databricks is the data and AI company that enables organizations to unify and democratize data, analytics, and AI, serving over 10,000 organizations worldwide, including 50% of the Fortune 500.
$166,000 - $225,000
Senior Software Engineer
In-Person
1,000 - 5,000 Employees
5+ years of experience
AI · Enterprise SaaS · Data

Description For Senior Software Engineer - Distributed Data Systems

At Databricks, we are seeking a Senior Software Engineer to join our Runtime team, focusing on building next-generation distributed data storage and processing systems. This role is at the forefront of developing technology that enables data teams to solve the world's toughest problems.

The position involves working on critical projects: Apache Spark™, the industry standard for big data processing; Data Plane Storage, for managing cloud storage backends; Delta Lake, for reliable data management; and Delta Pipelines, for orchestrating thousands of data pipelines. You'll be building systems that combine high-performance query capabilities with the flexibility to support diverse workloads, from ETL to data science.

As a senior engineer, you'll be working on sophisticated challenges like developing storage management systems with ACID transactions, building high-performance query optimizers, and creating scalable execution engines. The role requires strong expertise in distributed systems, databases, and big data technologies, with at least 5 years of production experience in Java, Scala, or C++.

The position offers competitive compensation ($166,000 - $225,000) and comprehensive benefits. You'll be joining a team of passionate engineers at a company that serves over 10,000 organizations worldwide, including 50% of Fortune 500 companies. This is an opportunity to work on technology that's fundamentally changing how organizations handle data and AI, while solving complex technical challenges in distributed systems and data processing.

The ideal candidate will have a strong computer science foundation, experience with distributed systems, and the ability to work towards long-term technical visions while delivering incremental value. You'll be part of a team that values customer impact, technical excellence, and innovation in data infrastructure.

Last updated 19 days ago

Responsibilities For Senior Software Engineer - Distributed Data Systems

  • Build next-generation distributed data storage and processing systems
  • Develop the Apache Spark™ framework
  • Work on Data Plane Storage services and client libraries
  • Contribute to the Delta Lake storage management system
  • Develop Delta Pipelines for orchestrating data engineering pipelines
  • Build a next-generation query optimizer and execution engine

Requirements For Senior Software Engineer - Distributed Data Systems

  • BS (or higher) in Computer Science or a related technical field, or equivalent practical experience
  • Comfortable working towards a multi-year vision with incremental deliverables
  • Motivated by delivering customer value and impact
  • 5+ years of production-level experience in Java, Scala, or C++
  • Strong foundation in algorithms and data structures and their real-world use cases
  • Experience with distributed systems, databases, and big data systems (Apache Spark™, Hadoop)

Benefits For Senior Software Engineer - Distributed Data Systems

  • Comprehensive benefits package
  • Medical Insurance
  • Dental Insurance
  • Vision Insurance
