Taro Logo

Senior Software Engineer - Distributed Data Systems

Databricks is the data and AI company that enables organizations to unify and democratize data, analytics and AI, serving over 10,000 organizations worldwide including 50% of Fortune 500.
$166,000 - $225,000
Senior Software Engineer
In-Person
1,000 - 5,000 Employees
5+ years of experience
AI · Enterprise SaaS · Data

Job Description

At Databricks, we are seeking a Senior Software Engineer to join our Runtime team, focusing on building next-generation distributed data storage and processing systems. This role is at the forefront of developing technology that enables data teams to solve the world's toughest problems.

The position involves working on critical projects including Apache Spark™, the industry standard for big data processing, Delta Lake, a sophisticated storage management system, and Delta Pipelines for orchestrating thousands of data pipelines. You'll be developing systems that combine high-performance query capabilities with the flexibility to support diverse workloads from ETL to data science.

As a senior engineer, you'll be tackling challenges in distributed systems, working with cloud storage backends like AWS S3 and Azure Blob Store, and building high-performance query optimizers. The role requires expertise in Java, Scala, or C++, along with a strong foundation in algorithms and data structures.

Databricks offers a competitive compensation package ranging from $166,000 to $225,000 USD, along with comprehensive benefits including medical, dental, and vision insurance, 401k, and equity. The company serves over 10,000 organizations worldwide, including 50% of Fortune 500 companies, making it an excellent opportunity to work on technology that impacts major enterprises globally.

The position is based in Mountain View, California, where you'll be part of a team that's passionate about enabling data teams to solve complex problems through innovative technology. This role offers the chance to work on cutting-edge distributed systems while contributing to open-source projects that are shaping the future of big data processing.

Last updated 2 days ago

Responsibilities For Senior Software Engineer - Distributed Data Systems

  • Build next generation distributed data storage and processing systems
  • Develop Apache Spark™ framework
  • Work on Data Plane Storage services and client libraries
  • Develop Delta Lake storage management system
  • Build Delta Pipelines for orchestrating data engineering pipelines
  • Performance engineering for query optimizer and execution engine

Requirements For Senior Software Engineer - Distributed Data Systems

Java
Scala
  • BS (or higher) in Computer Science, related technical field or equivalent practical experience
  • Comfortable working towards a multi-year vision with incremental deliverables
  • Motivated by delivering customer value and impact
  • 5+ years of production level experience in either Java, Scala or C++
  • Strong foundation in algorithms and data structures and their real-world use cases
  • Experience with distributed systems, databases, and big data systems (Apache Spark, Hadoop)

Benefits For Senior Software Engineer - Distributed Data Systems

Medical Insurance
Dental Insurance
Vision Insurance
401k
Equity
  • Medical Insurance
  • Dental Insurance
  • Vision Insurance
  • 401k
  • Equity