Taro Logo

Senior Software Engineer - Distributed Data Systems

Databricks is the data and AI company that enables organizations to unify and democratize data, analytics and AI, serving over 10,000 organizations worldwide including 50% of Fortune 500 companies.
$166,000 - $225,000
Senior Software Engineer
In-Person
5,000+ Employees
5+ years of experience
AI · Enterprise SaaS

Description For Senior Software Engineer - Distributed Data Systems

Databricks is seeking a Senior Software Engineer to join their Runtime team, focusing on distributed data systems. This role is at the forefront of building next-generation distributed data storage and processing systems that power modern data analysis and machine learning workloads.

The position involves working on critical projects including Apache Spark™, the industry-standard framework for big data processing, Delta Lake storage management system, and Data Plane Storage services. You'll be responsible for developing systems that can outperform specialized SQL query engines while providing sophisticated programming abstractions for diverse workloads from ETL to data science.

As a senior engineer, you'll contribute to significant projects like Delta Pipelines, which aims to simplify the orchestration and operation of thousands of data pipelines. You'll also work on performance engineering, building next-generation query optimizers and execution engines that are fast, scalable, and robust.

The role requires strong technical expertise in distributed systems, with at least 5 years of production-level experience in Java, Scala, or C++. You should have a solid foundation in algorithms and data structures, along with experience in big data systems like Apache Spark™ and Hadoop.

Databricks offers a competitive compensation package ranging from $166,000 to $225,000 USD, along with equity, performance bonus eligibility, and comprehensive benefits. The company serves over 10,000 organizations worldwide, including 50% of Fortune 500 companies, making this an opportunity to work on technology that powers some of the world's largest data and AI initiatives.

The position is based in San Francisco, California, where you'll join a team of passionate engineers working to solve complex technical challenges in data infrastructure. This is an excellent opportunity for someone who is motivated by delivering customer value and impact, and comfortable working towards multi-year visions with incremental deliverables.

Last updated 3 days ago

Responsibilities For Senior Software Engineer - Distributed Data Systems

  • Build next generation distributed data storage and processing systems
  • Develop Apache Spark™ framework
  • Work on Data Plane Storage services and client libraries
  • Contribute to Delta Lake storage management system
  • Build Delta Pipelines for orchestrating data engineering pipelines
  • Performance engineering for query optimization and execution engine

Requirements For Senior Software Engineer - Distributed Data Systems

Java
Scala
  • BS (or higher) in Computer Science, related technical field or equivalent practical experience
  • Comfortable working towards a multi-year vision with incremental deliverables
  • Motivated by delivering customer value and impact
  • 5+ years of production level experience in either Java, Scala or C++
  • Strong foundation in algorithms and data structures and their real-world use cases
  • Experience with distributed systems, databases, and big data systems (Apache Spark™, Hadoop)

Benefits For Senior Software Engineer - Distributed Data Systems

Medical Insurance
Dental Insurance
Vision Insurance
  • Comprehensive benefits package
  • Performance bonus eligibility
  • Equity

Interested in this job?

Jobs Related To Databricks Senior Software Engineer - Distributed Data Systems