Taro Logo

Senior Data Platform Engineer

Taro Verified
This job posting is no longer active. Check out these related jobs instead:

Crustdata

Crustdata provides live company and people data via APIs and full dataset delivery. We make hard to get data easy to use at scale. We have developed technology that allows us to pipe in live data from over a dozen different data sources and deliver this data instantly to our customers.
San Francisco, CA, USA
$140,000 - $200,000
Data
Senior
In-person
11-50 Employees
1+ year of experience

Job Description

Crustdata, a Y Combinator-backed company, is building the gateway to the internet for AI agents by creating APIs for accessing real-time data from sources of truth. They serve dozens of enterprise customers, are profitable, and growing rapidly with backing from top Silicon Valley investors.

They are seeking a Senior Data Platform Engineer to be a foundational member of their engineering team. This role involves owning the design, creation, and evolution of Crustdata’s data platform, including data ingestion and management infrastructure. Responsibilities include architecting and building the core data infrastructure (data warehouse and data lake) using cloud technologies (AWS, GCP, or Azure). The engineer will develop and scale robust data pipelines (ETL/ELT), support data scientists and ML engineers, implement workflow orchestration for daily data jobs using tools like Airflow, and build real-time data streaming pipelines using technologies like Kafka. The engineer will also champion data quality and governance.

This is a great opportunity to work directly with the founders on a new core product and influence customers directly in a profitable, fast-growing company.


Responsibilities

  • Design, build, and maintain core data infrastructure, including data warehouse and data lake, using modern cloud technologies (AWS, GCP, or Azure)
  • Develop and scale robust, fault-tolerant data pipelines (ETL/ELT) to ingest and process massive volumes of structured and unstructured data from diverse sources
  • Create the foundational platform to support data scientists and ML engineers. This includes building systems for feature engineering, model training, and deploying ML models into production
  • Implement and manage workflow orchestration for hundreds of daily data jobs, ensuring reliability, monitorability, and efficiency using tools like Airflow, Dagster, or Prefect
  • Build and manage real-time data streaming pipelines using technologies like Kafka or Flink to power live dashboards and time-sensitive product features
  • Implement frameworks for data validation, testing, and monitoring to ensure our data is accurate and trustworthy

Requirements

Kafka
  • Relevant professional experience

Visa Policy

Crustdata isn't offering any visa sponsorship support at this time.


Benefits

Equity
  • Competitive salary
  • Equity package
  • Work directly with founders