Taro Logo

Principal Data Engineer

A startup studio founded in 2021 building industry-defining companies, with teams distributed throughout the US and Latin America using cutting-edge cloud computing technologies.
Data
Principal Software Engineer
Remote
11 - 50 Employees
6+ years of experience
Enterprise SaaS

Job Description

RYZ Labs, a dynamic startup studio founded in 2021, is seeking a Principal Data Engineer to join their distributed team across the US and Latin America. This role represents an exciting opportunity to design and implement modern data platforms from the ground up, combining traditional data engineering with robust software development practices.

The ideal candidate will be responsible for building end-to-end data solutions, from ingestion pipelines to cloud infrastructure, while maintaining high-quality, production-grade code. This position requires someone who excels in high-pressure environments and values direct communication and craftsmanship, even when facing ambiguous requirements.

As a Principal Data Engineer, you'll work with cutting-edge technologies, including Python, SQL, and Spark/Databricks, while having the autonomy to own your development process. The role involves designing and maintaining both batch and streaming data pipelines, implementing cloud resources through Infrastructure as Code, and ensuring robust data quality and governance practices.

RYZ Labs offers an environment focused on learning, growth, and challenging projects. The company's values emphasize customer-first mentality, bias for action, ownership, humility, and continuous improvement. You'll be part of a team building industry-defining companies in a post-pandemic world, working with the latest cloud computing technologies to create scalable and resilient applications.

The position requires at least 6 years of data engineering experience, with proven expertise in Databricks production environments. You'll need to demonstrate mastery in Python, SQL, and Spark, along with strong knowledge of cloud infrastructure and monitoring tools. This role is perfect for someone who wants to make a significant impact while working with a team of accomplished professionals in a remote-first environment.

Last updated 3 days ago

Responsibilities For Principal Data Engineer

  • Design & build batch and streaming data pipelines in Python, SQL, and Spark/Databricks
  • Provision and evolve cloud resources via Infrastructure as Code
  • Enforce version control, automated testing, CI/CD, and code reviews
  • Produce architectural diagrams, ADRs, and technical design documents
  • Monitor pipelines, build dashboards, and tune performance
  • Implement data validation, lineage, and observability
  • Collaborate with product, data science, and platform teams

Requirements For Principal Data Engineer

Python
  • ≥ 6 years data engineering experience and proven Databricks production use
  • Databricks certified data engineer
  • Cloud specific data engineering certifications
  • Python mastery - OOP, type hints, packaging, and performance tuning
  • Spark mastery - Performance tuning and data engineering optimizations
  • SQL mastery - Expert SQL and data-model design
  • Infrastructure-as-Code (Terraform, CDK, or ARM) experience
  • Knowledge of observability/monitoring tools
  • Experience with data quality and integrity pipelines
  • Technical documentation writing skills

Related Jobs