Senior Software Engineer, Data

A mission-driven organization focused on advancing scientific discovery through large-scale data engineering and AI research.
Washington, USA
$146,880 - $220,320
Data
Senior Software Engineer
Remote
8+ years of experience
AI · Education

Job Description

Join the Allen Institute for AI (AI2), a pioneering organization dedicated to advancing scientific discovery through cutting-edge data engineering and artificial intelligence. As a Senior Software Engineer specializing in Data, you'll be at the forefront of developing sophisticated data pipelines and machine learning services that process and analyze vast patent and academic datasets.

Your role will be crucial in building tools that millions of researchers worldwide use, directly contributing to the future of open science and innovation. You'll work with state-of-the-art technologies including Airflow for pipeline orchestration, machine learning models for entity disambiguation and classification, and modern data engineering practices.

The position offers an excellent opportunity to work on meaningful projects that bridge the gap between academic research and practical applications. You'll be responsible for developing and maintaining scalable data pipelines, implementing machine learning models, and creating robust APIs that provide structured access to valuable research data.

Working in a collaborative, mission-driven environment, you'll have the chance to tackle complex technical challenges while maintaining a focus on data quality, scalability, and long-term maintainability. The role combines technical expertise in data engineering with the opportunity to contribute to scientific advancement.

The position comes with competitive compensation, comprehensive benefits, and the flexibility of remote work within the U.S. You'll be part of a supportive culture that values work-life balance, personal growth, and inclusive practices. If you're passionate about using your technical skills to make a meaningful impact on scientific research and innovation, this role offers an exceptional opportunity to do so while working with cutting-edge technologies and brilliant colleagues.

Last updated 7 days ago

Responsibilities For Senior Software Engineer, Data

  • Build and maintain scalable data pipelines using Airflow for integrating complex corpora and resolving citations
  • Develop and deploy lightweight machine learning models to disambiguate inventors/authors and classify patents
  • Train or adapt topic models to label patents using various text sources
  • Extend and maintain REST APIs to provide structured access to linked metadata and classification results
  • Create dashboards and internal tools to evaluate data quality and model performance
  • Collaborate with other engineers to ensure strong testing practices, documentation, and operational stability
  • Contribute to architecture and design discussions

Requirements For Senior Software Engineer, Data

  • Bachelor's degree and 8+ years of relevant technical experience
  • Expertise in Python for data engineering, including pipeline development and automation
  • Proficiency in SQL and production-grade schema design (PostgreSQL preferred)
  • Hands-on experience with ML pipelines: training, fine-tuning, and inference for structured data
  • Strong familiarity with structured data formats (JSON, XML, Parquet) and ETL practices
  • Experience with Airflow or similar workflow orchestration tools, AWS, and container technologies such as Docker
  • Strong ownership mindset and communication skills

Benefits For Senior Software Engineer, Data

  • Base salary range: $146,880 – $220,320, with additional performance-based annual bonuses
  • Comprehensive medical, dental, and vision insurance for you and your family
  • Flexible spending accounts (FSA), HSA, and HRA plans
  • 401(k) retirement plan with employer contributions
  • Monthly stipends: $125 for internet/commuting and $200 for fitness/wellbeing
  • Up to 20 vacation days, 7 personal days, 10 sick days, and 12 paid holidays annually
  • Remote work flexibility (within the U.S.)
  • Supportive work environment emphasizing work-life balance
