Taro Logo

Software Engineer, Data Infrastructure & Acquisition

A text-to-speech technology company that helps over 50 million people turn text into audio for faster reading and better comprehension.
Data
Senior Software Engineer
Remote
101 - 500 Employees
5+ years of experience
AI · Education

Job Description

Speechify, a leading text-to-speech technology company serving over 50 million users, is seeking a Senior Software Engineer specializing in Data Infrastructure & Acquisition for their Seoul, South Korea location. This role is part of their 100% distributed team of nearly 200 global professionals.

The position sits within the AI team, focusing on all aspects of data collection to support model training operations. The team manages petabyte-scale datasets through an integrated approach combining infrastructure, engineering, and research work. This is a critical role that will directly impact Speechify's ability to deliver high-quality text-to-speech solutions.

The ideal candidate should have at least 5 years of industry experience and strong technical skills in Python, Linux environments, and cloud infrastructure (particularly GCP). They'll be responsible for discovering and integrating new audio data sources, managing cloud infrastructure using Terraform, and collaborating with scientists to optimize data pipeline efficiency.

What makes this opportunity unique is the chance to work on technology that genuinely impacts people's lives, particularly those with learning differences like dyslexia, ADD, and visual impairments. Speechify has received notable recognition, including being named Chrome Extension of the Year by Google and winning Apple's 2025 Design Award for Inclusivity.

The company offers a distinctive work environment that values autonomy, entrepreneurial thinking, and rapid iteration. Team members include talented professionals from leading companies like Amazon, Microsoft, Google, and high-growth startups, as well as graduates from top PhD programs. The flat organizational structure allows for leadership opportunities based on technical excellence and consistent delivery.

Benefits include competitive compensation, complete work location flexibility in a 100% distributed setting, and the satisfaction of building products that millions use daily. The role offers exposure to cutting-edge developments in AI and audio technology, making it an excellent opportunity for someone who wants to make a significant impact in a transformative industry.

Last updated a month ago

Responsibilities For Software Engineer, Data Infrastructure & Acquisition

  • Find new sources of audio data and bring it into our ingestion pipeline
  • Operate and extend the cloud infrastructure for our ingestion pipeline on GCP
  • Collaborate with Scientists to improve cost/throughput/quality
  • Collaborate with AI Team and Leadership on dataset roadmap
  • Support next-generation consumer and enterprise products

Requirements For Software Engineer, Data Infrastructure & Acquisition

Python
Linux
  • BS/MS/PhD in Computer Science or a related field
  • 5+ years of industry experience in software development
  • Proficiency with bash/Python scripting in Linux environments
  • Proficiency in Docker and Infrastructure-as-Code concepts
  • Professional experience with at least one major Cloud Provider (GCP)
  • Strong communication skills, both written and verbal
  • Ability to handle multiple tasks and adapt to changing priorities

Benefits For Software Engineer, Data Infrastructure & Acquisition

  • Competitive salaries
  • Fast-growing environment with opportunity to shape company and product
  • Hands-off management approach
  • Friendly and laid-back atmosphere
  • Opportunity to work on life-changing product
  • Work in fast-growing AI and audio sector
  • 100% distributed work environment