Scribd, a leading platform in digital content sharing, is seeking a Software Engineer II specializing in Python and Data pipelines to join their ML Data Engineering team. This role is central to managing and processing hundreds of millions of documents and billions of images across their platforms including Everand, Scribd, and Slideshare. The position offers a unique opportunity to work with diverse datasets including UGC documents, ebooks, and audiobooks at an unprecedented scale. The tech stack includes Python, Scala, Ruby on Rails, Airflow, Databricks, Spark, and various AWS services. The role combines technical expertise in data engineering with the challenge of building robust systems for content discovery and metadata management. The company offers a flexible work environment through their Scribd Flex program, competitive compensation including equity, and comprehensive benefits. They foster a culture of curiosity, boldness, and customer-first thinking, making it an ideal place for engineers passionate about working with data at scale.