Taro Logo

Junior Data Engineer (PySpark) - E-Learning

Leading provider of nearshore staff augmentation services headquartered in New York, delivering top-tier technology solutions for over two decades.
Data
Entry-Level Software Engineer
Remote
501 - 1,000 Employees
1+ year of experience
Education · Enterprise SaaS
This job posting may no longer be active. You may be interested in these related jobs instead:

Description For Junior Data Engineer (PySpark) - E-Learning

Truelogic, a New York-based leader in nearshore staff augmentation, is seeking a Junior to Semi-Senior PySpark Data Engineer for their E-Learning initiative. With over 600 tech professionals across Latin America, we're looking for talented individuals to join our dynamic team.

The role focuses on developing high-performance, scalable data pipelines using PySpark and Apache Spark. You'll be working in a distributed computing environment, handling big data processing, and collaborating with cross-functional teams. This position is perfect for someone early in their career who wants to grow their technical expertise while working on impactful projects.

The ideal candidate should have 1-3 years of experience with PySpark and big data technologies, strong SQL skills, and knowledge of various databases and cloud storage solutions. You'll be working with cutting-edge technologies and will have the opportunity to contribute to innovative solutions that drive efficiency.

We offer an excellent remote work environment, competitive USD compensation, and comprehensive benefits. You'll join a global network of over 600 professionals across 25+ countries, working with top American companies on transformative projects. Our culture emphasizes work-life balance, continuous learning, and professional growth.

If you're passionate about data engineering, eager to learn, and want to work in a collaborative, multicultural environment while building cutting-edge solutions, this role presents an excellent opportunity for career growth and development.

Last updated 2 months ago

Responsibilities For Junior Data Engineer (PySpark) - E-Learning

  • Design, develop, and optimize data pipelines using PySpark and Apache Spark
  • Integrate and process data from multiple sources (databases, APIs, files, streaming)
  • Implement efficient data transformations for Big Data in distributed environments
  • Optimize code to improve performance, scalability, and efficiency in data processing
  • Collaborate with Data Science, BI, and DevOps teams to ensure seamless integration
  • Monitor and debug data processes to ensure quality and reliability
  • Apply best practices in data engineering and maintain clear documentation
  • Stay up to date with the latest trends in Big Data and distributed computing

Requirements For Junior Data Engineer (PySpark) - E-Learning

Python
MongoDB
MySQL
PostgreSQL
  • 1-3 years of experience working with PySpark and Apache Spark in Big Data environments
  • Experience with SQL and relational and NoSQL databases (PostgreSQL, MySQL, MongoDB, etc.)
  • Knowledge of ETL processes and data processing in distributed environments
  • Familiarity with Apache Hadoop, Hive, or Delta Lake
  • Experience with cloud storage (AWS S3, Google Cloud Storage, Azure Blob)
  • Proficiency in Git and version control
  • Strong problem-solving skills and a proactive attitude
  • A passion for learning and continuous improvement

Benefits For Junior Data Engineer (PySpark) - E-Learning

  • 100% Remote Work
  • Highly Competitive USD Pay
  • Paid Time Off
  • Work with Autonomy
  • Work with Top American Companies

Interested in this job?