Junior Data Engineer (PySpark) - E-Learning

Truelogic

Leading provider of nearshore staff augmentation services headquartered in New York, delivering top-tier technology solutions for over two decades.

Data

Entry-Level Software Engineer

Remote

501 - 1,000 Employees

1+ year of experience

Education · Enterprise SaaS

This job posting may no longer be active. You may be interested in these related jobs instead:

Description For Junior Data Engineer (PySpark) - E-Learning

Truelogic, a New York-based leader in nearshore staff augmentation, is seeking a Junior to Semi-Senior PySpark Data Engineer for their E-Learning initiative. With over 600 tech professionals across Latin America, we're looking for talented individuals to join our dynamic team.

The role focuses on developing high-performance, scalable data pipelines using PySpark and Apache Spark. You'll be working in a distributed computing environment, handling big data processing, and collaborating with cross-functional teams. This position is perfect for someone early in their career who wants to grow their technical expertise while working on impactful projects.

The ideal candidate should have 1-3 years of experience with PySpark and big data technologies, strong SQL skills, and knowledge of various databases and cloud storage solutions. You'll be working with cutting-edge technologies and will have the opportunity to contribute to innovative solutions that drive efficiency.

We offer an excellent remote work environment, competitive USD compensation, and comprehensive benefits. You'll join a global network of over 600 professionals across 25+ countries, working with top American companies on transformative projects. Our culture emphasizes work-life balance, continuous learning, and professional growth.

If you're passionate about data engineering, eager to learn, and want to work in a collaborative, multicultural environment while building cutting-edge solutions, this role presents an excellent opportunity for career growth and development.

Last updated 2 months ago

Responsibilities For Junior Data Engineer (PySpark) - E-Learning

Design, develop, and optimize data pipelines using PySpark and Apache Spark
Integrate and process data from multiple sources (databases, APIs, files, streaming)
Implement efficient data transformations for Big Data in distributed environments
Optimize code to improve performance, scalability, and efficiency in data processing
Collaborate with Data Science, BI, and DevOps teams to ensure seamless integration
Monitor and debug data processes to ensure quality and reliability
Apply best practices in data engineering and maintain clear documentation
Stay up to date with the latest trends in Big Data and distributed computing

Requirements For Junior Data Engineer (PySpark) - E-Learning

Python

MongoDB

MySQL

PostgreSQL

1-3 years of experience working with PySpark and Apache Spark in Big Data environments
Experience with SQL and relational and NoSQL databases (PostgreSQL, MySQL, MongoDB, etc.)
Knowledge of ETL processes and data processing in distributed environments
Familiarity with Apache Hadoop, Hive, or Delta Lake
Experience with cloud storage (AWS S3, Google Cloud Storage, Azure Blob)
Proficiency in Git and version control
Strong problem-solving skills and a proactive attitude
A passion for learning and continuous improvement

Benefits For Junior Data Engineer (PySpark) - E-Learning

100% Remote Work
Highly Competitive USD Pay
Paid Time Off
Work with Autonomy
Work with Top American Companies