Junior Data Engineer (PySpark) - E-Learning (Latam)

Truelogic

Leading provider of nearshore staff augmentation services headquartered in New York, delivering top-tier technology solutions with 600+ tech professionals in Latin America.

São Paulo, State of São Paulo, Brazil

Data

Entry-Level Software Engineer

Remote

501 - 1,000 Employees

1+ year of experience

Education · Enterprise SaaS

Description For Junior Data Engineer (PySpark) - E-Learning (Latam)

Truelogic, a New York-based leader in nearshore staff augmentation, is seeking a Junior to Semi-Senior PySpark Data Engineer for its e-learning initiative. The company has over two decades of experience and 600+ tech professionals across Latin America.

The role focuses on building and optimizing data pipelines using PySpark and Apache Spark in a fully remote environment. You'll be working on cutting-edge solutions, handling big data transformations, and collaborating with cross-functional teams including Data Science, BI, and DevOps.

This position suits early-career data engineers with 1-3 years of experience who are passionate about learning and growing in the field. You'll need experience with PySpark, SQL, relational and NoSQL databases (PostgreSQL, MySQL, MongoDB), and cloud storage (AWS S3, Google Cloud Storage, Azure Blob). The role offers hands-on experience with distributed computing and big data technologies.

What makes this opportunity stand out is the combination of technical growth and excellent benefits. You'll receive competitive USD compensation, flexible remote work arrangements, paid time off, and the chance to work with leading U.S. companies. The company culture emphasizes work-life balance and professional development, with access to a diverse, global network of over 600 professionals across 25+ countries.

If you're looking to accelerate your career in data engineering while working on impactful projects in a supportive, multicultural environment, this role offers the perfect blend of challenge and opportunity.

Last updated 2 days ago

Responsibilities For Junior Data Engineer (PySpark) - E-Learning (Latam)

Design, develop, and optimize data pipelines using PySpark and Apache Spark
Integrate and process data from multiple sources (databases, APIs, files, streaming)
Implement efficient data transformations for Big Data in distributed environments
Optimize code to improve performance, scalability, and efficiency in data processing
Collaborate with Data Science, BI, and DevOps teams to ensure seamless integration
Monitor and debug data processes to ensure quality and reliability
Apply best practices in data engineering and maintain clear documentation
Stay up to date with the latest trends in Big Data and distributed computing

Requirements For Junior Data Engineer (PySpark) - E-Learning (Latam)

Python · MongoDB · MySQL · PostgreSQL

1-3 years of experience working with PySpark and Apache Spark in Big Data environments
Experience with SQL and relational and NoSQL databases (PostgreSQL, MySQL, MongoDB, etc.)
Knowledge of ETL processes and data processing in distributed environments
Familiarity with Apache Hadoop, Hive, or Delta Lake
Experience with cloud storage (AWS S3, Google Cloud Storage, Azure Blob)
Proficiency in Git and version control
Strong problem-solving skills and a proactive attitude
A passion for learning and continuous improvement

Benefits For Junior Data Engineer (PySpark) - E-Learning (Latam)

100% Remote Work
Highly Competitive USD Pay
Paid Time Off
Work with Autonomy
Work with Top American Companies