Junior Data Engineer (PySpark) - E-Learning (Latam)

Leading provider of nearshore staff augmentation services headquartered in New York, delivering top-tier technology solutions with 600+ tech professionals in Latin America.
Data
Entry-Level Software Engineer
Remote
501 - 1,000 Employees
1+ year of experience
Education · Enterprise SaaS
This job posting may no longer be active. You may be interested in these related jobs instead:
Business Intel Engineer I, Global Operations - Artificial Intelligence

Business Intelligence Engineer role at Amazon's GO-AI team, focusing on data analysis and visualization for global operations and AI-driven automation systems.

Software Engineer-Data Quality Engineering-Associate

Associate Software Engineer position focused on data quality engineering at BlackRock.

Big data developer Jr

Junior Big Data Developer position at BBVA in Mexico City, perfect for early-career professionals looking to work with large-scale data systems in financial services.

Big data developer Jr

Entry-level Big Data Developer position at BBVA in Mexico City, focusing on data engineering and analytics for one of the largest financial institutions.

Data Engineer, Accounting

Data Engineer position at Amazon supporting global Accounting organization with data infrastructure, ETL processes, and analytics solutions using AWS and big data technologies.

Description For Junior Data Engineer (PySpark) - E-Learning (Latam)

Truelogic, a New York-based leader in nearshore staff augmentation, is seeking a Junior to Semi-Senior PySpark Data Engineer for their e-learning initiative. With over two decades of experience and 600+ tech professionals across Latin America, they're offering a unique opportunity to join their dynamic team.

The role focuses on building and optimizing data pipelines using PySpark and Apache Spark in a fully remote environment. You'll be working on cutting-edge solutions, handling big data transformations, and collaborating with cross-functional teams including Data Science, BI, and DevOps.

This position is perfect for early-career data engineers with 1-3 years of experience who are passionate about learning and growing in the field. You'll need experience with PySpark, SQL, various databases (PostgreSQL, MySQL, MongoDB), and cloud storage solutions. The role offers hands-on experience with distributed computing and big data technologies.

What makes this opportunity stand out is the combination of technical growth and excellent benefits. You'll receive competitive USD compensation, flexible remote work arrangements, paid time off, and the chance to work with leading U.S. companies. The company culture emphasizes work-life balance and professional development, with access to a diverse, global network of over 600 professionals across 25+ countries.

If you're looking to accelerate your career in data engineering while working on impactful projects in a supportive, multicultural environment, this role offers the perfect blend of challenge and opportunity.

Last updated 2 days ago

Responsibilities For Junior Data Engineer (PySpark) - E-Learning (Latam)

  • Design, develop, and optimize data pipelines using PySpark and Apache Spark
  • Integrate and process data from multiple sources (databases, APIs, files, streaming)
  • Implement efficient data transformations for Big Data in distributed environments
  • Optimize code to improve performance, scalability, and efficiency in data processing
  • Collaborate with Data Science, BI, and DevOps teams to ensure seamless integration
  • Monitor and debug data processes to ensure quality and reliability
  • Apply best practices in data engineering and maintain clear documentation
  • Stay up to date with the latest trends in Big Data and distributed computing

Requirements For Junior Data Engineer (PySpark) - E-Learning (Latam)

Python
MongoDB
MySQL
PostgreSQL
  • 1-3 years of experience working with PySpark and Apache Spark in Big Data environments
  • Experience with SQL and relational and NoSQL databases (PostgreSQL, MySQL, MongoDB, etc.)
  • Knowledge of ETL processes and data processing in distributed environments
  • Familiarity with Apache Hadoop, Hive, or Delta Lake
  • Experience with cloud storage (AWS S3, Google Cloud Storage, Azure Blob)
  • Proficiency in Git and version control
  • Strong problem-solving skills and a proactive attitude
  • A passion for learning and continuous improvement

Benefits For Junior Data Engineer (PySpark) - E-Learning (Latam)

  • 100% Remote Work
  • Highly Competitive USD Pay
  • Paid Time Off
  • Work with Autonomy
  • Work with Top American Companies

Interested in this job?