Taro Logo

Process Mining Engineer - PySpark

Capco, a Wipro company, is a global technology and management consulting firm supporting 100+ clients across banking, financial and Energy sectors.
Data
Senior Software Engineer
Finance
This job posting may no longer be active. You may be interested in these related jobs instead:

Description For Process Mining Engineer - PySpark

Capco, a Wipro company, is a global technology and management consulting firm supporting 100+ clients across banking, financial and Energy sectors. As a Process Mining Engineer, you will:

  • Develop data transformation procedures (e.g. PySpark) for large scale (100GB+) process mining data sets based on big data file systems (e.g. Hadoop, Delta Lake)
  • Architect and build modular process mining data framework to maximize data transformation scalability, reuse, robustness, and performance
  • Lead technical project delivery and ongoing maintenance of data pipelines
  • Monitor ETL runs and optimize execution speed and robustness
  • Embrace professional software development and large-scale data engineering practices
  • Collaborate with PM architecture for data quality and business volume validation
  • Develop delta update methodology and procedures
  • Engage with the PM community of practice (engineering angle)
  • Design and PM engineering training sessions
  • Present process mining engineering approach, lighthouse project briefs and lessons learned in various tech forums

Key responsibilities include:

  • Strategy: Shape future evolutions of the process mining engineering framework
  • Process Mining: Architect and build modular process mining data framework
  • Community: Engage with the PM community of practice
  • Risk Management: Identify and raise risks and missing/ineffective controls
  • Governance: Own and lead technical PM delivery methodology

Capco offers:

  • Work on engaging projects with large international and local banks
  • Innovative thinking and delivery excellence
  • Open culture valuing diversity, inclusivity, and creativity
  • Career advancement opportunities
  • Commitment to diversity and inclusion

Join Capco to make an impact in transforming the financial services industry!

Last updated 10 months ago

Responsibilities For Process Mining Engineer - PySpark

  • Develop data transformation procedures for large scale process mining data sets
  • Architect and build modular process mining data framework
  • Lead technical project delivery and maintenance of data pipelines
  • Monitor and optimize ETL runs for speed and robustness
  • Collaborate with PM architecture team for data quality and validation
  • Develop delta update methodology and procedures
  • Engage with the PM community of practice and design training sessions
  • Present process mining engineering approaches in tech forums
  • Identify and mitigate risks in project delivery
  • Own and lead technical PM delivery methodology

Requirements For Process Mining Engineer - PySpark

Python
  • Experience with PySpark
  • Knowledge of big data file systems (e.g. Hadoop, Delta Lake)
  • Experience in large scale (100GB+) data processing
  • Proficiency in software development and large-scale data engineering practices
  • Understanding of process mining concepts
  • Experience in ETL processes and optimization
  • Strong collaboration and communication skills
  • Banking domain experience

Interested in this job?