Process Mining Engineer - PySpark

Capco, a Wipro company, is a global technology and management consulting firm supporting 100+ clients across the banking, financial, and energy sectors.
Bengaluru, Karnataka, India · Chennai, Tamil Nadu, India
Data
Senior Software Engineer
Finance

Description For Process Mining Engineer - PySpark

As a Process Mining Engineer, you will:

  • Develop data transformation procedures (e.g. in PySpark) for large-scale (100GB+) process mining data sets based on big data file systems (e.g. Hadoop, Delta Lake)
  • Architect and build a modular process mining data framework to maximize data transformation scalability, reuse, robustness, and performance
  • Lead technical project delivery and ongoing maintenance of data pipelines
  • Monitor ETL runs and optimize execution speed and robustness
  • Embrace professional software development and large-scale data engineering practices
  • Collaborate with the PM architecture team on data quality and business volume validation
  • Develop delta update methodology and procedures
  • Engage with the PM community of practice (engineering angle)
  • Design PM engineering training sessions
  • Present the process mining engineering approach, lighthouse project briefs, and lessons learned in various tech forums

Key responsibilities include:

  • Strategy: Shape future evolutions of the process mining engineering framework
  • Process Mining: Architect and build a modular process mining data framework
  • Community: Engage with the PM community of practice
  • Risk Management: Identify and raise risks and missing/ineffective controls
  • Governance: Own and lead technical PM delivery methodology

Capco offers:

  • Work on engaging projects with large international and local banks
  • Innovative thinking and delivery excellence
  • Open culture valuing diversity, inclusivity, and creativity
  • Career advancement opportunities
  • Commitment to diversity and inclusion

Join Capco to make an impact in transforming the financial services industry!

Last updated 13 days ago

Responsibilities For Process Mining Engineer - PySpark

  • Develop data transformation procedures for large-scale process mining data sets
  • Architect and build a modular process mining data framework
  • Lead technical project delivery and maintenance of data pipelines
  • Monitor and optimize ETL runs for speed and robustness
  • Collaborate with the PM architecture team on data quality and validation
  • Develop delta update methodology and procedures
  • Engage with the PM community of practice and design training sessions
  • Present process mining engineering approaches in tech forums
  • Identify and mitigate risks in project delivery
  • Own and lead technical PM delivery methodology

Requirements For Process Mining Engineer - PySpark

  • Python
  • Experience with PySpark
  • Knowledge of big data file systems (e.g. Hadoop, Delta Lake)
  • Experience in large-scale (100GB+) data processing
  • Proficiency in software development and large-scale data engineering practices
  • Understanding of process mining concepts
  • Experience in ETL processes and optimization
  • Strong collaboration and communication skills
  • Banking domain experience
