Data Curation Developer

GSK is a global biopharma company focused on uniting science, technology and talent to get ahead of disease together, developing vaccines, specialty and general medicines.
Data
Mid-Level Software Engineer
Remote
5,000+ Employees
3+ years of experience
Healthcare · AI

Description For Data Curation Developer

GSK is seeking a Data Curation Developer to join its R&D team and transform complex scientific and clinical data into analysis-ready assets. The role supports GSK's Disease Area Strategies by making data accessible and actionable for decision-making across therapeutic areas. It combines technical expertise in data engineering with domain knowledge of healthcare and clinical data.

The ideal candidate will lead the development of business requirements for data curation, working closely with R&D business and data platform teams. They will be responsible for handling various data types, including clinical trials, real-world data, and omics, ensuring they meet privacy and analysis requirements. The role requires expertise in Python, Databricks, and other modern data engineering tools.

GSK offers an environment for career growth, anchored by its goal of positively impacting the health of 2.5 billion people by the end of the decade. The company provides a comprehensive benefits package including healthcare, annual bonuses, and a hybrid working model through its Performance with Choice programme.

This position offers an opportunity to work at the intersection of healthcare and technology, applying data engineering skills to meaningful healthcare challenges. The role combines technical expertise with business impact, requiring both strong coding abilities and excellent communication skills. GSK's commitment to diversity, inclusion, and work-life balance makes this an attractive opportunity for professionals looking to make a difference in healthcare through data.

Last updated 15 hours ago

Responsibilities For Data Curation Developer

  • Lead development of business requirements for data curation
  • Maintain connections with analytical groups and R&D Data Platform teams
  • Provide coaching and peer review for data curation activities
  • Deliver pre-packaged, curated datasets aligned to business requirements
  • Integrate diverse datasets into a unified format
  • Write clean, readable code
  • Ensure deliverables are quality controlled and documented

Requirements For Data Curation Developer

  • BSc/MSc/PhD in Computer Science, Mathematics, Statistics, or related subject
  • Experience handling scientific and clinical data, including trial data and real-world data
  • Ability to handle large structured, semi-structured, and unstructured datasets
  • Experience in Python, Databricks, Delta Lake, PySpark, Pandas
  • Strong communication skills
  • Agile mindset with ability to deliver prototypes quickly

Benefits For Data Curation Developer

  • Medical insurance
  • Vision insurance
  • Dental insurance
  • 401k
  • Annual bonus based on company performance
  • Healthcare and wellbeing programmes
  • Pension plan membership
  • Shares and savings programme
  • Hybrid working model through Performance with Choice programme
