Data Curation Developer

A global biopharma company focused on uniting science, technology, and talent to get ahead of disease together.
Data
Mid-Level Software Engineer
Hybrid
5,000+ Employees
3+ years of experience
Healthcare · AI

Description For Data Curation Developer

GSK is seeking a Data Curation Developer with the technical experience required to curate data for R&D analysis. This role is crucial in supporting GSK's Disease Area Strategies by making data analysis-ready and enabling efficient decision-making across therapeutic areas.

The position involves leading development of business requirements for data curation, maintaining connections with analytical groups, providing coaching, and delivering pre-packaged, curated datasets. You'll work on integrating diverse datasets including clinical trials, real-world data, and omics into unified formats.

GSK offers a competitive package including annual bonus, healthcare benefits, pension plan, and shares program. Their Performance with Choice program provides flexible hybrid working arrangements.

The ideal candidate will have a BSc, MSc, or PhD in Computer Science, Mathematics, Statistics, or a related field, with proven experience handling scientific clinical data. Strong skills in Python, Databricks, Delta Lake, PySpark, and other data engineering frameworks are essential.

GSK is a global biopharma company with ambitious goals to impact the health of 2.5 billion people by decade's end. They focus on vaccines and medicines, combining immune system understanding with cutting-edge technology. The company promotes an inclusive culture where people can grow, be their best, and feel valued.

This role offers an opportunity to work with cutting-edge data technologies while contributing to meaningful healthcare advancements. You'll be part of a team focused on transforming complex data into actionable insights that drive medical research and development forward.

Last updated 11 hours ago

Responsibilities For Data Curation Developer

  • Lead development of business requirements for data curation
  • Maintain connections with analytical groups and R&D Data Platform teams
  • Provide coaching and peer review for data curation activities
  • Deliver pre-packaged, curated datasets aligned to business requirements
  • Integrate diverse datasets into a unified format
  • Ensure datasets meet analysis-ready and privacy requirements
  • Write clean, readable code
  • Document and quality-control deliverables

Requirements For Data Curation Developer

  • BSc/MSc/PhD in Computer Science, Mathematics, Statistics, or a related subject
  • Experience handling scientific clinical data, including clinical trials, biomarkers, and real-world data
  • Ability to handle large structured, semi-structured, and unstructured datasets
  • Expertise in translating business needs into technical data requirements
  • Experience in Python, Databricks, Delta Lake, PySpark, Pandas
  • Strong communication skills
  • Agile mindset with ability to deliver prototypes quickly

Benefits For Data Curation Developer

Medical Insurance · Dental Insurance · Vision Insurance · 401k · Equity
  • Annual bonus based on company performance
  • Healthcare and wellbeing programmes
  • Pension plan
  • Shares and savings programme
  • Hybrid working model
