Taro Logo

Data Curation Developer

Global biopharmaceutical company focused on developing vaccines and medicines, combining understanding of the immune system with cutting-edge technology.
Data
Mid-Level Software Engineer
Remote
5,000+ Employees
3+ years of experience
Healthcare · AI
This job posting may no longer be active. You may be interested in these related jobs instead:

Description For Data Curation Developer

GSK is seeking a Data Curation Developer to focus on technical experience required to curate data for R&D analysis. This role is crucial in supporting GSK's Disease Area Strategies by making data analysis-ready for efficient decision-making across therapeutic areas.

The position involves leading development of business requirements for data curation, maintaining connections with analytical groups, and providing coaching to ensure best practices. You'll be responsible for delivering pre-packaged, curated datasets and integrating diverse data types including clinical trials, real-world data, and omics.

GSK offers a competitive package including annual bonus, healthcare benefits, pension plan, and shares program. The company embraces modern work practices through their Performance with Choice program, offering hybrid working models.

The ideal candidate will have strong technical skills in Python, Databricks, and data engineering frameworks, combined with expertise in handling various scientific clinical data types. Experience with industry data standards like CDISC and OMOP is preferred.

This role sits at the intersection of data engineering and life sciences, requiring both technical expertise and domain knowledge. You'll be part of GSK's mission to positively impact the health of 2.5 billion people by the end of the decade.

The company culture emphasizes being ambitious for patients, accountable for impact, and doing the right thing. GSK provides an inclusive environment where people can grow, be their best, and feel welcome, valued, and included.

Working at GSK means joining a global leader in healthcare innovation, with strong focus on R&D and cutting-edge technology. The role offers opportunities to work on meaningful projects that directly impact patient outcomes while developing your career in a supportive, forward-thinking environment.

Last updated 25 days ago

Responsibilities For Data Curation Developer

  • Lead development of business requirements for data curation
  • Maintain connections with analytical groups and R&D Data Platform teams
  • Provide coaching and peer review for data curation activities
  • Deliver pre-packaged, curated datasets aligned to business requirements
  • Integrate diverse datasets into unified format
  • Ensure datasets meet analysis-ready and privacy requirements
  • Write clean, readable code
  • Document and quality control deliverables

Requirements For Data Curation Developer

Python
  • BSc/MSc/PhD in Computer Science, Mathematics, Statistics, or related subject
  • Experience handling scientific clinical data including clinical trials, real world data, and omics
  • Ability to handle large structured, semi-structured, and unstructured datasets
  • Expertise in translating business needs into technical data requirements
  • Experience in Python, Databricks, Delta Lake, PySpark, Pandas
  • Strong communication skills
  • Agile mindset with ability to deliver prototypes quickly

Benefits For Data Curation Developer

Medical Insurance
Dental Insurance
Vision Insurance
401k
Equity
  • Annual bonus based on company performance
  • Healthcare and wellbeing programmes
  • Pension plan
  • Shares and savings programme
  • Hybrid working model