
Data Curation Developer

GSK is a global biopharma company focused on uniting science, technology, and talent to get ahead of disease together, developing vaccines and medicines while focusing on the immune system and new technologies.
Data
Mid-Level Software Engineer
Remote
5,000+ Employees
3+ years of experience
Healthcare · Biotech

Description For Data Curation Developer

GSK is seeking a Data Curation Developer to join its R&D team in a role that combines technical expertise with data science. The position focuses on curating, processing, and harmonizing scientific and clinical data to produce high-quality data assets for R&D analysis. The role supports GSK's Disease Area Strategies and key R&D priorities by making data analysis-ready, enabling efficient decision-making across therapeutic areas.

The ideal candidate will have strong technical skills in Python, Databricks, and data engineering frameworks, combined with experience handling clinical trial data, real-world data, and omics data. They will lead the development of business requirements for data curation while maintaining strong connections with analytical groups and R&D Data Platform teams.

GSK offers a competitive compensation package including an annual bonus, healthcare benefits, a pension plan, and a shares programme. The company embraces modern work practices through its Performance with Choice programme, a hybrid working model that balances remote and in-office work.

This is an excellent opportunity for a data professional who wants to make a significant impact in the healthcare and biotech industry. GSK's mission is to positively impact the health of 2.5 billion people by the end of the decade, and this role directly contributes to that goal by ensuring data quality and accessibility for critical R&D initiatives.

The company culture emphasizes being ambitious for patients, accountable for impact, and doing the right thing. GSK focuses on accelerating significant assets that meet patients' needs and have the highest probability of success. This role offers the chance to work with cutting-edge technology while contributing to meaningful healthcare advancements.

Last updated 4 days ago

Responsibilities For Data Curation Developer

  • Lead development of business requirements for data curation
  • Maintain connections with analytical groups and R&D Data Platform teams
  • Provide coaching and peer review for data curation activities
  • Deliver pre-packaged, curated datasets aligned to business requirements
  • Integrate diverse datasets into a unified format
  • Write clean, readable code
  • Ensure deliverables are quality controlled and documented

Requirements For Data Curation Developer

  • BSc/MSc/PhD in Computer Science, Mathematics, Statistics, or related subject
  • Experience handling scientific and clinical data
  • Ability to process large structured and unstructured datasets
  • Expertise in translating business needs into technical requirements
  • Experience in Python, Databricks, Delta Lake, PySpark, Pandas
  • Strong communication skills
  • Agile mindset

Benefits For Data Curation Developer

  • Medical insurance
  • Dental insurance
  • Vision insurance
  • 401k
  • Equity
  • Annual bonus based on company performance
  • Healthcare and wellbeing programmes
  • Pension plan
  • Shares and savings programme
  • Hybrid working model
  • Performance with Choice programme
