Standardise and harmonise data across multiple clinical studies with varying conventions and standards
Identify patterns, inconsistencies, and anomalies across datasets and define mapping/validation rules
Improve FAIR (Findable, Accessible, Interoperable, Reusable) data maturity (metadata, lineage, quality rules)
Translate ambiguous business questions into clear analytical requirements and data solutions
Perform data sourcing, transformation, reconciliation (Python, SQL, AWS)
Ensure GDPR-compliant and ethical use of patient/study data
Communicate insights, assumptions, and limitations clearly to stakeholders
Strong analytical problem-solving skills in ambiguous environments
Proven ability to standardise and harmonise heterogeneous datasets
Strong pattern recognition across multiple datasets/studies
Advanced Python and SQL skills (profiling, validation, reconciliation)
Experience working in AWS-based data environments (e.g., S3, query/processing services)
Experience with data visualisation tools (e.g., Power BI or similar)
Strong documentation and stakeholder communication skills
Ability to quickly learn new domains and data standards
Familiarity with clinical trial data standards (e.g., CDISC SDTM, ADaM)
Experience working with SAS datasets or legacy clinical environments
Understanding of clinical study conduct and patient data governance
Exposure to Machine Learning / AI use cases
Experience exploring LLM-based or agentic tooling in data workflows
