Linkage of multiple electronic health record datasets using a 'spine linkage' approach compared with all 'pairwise linkages'.

Helen A Blake ORCID logo; Linda D Sharples ORCID logo; Katie Harron ORCID logo; Jan H van der Meulen ORCID logo; Kate Walker ORCID logo; (2022) Linkage of multiple electronic health record datasets using a 'spine linkage' approach compared with all 'pairwise linkages'. International Journal of Epidemiology, 52 (1). pp. 214-226. ISSN 0300-5771 DOI: 10.1093/ije/dyac130
Copy

BACKGROUND: Methods for linking records between two datasets are well established. However, guidance is needed for linking more than two datasets. Using all 'pairwise linkages'-linking each dataset to every other dataset-is the most inclusive, but resource-intensive, approach. The 'spine' approach links each dataset to a designated 'spine dataset', reducing the number of linkages, but potentially reducing linkage quality. METHODS: We compared the pairwise and spine linkage approaches using real-world data on patients undergoing emergency bowel cancer surgery between 31 October 2013 and 30 April 2018. We linked an administrative hospital dataset (Hospital Episode Statistics; HES) capturing patients admitted to hospitals in England, and two clinical datasets comprising patients diagnosed with bowel cancer and patients undergoing emergency bowel surgery. RESULTS: The spine linkage approach, with HES as the spine dataset, created an analysis cohort of 15 826 patients, equating to 98.3% of the 16 100 patients identified using the pairwise linkage approach. There were no systematic differences in patient characteristics between these analysis cohorts. Associations of patient and tumour characteristics with mortality, complications and length of stay were not sensitive to the linkage approach. When eligibility criteria were applied before linkage, spine linkage included 14 509 patients (90.0% compared with pairwise linkage). CONCLUSION: Spine linkage can be used as an efficient alternative to pairwise linkage if case ascertainment in the spine dataset and data quality of linkage variables are high. These aspects should be systematically evaluated in the nominated spine dataset before spine linkage is used to create the analysis cohort.



picture_as_pdf
Blake_etal_2022_Linkage-of-multiple-electronic-health.pdf
subject
Published Version
Available under Creative Commons: NC-ND 4.0

View Download

Explore Further

Read more research from the creator(s):

Find work associated with the faculties and division(s):

Find work from this publication: