Exploring surveillance data biases when estimating the reproduction number: with insights into subpopulation transmission of Covid-19 in England

Katharine Sherratt ORCID logo; Sam Abbott ORCID logo; Sophie R Meakin; Joel Hellewell ORCID logo; James D Munday ORCID logo; Nikos Bosse; Mark Jit ORCID logo; Sebastian Funk ORCID logo; (2021) Exploring surveillance data biases when estimating the reproduction number: with insights into subpopulation transmission of Covid-19 in England. Philosophical transactions of the Royal Society of London Series B, Biological sciences, 376 (1829). ISSN 0962-8436 DOI: 10.1101/2020.10.18.20214585
Copy

<jats:title>Abstract</jats:title><jats:p>The time-varying reproduction number (<jats:italic>R</jats:italic><jats:sub><jats:italic>t</jats:italic></jats:sub>: the average number secondary infections caused by each infected person) may be used to assess changes in transmission potential during an epidemic. While new infections are not usually observed directly, they can be estimated from data. However, data may be delayed and potentially biased. We investigated the sensitivity of <jats:italic>R</jats:italic><jats:sub><jats:italic>t</jats:italic></jats:sub> estimates to different data sources representing Covid-19 in England, and we explored how this sensitivity could track epidemic dynamics in population sub-groups.</jats:p><jats:p>We sourced public data on test-positive cases, hospital admissions, and deaths with confirmed Covid-19 in seven regions of England over March through August 2020. We estimated <jats:italic>R</jats:italic><jats:sub><jats:italic>t</jats:italic></jats:sub> using a model that mapped unobserved infections to each data source. We then compared differences in <jats:italic>R</jats:italic><jats:sub><jats:italic>t</jats:italic></jats:sub> with the demographic and social context of surveillance data over time.</jats:p><jats:p>Our estimates of transmission potential varied for each data source, with the relative inconsistency of estimates varying across regions and over time. <jats:italic>R</jats:italic><jats:sub><jats:italic>t</jats:italic></jats:sub> estimates based on hospital admissions and deaths were more spatio-temporally synchronous than when compared to estimates from all test-positives. We found these differences may be linked to biased representations of subpopulations in each data source. These included spatially clustered testing, and where outbreaks in hospitals, care homes, and young age groups reflected the link between age and severity of disease.</jats:p><jats:p>We highlight that policy makers could better target interventions by considering the source populations of <jats:italic>R</jats:italic><jats:sub><jats:italic>t</jats:italic></jats:sub> estimates. Further work should clarify the best way to combine and interpret <jats:italic>R</jats:italic><jats:sub><jats:italic>t</jats:italic></jats:sub> estimates from different data sources based on the desired use.</jats:p>


picture_as_pdf
Exploring surveillance data biases when estimating the reproduction number with insights into subpopulation transmission of .pdf
subject
Submitted Version
Available under Creative Commons: 3.0

View Download
picture_as_pdf

Published Version


Atom BibTeX OpenURL ContextObject in Span Multiline CSV OpenURL ContextObject Dublin Core Dublin Core MPEG-21 DIDL EndNote HTML Citation JSON MARC (ASCII) MARC (ISO 2709) METS MODS RDF+N3 RDF+N-Triples RDF+XML RIOXX2 XML Reference Manager Refer Simple Metadata ASCII Citation EP3 XML
Export

Downloads