Complete case logistic regression with a dichotomised continuous outcome led to biased estimates.

Rosaleen Peggy Cornish; Jonathan William Bartlett; John Macleod; Kate Tilling (2023) Complete case logistic regression with a dichotomised continuous outcome led to biased estimates. Journal of Clinical Epidemiology, 154. pp. 33-41. ISSN 0895-4356 DOI: 10.1016/j.jclinepi.2022.11.022

OBJECTIVES: To investigate whether a complete case logistic regression gives a biased estimate of the exposure odds ratio (OR) if missingness depends on a continuous outcome, but a binary version is used for analysis; to examine whether any bias could be reduced by including a misclassified form of the incomplete outcome as an auxiliary variable in multiple imputation (MI).

STUDY DESIGN AND SETTING: Analytical investigation, simulation study, and data from a UK cohort.

RESULTS: There was bias in the exposure OR when the probability of being a complete case was independently associated with the exposure and (continuous) outcome but this was generally small unless the association with the outcome was strong. Where exposure and (continuous) outcome interacted in their effect on this probability, the bias was large, particularly at high levels of missing data. Inclusion of the auxiliary variable resulted in important bias reductions when this had high sensitivity and specificity.

CONCLUSION: The robustness of logistic regression to missing data is not maintained when the outcome is a binary version of an underlying continuous measure, but the bias will be small unless the association between the continuous outcome and missingness is strong.
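The mechanism described in the results can be illustrated with a small simulation. The sketch below is not the authors' simulation design: the binary exposure, the normally distributed outcome, the dichotomisation threshold of 0.5, and every coefficient in the missingness model (`a_x`, `a_y`, `a_xy`, and the intercept of 0.5) are illustrative assumptions. With a single binary exposure, the OR from a logistic regression equals the cross-product ratio of the 2x2 table, so no model-fitting library is needed. The function compares the complete case log OR with the full-data log OR, once with additive dependence of missingness on exposure and continuous outcome, and once with an exposure-by-outcome interaction.

```python
import numpy as np


def exposure_odds_ratio(x, d):
    """Cross-product OR from the 2x2 table of binary exposure x and binary outcome d.

    For a single binary exposure this equals the OR estimated by a
    logistic regression of d on x.
    """
    n11 = np.sum((x == 1) & d)
    n10 = np.sum((x == 1) & ~d)
    n01 = np.sum((x == 0) & d)
    n00 = np.sum((x == 0) & ~d)
    return (n11 * n00) / (n10 * n01)


def log_or_bias(a_x, a_y, a_xy, n=200_000, seed=0):
    """Complete case log OR minus full-data log OR (all settings are assumed)."""
    rng = np.random.default_rng(seed)
    x = rng.integers(0, 2, size=n)        # binary exposure, prevalence 0.5
    y = 0.5 * x + rng.normal(size=n)      # continuous outcome
    d = y > 0.5                           # dichotomised outcome used in the analysis

    # Probability of being a complete case depends on the CONTINUOUS outcome.
    linear_predictor = 0.5 + a_x * x + a_y * y + a_xy * x * y
    p_complete = 1.0 / (1.0 + np.exp(-linear_predictor))
    complete = rng.random(n) < p_complete

    full = exposure_odds_ratio(x, d)
    complete_case = exposure_odds_ratio(x[complete], d[complete])
    return np.log(complete_case) - np.log(full)


# Additive dependence on exposure and continuous outcome (no interaction).
bias_additive = log_or_bias(a_x=-0.5, a_y=-0.5, a_xy=0.0)
# Exposure and continuous outcome interact in their effect on missingness.
bias_interaction = log_or_bias(a_x=-0.5, a_y=-0.5, a_xy=-1.5)

print(f"log OR bias, additive missingness:    {bias_additive:.3f}")
print(f"log OR bias, interaction missingness: {bias_interaction:.3f}")
```

Under these assumed settings the additive scenario should show only a small departure from the full-data log OR, while the interaction scenario should show a much larger one, in line with the pattern the abstract reports.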



Cornish_etal_2022_Complete-case-logistic-regression.pdf
Published Version
Available under Creative Commons: 4.0
