Reducing Bias Due to Outcome Misclassification for Epidemiologic Studies Using EHR-derived Probabilistic Phenotypes : Epidemiology

Secondary Logo

Journal Logo

Methods

Reducing Bias Due to Outcome Misclassification for Epidemiologic Studies Using EHR-derived Probabilistic Phenotypes

Hubbard, Rebecca A.; Tong, Jiayi; Duan, Rui; Chen, Yong

Author Information
Epidemiology 31(4):p 542-550, July 2020. | DOI: 10.1097/EDE.0000000000001193

Abstract

Epidemiologic studies using electronic health record (EHR)-derived phenotypes as outcomes are subject to bias due to phenotyping error. In the case of dichotomous phenotypes, existing methods for misclassified outcomes can be used to reduce bias. In this article, we present a bias correction approach for EHR-derived probabilistic phenotypes: continuous predicted probabilities of the outcome of interest. This approach makes use of correction factors that can be computed by hand and do not require specialized software. We used simulation studies to investigate the performance of the proposed approach under a variety of scenarios for accuracy of the probabilistic phenotype, strength of the outcome/exposure association, and prevalence of the outcome of interest. Across all scenarios investigated, the proposed approach substantially reduced bias in association parameter estimates relative to a naive approach. We demonstrate the application of this approach to a study of pediatric type 2 diabetes using data from the PEDSnet network of children’s hospitals. This straightforward correction factor can substantially reduce bias and improve the validity of EHR-based epidemiology.

Copyright © 2020 Wolters Kluwer Health, Inc. All rights reserved.

You can read the full text of this article if you:

Access through Ovid