Social determinants of health (SDH) at the area level are understood to influence the likelihood of having poor glycemic control for patients with type 2 diabetes mellitus (T2DM).
To develop a model for predicting whether a person with T2DM has uncontrolled diabetes (hemoglobin A1c ≥9%), incorporating individual and area-level (census tract) covariates.
Development and validation of machine learning models.
Total of N=1,015,808 privately insured persons in claims data with T2DM.
C-statistic, sensitivity, specificity, positive predictive value, negative predictive value, and accuracy.
A standard logistic regression model selecting among the available individual-level covariates and area-level SDH covariates (at the census tract level) performed poorly, with a C-statistic of 0.685, sensitivity of 25.6%, specificity of 90.1%, positive predictive value of 56.9%, negative predictive value of 70.4%, and accuracy of 68.4% on a 25% held-out validation subset of the data. By contrast, machine learning models improved upon risk prediction, with the highest performance from a random forest algorithm with a C-statistic of 0.928, sensitivity of 68.5%, specificity of 94.6%, positive predictive value of 69.8%, negative predictive value of 94.3%, and accuracy of 90.6%. SDH variables alone explained 16.9% of variation in uncontrolled diabetes.
A predictive model developed through a machine learning approach may assist health care organizations to identify which area-level SDH data to monitor for prediction of diabetes control, for potential use in risk-adjustment and targeting.
*Research and Analytics, Collective Health, San Francisco, CA
†Center for Primary Care, Harvard Medical School, Boston, MA
‡School of Public Health, Imperial College, London, UK
§KPMG LLP, San Francisco, CA
Supported by the National Institute On Minority Health And Health Disparities of the National Institutes of Health under Award Number R21MD012867 (S.B.) and by KPMG (S.B. and R.N.). The content is solely the responsibility of the authors and does not necessarily represent the official views of the National Institutes of Health or of any current or former employer of the authors.
The authors declare no conflict of interest.
Reprints: Sanjay Basu, MD, PhD, 635 Huntington Avenue, Boston, MA 02115. E-mail: Sanjay_Basu@hms.harvard.edu.