Accurately stratifying patients in the preoperative period according to mortality risk informs treatment considerations and guides adjustments to bundled reimbursements. We developed and compared three machine learning models to determine which best predicts 30-day mortality after hip fracture.
The 2016 to 2017 National Surgical Quality Improvement Program for hip fracture (AO/OTA 31-A-B-C) procedure-targeted data were analyzed. Three models—artificial neural network, naive Bayes, and logistic regression—were trained and tested using independent variables selected via backward variable selection. The data were split into 80% training and 20% test sets. Predictive accuracy between models was evaluated using area under the curve receiver operating characteristics. Odds ratios were determined using multivariate logistic regression with P < 0.05 for significance.
The study cohort included 19,835 patients (69.3% women). The 30-day mortality rate was 5.3%. In total, 47 independent patient variables were identified to train the testing models. Area under the curve receiver operating characteristics for 30-day mortality was highest for artificial neural network (0.92), followed by the logistic regression (0.87) and naive Bayes models (0.83).
Machine learning is an emerging approach to develop accurate risk calculators that account for the weighted interactions between variables. In this study, we developed and tested a neural network model that was highly accurate for predicting 30-day mortality after hip fracture. This was superior to the naive Bayes and logistic regression models. The role of machine learning models to predict orthopaedic outcomes merits further development and prospective validation but shows strong promise for positively impacting patient care.