The aim of this study was to assess the potential of machine learning with multiparametric magnetic resonance imaging (mpMRI) for the early prediction of pathological complete response (pCR) to neoadjuvant chemotherapy (NAC) and of survival outcomes in breast cancer patients.
Materials and Methods
This institutional review board–approved prospective study included 38 women (median age, 46.5 years; range, 25–70 years) with breast cancer who were scheduled for NAC and underwent mpMRI of the breast at 3 T with dynamic contrast-enhanced (DCE), diffusion-weighted imaging (DWI), and T2-weighted imaging before and after 2 cycles of NAC. For each lesion, 23 features were extracted: qualitative T2-weighted and DCE-MRI features according to BI-RADS (Breast Imaging Reporting and Data System), quantitative pharmacokinetic DCE features (mean plasma flow, volume distribution, mean transit time), and DWI apparent diffusion coefficient (ADC) values. To apply machine learning to mpMRI, 8 classifiers including linear support vector machine, linear discriminant analysis, logistic regression, random forests, stochastic gradient descent, decision tree, adaptive boosting, and extreme gradient boosting (XGBoost) were used to rank the features. Histopathologic residual cancer burden (RCB) class (with RCB 0 being a pCR), recurrence-free survival (RFS), and disease-specific survival (DSS) were used as the standards of reference. Classification accuracy with area under the receiving operating characteristic curve (AUC) was assessed using all the extracted qualitative and quantitative features for pCR as defined by RCB class, RFS, and DSS using recursive feature elimination. To overcome overfitting, 4-fold cross-validation was used.
Machine learning with mpMRI achieved stable performance as shown by mean classification accuracies for the prediction of RCB class (AUC, 0.86) and DSS (AUC, 0.92) based on XGBoost and the prediction of RFS (AUC, 0.83) with logistic regression. The XGBoost classifier achieved the most stable performance with high accuracies compared with other classifiers. The most relevant features for the prediction of RCB class were as follows: changes in lesion size, complete pattern of shrinkage, and mean transit time on DCE-MRI; minimum ADC on DWI; and peritumoral edema on T2-weighted imaging. The most relevant features for prediction of RFS were as follows: volume distribution, mean plasma flow, and mean transit time; DCE-MRI lesion size; minimum, maximum, and mean ADC with DWI. The most relevant features for prediction of DSS were as follows: lesion size, volume distribution, and mean plasma flow on DCE-MRI, and maximum ADC with DWI.
Machine learning with mpMRI of the breast enables early prediction of pCR to NAC as well as survival outcomes in breast cancer patients with high accuracy and thus may provide valuable predictive information to guide treatment decisions.