Background: Accurate information regarding race, ethnicity, and national origins is critical for identifying disparities in the cancer burden.
Objectives: To examine the use of a Spanish surname list to improve the quality of race-related information obtained from rapid case ascertainment (RCA) and to estimate the accuracy of race-related information obtained from cancer registry records collected by routine reporting.
Subjects: Self-reported survey responses of 3954 participants from California enrolled in the Cancer Care Outcomes Research and Surveillance Consortium.
Measures: Sensitivity, specificity, positive predictive value, and percent agreement. We used logistic regression to identify predictors of underreporting and overreporting of a race/ethnicity.
Results: Use of the Spanish surname list increased the sensitivity of RCA for Latino ethnicity from 37% to 83%. Sensitivity for cancer registry records collected by routine reporting was ≥95% for whites, blacks, and Asians, and specificity was high for all groups (86%–100%). However, patterns of misclassification by race/ethnicity were found that could lead to biased cancer statistics for specific race/ethnicities. Discordance between self-reported and registry-reported race/ethnicity was more likely for women, Latinos, and Asians.
Conclusions: Methods to improve race and ethnicity data, such as using Spanish surnames in RCA and instituting data collection guidelines for hospitals, are needed to ensure minorities are accurately represented in clinical and epidemiological research.