Covariates may affect continuous responses differently at various points of the response distribution. For example, some exposure might have minimal impact on conditional means, whereas it might lower conditional 10th percentiles sharply. Such differential effects can be important to detect. In studies of the determinants of birth weight, for instance, it is critical to identify exposures like the one above, since low birth weight is a risk factor for later health problems. Effects of covariates on the tails of distributions can be obscured by models (such as linear regression) that estimate conditional means; however, effects on tails can be detected by quantile regression. We present 2 approaches for exploring high-dimensional predictor spaces to identify important predictors for quantile regression. These are based on the lasso and elastic net penalties. We apply the approaches to a prospective cohort study of adverse birth outcomes that includes a wide array of demographic, medical, psychosocial, and environmental variables. Although tobacco exposure is known to be associated with lower birth weights, the analysis suggests an interesting interaction effect not previously reported: tobacco exposure depresses the 20th and 30th percentiles of birth weight more strongly when mothers have high levels of lead in their blood compared with those who have low blood lead levels.
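To make the idea of penalized quantile regression concrete, the following minimal sketch writes out a generic objective: the mean check (pinball) loss at quantile level `tau`, plus a lasso (L1) term and an optional squared-L2 term that together form an elastic net penalty. It is an illustration under stated assumptions, not the fitting procedure used in this article, which the abstract does not specify. The function names, the penalty weights `lam1` and `lam2`, and the ten toy observations are all hypothetical and were chosen only so that the two groups share a mean but differ in the lower tail.

```python
def check_loss(residual, tau):
    """Check (pinball) loss: tau * r if r >= 0, (1 - tau) * |r| if r < 0."""
    return residual * (tau - (1.0 if residual < 0 else 0.0))


def penalized_objective(beta0, beta, X, y, tau, lam1=0.0, lam2=0.0):
    """Mean check loss plus an elastic-net-style penalty on the slopes.

    lam1 weights the lasso (L1) term and lam2 the squared-L2 term;
    the intercept beta0 is left unpenalized. (Hypothetical parameter names.)
    """
    fit_term = 0.0
    for xi, yi in zip(X, y):
        pred = beta0 + sum(b * x for b, x in zip(beta, xi))
        fit_term += check_loss(yi - pred, tau)
    penalty = lam1 * sum(abs(b) for b in beta) + lam2 * sum(b * b for b in beta)
    return fit_term / len(y) + penalty


# Made-up toy data (NOT from the study): one binary exposure, ten responses.
# Both groups have the same mean, but the exposed group has a lower tail.
X = [[0]] * 5 + [[1]] * 5
y = [29, 31, 33, 35, 37, 23, 30, 34, 36, 42]

mean_unexposed = sum(y[:5]) / 5
mean_exposed = sum(y[5:]) / 5

# Compare two candidate fits at the 10th percentile, with no penalty.
no_effect = penalized_objective(29, [0], X, y, tau=0.1)
tail_effect = penalized_objective(29, [-6], X, y, tau=0.1)

print("group means:", mean_unexposed, mean_exposed)
print("objective, slope 0: ", no_effect)
print("objective, slope -6:", tail_effect)
```

In this toy setting a comparison of means sees no difference between the groups, while the check loss at `tau=0.1` favors a negative slope for the exposure. Raising `lam1` makes nonzero slopes more costly, which is how a lasso-type penalty screens out weak predictors when many covariates are considered.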
From the Department of Statistical Science, Duke University, Durham, NC; and the Nicholas School of the Environment and Department of Pediatrics, Duke University, Durham, NC.
Submitted October 2010; accepted May 2011.
Supported by U.S. Environmental Protection Agency (grant R833293).
Supplemental digital content is available through direct URL citations in the HTML and PDF versions of this article (www.epidem.com).
Correspondence: Lane F. Burgette, Department of Statistical Science, Box 90251, Duke University, Durham, NC 27708. E-mail: firstname.lastname@example.org.