Background: The goal of many studies in environmental epidemiology is to assess the relationship between chemical exposure and disease outcome. Often various assays can be used to measure a particular environmental exposure, with some assays being more invasive or expensive than others.
Methods: We consider the situation in which 2 assays can be used to measure an environmental exposure. The first assay has measurement error and is subject to a lower detection limit (LOD), and the second assay has less measurement error and is not subject to a lower LOD. In this situation, the first assay is less invasive or less expensive and is measured in all study participants, whereas the second assay is more invasive or more expensive and is only measured in a subset of individuals. We develop a flexible class of regression models that incorporates both sets of assay measurements and allows for continuous or binary outcomes. We explore different design strategies for selecting the subset of patients in whom to measure the second assay. One design strategy is to measure the second more invasive or expensive assay only when the first assay is below LOD. We compare these designs with a simple design in which the second assay is measured in a random subset of patients without regard to the results of the first assay.
Results: We develop estimation approaches for these regression models. We demonstrate through simulations that there are efficiency advantages of measuring the second assay in at least a fraction of cases in which the first assay is above LOD. We illustrate the methodology by using data from a study examining the effect of environmental polychlorinated biphenyl exposure on the risk of endometriosis.
Conclusion: The proposed methodology has good statistical properties and will be a useful methodological technique for studying the effect of exposure on outcome when exposure assays are subject to LOD.
From the aBiostatistics and Bioinformatics Branch, Division of Epidemiology, Statistics, and Prevention Research, Eunice Kennedy Shriver National Institute of Child Health and Human Development, Rockville, MD; bDepartment of Statistics, University of Connecticut, Storrs, CT; cEpidemiology Branch, Division of Epidemiology, Statistics, and Prevention, Eunice Kennedy Shriver National Institute of Child Health, Rockville, MD; and dDepartment of Clinical Laboratory Sciences, University at Buffalo, State University of New York, Buffalo, NY.
Submitted 29 August 2008; accepted 3 February 2009; posted 8 April 2010.
Supported partially by the Long Range Initiative of the American Chemistry Council and funding from the Intramural Research Program of the Eunice Kennedy Shriver National Institute of Child Health and Human Development; National Institutes of Health.
Correspondence: Paul S. Albert, Biostatistics and Bioinformatics Branch, Division of Epidemiology, Statistics, and Prevention Research, Eunice Kennedy Shriver National Institute of Child Health and Human Development, 6100 Executive Blvd Room 7B05F, Bethesda, MD 20906. E-mail: firstname.lastname@example.org.