Accounting for Bias Due to Selective Attrition: The Example of Smoking and Cognitive Decline : Epidemiology

Secondary Logo

Journal Logo


Accounting for Bias Due to Selective Attrition

The Example of Smoking and Cognitive Decline

Weuve, Jennifer; Tchetgen Tchetgen, Eric J.; Glymour, M. Maria; Beck, Todd L.; Aggarwal, Neelum T.; Wilson, Robert S.; Evans, Denis A.; Mendes de Leon, Carlos F.

Author Information
Epidemiology 23(1):p 119-128, January 2012. | DOI: 10.1097/EDE.0b013e318230e861


Decline in cognitive function is a common occurrence with aging and the hallmark manifestation of dementia.1,2 Few modifiable risk factors for cognitive decline and dementia have been identified.3 This may be due in part to the particular methodological challenges that affect longitudinal studies of cognitive aging and other late-life health outcomes. One important challenge is addressing selection bias from selective mortality or other forms of attrition that occur after study enrollment. These selection processes will bias estimates of a risk factor's association with an outcome if selection is influenced by both the risk factor and the outcome or, alternatively, by determinants of the risk factor and the outcome (Figure 1A).5 Although selection bias is a concern in all longitudinal studies of aging-related outcomes, it is especially relevant in studies of cognitive decline because impaired cognition strongly predicts morbidity,69 mortality,10,11 and attrition after study enrollment.12,14 Studies of risk factors that are themselves associated with substantial morbidity and mortality, such as smoking, are especially vulnerable to bias due to selective attrition (Figure 1B).5,15

Figure 1:
A, Directed acyclic graph (DAG) depicting general causal structure underlying attrition-related selection bias. In this DAG, the risk factor of interest directly influences postenrollment survival or continuation in the study. The outcome is associated with survival or continuation through its relation to the unmeasured factor, U (eg, a genetic variant that results in more efficient detoxification). Survival/continuation is a collider on the path between the risk factor and the outcome. Conventional unweighted analyses of follow-up data are restricted to the group of participants who survive and continue in the study, a form of conditioning indicated by the box around “survival/continuation.” As shown in the DAG, this restriction can induce a spurious association between the risk factor and U, and thus between the risk factor and the outcome, even in the absence of a true causal relation between the two. B, Directed acyclic graph (DAG) depicting causal structure underlying attrition-related selection bias in the relation of smoking to cognitive decline. This DAG shows that smoking decreases postenrollment survival or continuation. Cognitive decline during the course of the study is inversely related to survival or continuation through its association with previous cognitive decline. Conventional unweighted analyses of follow-up data are restricted to the group of participants who survive and continue in the study, a form of conditioning indicated by the box around “survival/continuation.” Continuing survivors who smoke will have had less than expected previous cognitive decline, and the restriction to continuing survivors can induce a downward bias in the association between smoking and cognitive decline, resulting in underestimates of harm or overestimates of protection. For an introduction to DAGs, see Glymour and Greenland.4

Smoking is thought to increase risk of cognitive decline and dementia in older age, mainly through its well-established vascular effects, although some data suggest potential benefits of nicotine.1618 Findings from previous longitudinal studies of smoking and cognitive decline have been mixed.16,1927 As is typical in longitudinal studies of older adults, many of these studies during the course of follow-up lost a substantial proportion of their baseline populations—often >20%—to attrition. Among 5 studies reporting on attrition in relation to smoking and cognition,19,21,25,27 3 reported that attrition was associated with smoking, cognition, or both.19,25,27 Given the strong impact of smoking on morbidity and mortality—smokers have 2 to 3 times the mortality rate of never-smokers28—and its overall association with study attrition,14 previous studies may have underestimated the adverse relation of smoking to cognitive decline. To our knowledge, no prior studies have quantified and corrected for the potential influence of selective attrition on the estimated relation of smoking to cognitive decline.

Until recently, epidemiologists have had few accessible tools for addressing differential attrition.29 A common approach in regression models of the association between an exposure and outcome of interest is to include terms for factors that predict attrition. This approach is unsatisfactory because some predictors may be influenced by the exposure, and adjustment for such postexposure variables is generally known to bias effect estimates.4 For example, although prior cognitive function is a strong predictor of both attrition and future cognitive decline, adjusting for baseline or intermediate measurements of cognitive function could produce estimates that are substantially inflated if the exposure is associated with baseline cognitive score (eFigure 1, Robins, Hernán, Cole, and others31,32 have developed an inverse-probability-weighting approach to “correct” analyses for differential attrition, based on observed covariate history. This approach allows for use of information on potential intermediates and previous cognitive function while avoiding the pitfalls of conventional adjustment for these variables.

Inverse-probability-weighting methods should be particularly relevant to longitudinal studies of aging-related health outcomes, given the high rates of attrition that are common in this research. We used inverse-probability-of-attrition weighting (IPAW) to examine the influence of attrition-related selection bias on the estimated association between smoking and cognitive decline. We first developed models of the probability of continuing in the study—that is, remaining alive and not lost to follow-up—and from these models, we computed predicted probabilities of continuation for each observation. For greater specificity, we distinguished between attrition due to mortality and attrition due to other causes (study dropout), which is often related to frail health.13,14 We then used these probabilities to compute analytical weights that are in inverse proportion to the probability of remaining alive and in the study. Observations with characteristics associated with a lower probability of continuation, for example, physical frailty, were assigned larger weights, thereby “compensating” for the underrepresentation of these types of observations in the observed follow-up data. We then applied the weights to our analyses of the association between smoking and cognitive decline.

We hypothesized that, compared with never-smokers, persons who were current smokers at baseline would experience faster cognitive decline during the 12 years of follow-up. Further, we anticipated that the association between current smoking and cognitive change would be larger after accounting for selective mortality and non–death-related dropout. Because the evaluation of former smoking and cognitive decline entails additional complexities (eg, accounting for determinants of cessation), we assessed only the contrast between current and never smoking in this analysis.


Study Population

We conducted our analyses using data from participants in the ongoing Chicago Health and Aging Project.33 The first wave of recruitment began in 1993 with a door-to-door census of residents living in 3 geographically defined neighborhoods on the south side of Chicago. Of 8501 adults, aged ≥65 years, who were identified in this recruitment wave, 1655 declined to participate, 439 died, and 249 moved before an in-person assessment could be conducted. This left 6158 participants in the cohort, 18 of whom did not report their smoking status. We focused our analyses on the 3768 participants who, on enrollment, reported that they were current smokers or that they had never smoked. Of these participants, 55 (1.5%) were missing data on key covariates used for the computation of weights or for the analytical regression models, leaving 3713 persons (891 smokers and 2822 never- smokers) for our primary analyses. Participants undergo in-home assessments every 3 years, and those in our analyses contributed 10,096 observations from 5 assessment cycles. Information on time-varying covariates was missing for 11 observations, leaving 10,085 observations for our analyses. Some participants returned to the study after skipping a cycle (279, 9.5% of those censored); we classified these participants as permanently censored on their first missed cycle, effectively treating dropout as permanent. This permitted fewer assumptions in estimating inverse-probability-of-attrition (IPA) weights (mentioned in the Attrition Weight Estimation section).

Smoking Assessment

At their baseline interviews, participants were asked: “Do you smoke cigarettes now?” and “Did you ever smoke cigarettes regularly?” We defined never-smokers as participants who responded “no” to both questions. We defined current smokers as participants who responded affirmatively to the first question. We used baseline smoking status as the exposure of interest, although during the course of follow-up, a small fraction of never-smokers (0.7%) reported smoking, and approximately 35% of current smokers reported quitting.

Cognitive Assessment

Participants underwent cognitive assessments at each in-home visit. This assessment consisted of 4 tests of cognitive function: immediate and delayed recall of 12 ideas in the East Boston story, measures of episodic memory34; the oral version of the Symbol Digit Modalities test, which measures perceptual speed by giving the participants 90 seconds to identify as many digit-symbol matches as possible35; and the Mini-Mental State Examination, a measure of global cognition.36 Because the 4 individual measures are highly correlated, we computed a composite measure of global cognition by first converting the raw scores from each test to z scores, using the baseline mean and standard deviation (SD) in the population, and then averaging the z scores.37

Covariate Assessment

We used information on both baseline and time-updated covariates. Baseline covariates included self-reported race and ethnicity (assessed with the US census question on race and ethnicity and categorized as African American or non-Hispanic white) and number of years of completed formal schooling. Time-updated covariates (assessed at baseline and reassessed at each interview) were all self-reported through structured interview and included usual alcohol intake; self-rated health; diagnosis of heart attack, stroke, diabetes, and high blood pressure; physical disability measured by the Nagi score38 (ability to perform basic upper and lower extremity functions, where a lower score indicates greater disability); and a composite measure of social networks, where a higher score indicates a more highly populated network.39

Analytic Approach

Attrition Weight Estimation

To account for potentially informative attrition in our analyses, we estimated weights to apply to each observation in models of smoking and cognitive decline. For each wave of visits contributing to our analysis, the weights were based on the inverse of the wave-specific probability of being observed at that wave, and thus of being alive and uncensored at that wave. The intuition behind these weights is that respondents with characteristics similar to the observations missing due to attrition are up-weighted in the analyses of smoking and cognitive decline, so as to represent their original contribution as well as their missing contributions. Because determinants of death may differ from determinants of study dropout for other reasons, we separately modeled attrition due to death and attrition by other causes.

For each of the 2 sources of attrition, we first developed separate models of not being censored during the course of follow-up.32 For each planned assessment, let Cikr indicate whether person i is no longer in the study by wave k for reason r, where r is either death (r = 1) or loss to follow-up (r = 2). Each weight represents the reciprocal of individual i's probability of remaining both alive and in the study at wave k. We classified a death as occurring at wave k if the participant died between waves k1 and k, so that for such an individual, Cik1Ci(k−1)1=1.

For each wave of follow-up, we modeled and estimated the probability of being alive in that wave, using pooled logistic regression,31 conditional on remaining alive and uncensored in the previous wave. We separately modeled the probability that such a living and previously uncensored participant remained uncensored. To specify the models, we defined a set of variables L, some of which varied over time, we thought likely to influence death or censoring and also affect cognitive function: age, race (African American vs. white), sex (male vs. female), education (0–8 years, 9–12 years [referent], 13–16 years, 17–30 years), alcohol consumption at the previous visit (none [referent], up to 1 drink/d, >1 drink/d), social network score at the previous visit, cognitive activity at the previous visit, disability score at the previous visit, self-rated health at the previous visit (per unit worsening in rating), chronic cardiovascular conditions, diabetes, global cognitive score at the previous visit, and smoking status (current vs. never). We estimated models that included the following as predictors: the baseline time-constant covariates in L, smoking status (Xi), and the most recent prior values of the time-varying covariates (Li(k− 1)), including past measurements of cognitive function. We explored weighting models including additional variables representing the history of the time-varying covariates (eg, L[Combining Macron]i(k− 1) = (Li0, …, Li(k− 1))), but these covariates did not predict censorship or death independently of Li(k− 1), and so were dropped from the model. These models were used together to calculate the cumulative probability of surviving up to a given follow-up wave and of participating in the assessment at that wave. Weights were applied at the level of observations within individuals, such that for each person-wave contribution to our analysis at wave j, the weight was the inverse of the probability of the conjunction of these 2 events. These weights can be obtained by the simple product formula:

Implicit in these models is the Markov assumption that an individual's probability of contributing to the analysis at wave k (and thus of being alive and uncensored at wave k) depends on his or her history of the collection of time-varying covariates (L[Combining Macron]i(k − 1) only through its most recent value Li(k− 1). Such an assumption may be relaxed by incorporating additional lagged covariate values, or a user-specified function of such values (eg, cum(L[Combining Macron]i(k− 1)) = Li0 +…+ Li(k− 1)) as potential predictors in the weight models. To optimize the fit of our attrition models, we explored several functional forms of time, including as a continuous variable and as a set of cycle indicators. We also evaluated several potentially important cross-products, including cognitive score with smoking, as well as time with cognitive score, smoking, and age. We used the same set of covariates in the death and dropout models, selecting the final covariate set (shown in Table 1), as the set that contained variables with modest-to-strong associations with attrition and for which there were minimal missing data.

Table 1:
Baseline Characteristics of the Study Population, and Adjusted Hazard Ratios (HRs) and (95% Confidence Interval) of Attrition Over 5 Study Cycles, Estimated From Models of Continuation

We present model-based 95% confidence intervals (CIs) for the hazard ratios (HRs) relating each covariate to censoring, under the assumption that the pooled logistic regressions correctly model the hazard of continuation in the study given the entire history of covariates.40 We used the Bayesian information criterion as an indicator of global goodness of fit. To describe each model's ability to discriminate those who were censored from those who were not censored, we computed the discordance percentage and the c-statistic. We used the Hosmer-Lemeshow test to describe each model's calibration across a range of observed risks.41,42

From the combination of the 2 cause-specific models, we computed IPA weights according to Equation 1. These are also called nonstabilized weights because, as the reciprocal of a probability, they are guaranteed to be >1 for contributing observations, and may potentially be very large for a person with a small probability of staying alive and uncensored. As a potential remedy, we also computed wave-specific, stabilized IPA weights by multiplying the individual's nonstabilized weight at that wave by the conditional probability of remaining alive and uncensored up to that wave, given a subset of baseline covariates Vi (a subset of Li0) and smoking status. Thus, as the ratio of 2 probabilities, we generally expect this stabilization to reduce the undue influence of a highly variable nonstabilized weight, and therefore to result in confidence intervals that are narrower than those in analyses using nonstabilized, potentially highly variable weights. Under our assumptions, both nonstabilized and stabilized weights give unbiased effect estimates, provided Vi is entered into the regression model relating smoking to cognitive function over time, and thus effect estimates conditional on Vi are reported in both analyses.43 Applying stabilized weights does not adjust for the covariates Vi that were used in the estimation of the numerator of the model. It is instead necessary to include the Vi as regression covariates in the primary analytic model. The stabilized weight for an individual's contribution to wave j is thus given by:

As with the denominators, we obtained estimates of the numerators through pooled logistic regression analysis, in which V consisted of baseline age, sex, race, education, baseline alcohol consumption, and baseline smoking status.

Several assumptions underlie the IPA weight estimation. First, we assume that the conditional probability of remaining alive and in the study in the next wave, given that one has survived and remained uncensored up to the current wave, does not further depend on one's future cognitive function, given past observed covariates and cognitive measurements (the “ignorability” assumption).44 We also assume that for any given wave of the study, and for any possible realization of the covariates, smoking status, and past cognitive function up to the current wave, there is a positive probability that a person with that observed history remains alive and in the study in the next wave, given that he or she is alive and uncensored in the current wave (the standard “positivity” assumption).44

It is noteworthy that if attrition were jointly independent of time-varying correlates of cognitive function, then a standard unweighted GEE analysis would produce valid statistical inferences about the effects of smoking on cognitive function. Remarkably, under the aforementioned assumptions of the attrition process's ignorability and positivity, given the observed time-varying correlates of cognitive function, our analytic approach corrects for selection bias due to attrition, to the extent that the model recovers the effect of smoking on cognitive function (possibly conditional on a subset of baseline variables) one would have obtained using a standard GEE analysis, had attrition been jointly independent of all time-varying predictors of cognitive function (possibly conditional on a subset of variables).

Analyses of Smoking and Cognitive Decline

We evaluated the association between current smoking at baseline and cognitive decline using unweighted and IPA-weighted generalized-estimating-equations (GEE) regression models,40 with working exchangeable correlation matrices, in which we estimated the difference between current and never- smokers in rates of decline in global cognitive score. In all models, we regressed the global score on the set of predictors Vi, by including main effect terms for age, sex, race, education (4 categories, described previously), baseline alcohol intake (3 categories, described previously), smoking status, time (years, continuous), and the cross-products of each covariate with time. These analyses included data from all eligible person-wave contributions from participants who had a baseline cognitive score.

For comparison, we fitted unweighted models as well as models that weighted observations using the 2 sets of IPA weight estimates (nonstabilized weights and stabilized weights). Our primary hypothesis on the relation of smoking to cognitive decline was assessed with the cross product between smoking and time, that is, the estimated difference between current and never-smokers in their rates of cognitive decline. To make the estimates easier to interpret, we multiplied all estimated annual changes and differences in annual change by 10, obtaining estimates of change and differences in change during a period of 10 years. To place these effect estimates in context, we compared them with the average rate of cognitive decline among never-smokers, represented in the main effect term for time, leaving all other covariates at their referent levels. Supposing that the rate of cognitive decline among never-smokers represents “smoking-free cognitive aging,” we then estimated “excess years of cognitive aging” (during a 10-year interval) among current smokers by dividing the difference in 10-year change by the annual rate of change among never-smokers.


Because standard errors from conventional IPA-weighted GEE models can be conservative,40 we generated bootstrap parameter estimates and standard errors.45 Using this approach, we repeated the entire set of analyses—from weight estimation to the estimation of the association of smoking with cognitive decline—on each of 1000 bootstrap samples. A given bootstrap estimate of the difference in rate of cognitive change among current smokers versus never- smokers was the mean of the 1000 datasets in which individuals' observed histories were sampled with replacement from the original dataset. We used the bootstrap standard errors to compute 95% confidence intervals (CIs) for the difference in rate of cognitive change.

Using the bootstrap samples, we formally compared each weighted estimate of the association between smoking and cognitive decline with its unweighted counterpart using a Hausman-type specification test, which tests the null hypothesis that the unweighted estimate is consistent with the weighted estimate (the “consistent” estimator).46,47 We also compared the estimates as rates of cognitive decline among current smokers as a percentage increase over the rate of decline among never-smokers, where the latter rate is the estimate corresponding to time.


Of the 3713 participants who had baseline cognitive assessments and nonmissing data on key covariates, 2634 (71%) remained in the sample at the first follow-up, 1722 (46%) remained at the second follow-up, 1274 (34%) remained at the third follow-up, and 756 (20%) remained at the fourth follow-up. Mortality accounted for most (68%) of the attrition.

The variables included in our final censoring models are listed in Table 1. In these multivariable-adjusted analyses, the current smoker group, relative to never-smokers, experienced substantially increased mortality risk (HR = 1.93; 95% CI = 1.67 to 2.23), but no difference in other-cause attrition (Table 1). By contrast, higher cognitive score was associated with markedly reduced risk of both mortality and other-cause attrition. Other strong predictors of mortality included older age, being male, white race, greater degree of disability at the previous visit, worse self-rated health at the previous visit, and diabetes at baseline. Those with the lowest level of education had reduced mortality, whereas those with the highest level of education were least likely to drop out. The estimates in the predictive model of mortality were markedly different in magnitude from those in the predictive model of other-cause attrition.

Inverse Probability of Attrition Weights

Fit statistics for the weighting models and the distribution of the IPA weights are shown in Table 2. The models of noncensoring generally fit the data well, with good to excellent discrimination between those who died or dropped out and those who continued in the study. The models were also well calibrated in that they generated predicted risks of attrition that generally matched observed risks, although they tended to perform somewhat more poorly at the highest decile of risk (eAppendix, eFigure 2, Weights generated by the model for censoring due to dropout were fairly narrowly distributed, reflecting, in part, that we were unable to identify strong predictors of this type of attrition to the extent that we did for death-related censoring.

Table 2:
Characteristics of Attrition Models and the Weights They Generateda

Smoking and Cognitive Decline

In unweighted analyses, the estimated rate of decline on the cognitive tests among never-smokers was 0.53 points (in standard units) during a period of 10 years (Table 3). Current smokers' estimated rate of decline was 0.11 points worse (95% confidence interval = −0.20 to −0.02), on average, resulting in an average rate of decline of 0.64 points during a period of 10 years. With the application of the nonstabilized IPAWs, this difference in rates of cognitive decline increased by 56% to −0.17 points (−0.31 to −0.02). When we applied the stabilized weights, the difference in rates increased further to −0.20 points during a period of 10 years (−0.36 to −0.04), 86% larger than the unweighted estimate. Estimates derived from the application of stabilized weights were slightly more efficient—indicated by the standard error as a fraction of the effect estimate—than those derived from the application of nonstabilized weights. Although the weighted difference estimates were substantially larger than the unweighted estimates, the CIs overlapped substantially, and the Hausman tests did not indicate that the unweighted estimate was significantly different from either the estimates from the analyses using nonstabilized (P = 0.3) or stabilized weights (P = 0.2).

Table 3:
Unweighted and Weighted Multivariable-adjusteda Difference Between Current and Never-smokers in Cognitive Change Over 10 Years

Notably, weighted analyses also yielded larger estimates of the average rate of change in cognitive score among never-smokers, probably because time in the study also predicted attrition. Nonetheless, the correction for attrition had a slightly greater impact on the estimates for current smokers. For example, considering the reference group (75-year-old women with 9–12 years of education and no alcohol use), the unweighted results suggested that smokers' decline during a period of 10 years would be as severe as the decline experienced by a nonsmoker during a period of 12.1 years. The weighted results suggested that, during a period of 10 years, the smoker would decline as much as a nonsmoker would decline during a period of 12.5 years.


In this well-characterized longitudinal study of aging, both smoking and cognitive function were strong predictors of attrition after enrollment. Current smoking was associated with substantially faster rates of cognitive decline in all analyses. The application of IPAWs to these analyses to account for differential attrition patterns yielded estimates that were 56% to 86% larger than unweighted estimates. In weighted analyses, estimates of cognitive decline among never-smokers increased as well, yet weighting had an even larger impact on estimates of decline among smokers. The Hausman test did not reject at conventional thresholds of statistical significance, indicating we cannot rule out the possibility that the difference in estimates is due to chance variation. Even so, the difference in point estimates of the magnitude observed may be considered substantively important. Imprecision in either the weighted or unweighted estimator reduce the statistical power of the Hausman test, so such tests may have insufficient power to reject the null even when point estimates differ substantially.

Differential selection processes distort the joint distribution of smoking and cognitive decline in a study population if cognitive change also influences selection, or if there are other uncontrolled factors that influence both selection and cognitive change. For example, given the lethality of cigarette smoking, smokers who survive may have other beneficial characteristics (eg, genetic background) that protect them from cognitive decline (eAppendix, eFigure 3, This selection induces an association between the risk factor and cognitive decline even if there is no true effect. Analyses of the risk factor and cognitive decline will be biased, often toward—and possibly beyond—the null. Our findings may offer some insight into the previously mentioned mixed findings in earlier longitudinal studies of smoking and cognitive decline.16,2026 In light of our findings, it seems plausible that many of the earlier studies may have underestimated the adverse relation of smoking to cognitive decline. It bears mention that the use of more sophisticated measures of cognitive aging—such as more elaborate cognitive testing batteries, imaging, and clinical diagnoses–will not resolve biases stemming from differential attrition.

Our study has some limitations. Although we took a detailed approach toward addressing death and dropout after study enrollment, we did not address attrition prior to enrollment (also known as left truncation). In a study that begins when participants have already reached advanced age, mortality and debilitating morbidity related to smoking and cognitive function have occurred prior to study enrollment, leaving a study population that is already differentially selected. This possibility was highlighted by work complementary to ours, a meta-analysis of cohort studies of smoking and dementia that found relative risks tended to be smaller—sometimes <1.0—in studies whose participants were enrolled at older ages.48 Left truncation processes that generated our study's population of age-eligible participants may have resulted in conservative estimates of smoking's association with cognitive decline, even from the IPA-weighted analyses. Indeed, in unweighted analyses stratified by age, we observed associations between smoking and cognitive decline that diminished in the oldest age group (eAppendix, eTable 1,

Many people who smoked at baseline quit during follow-up, so some people we classified as “smokers” were in fact “former smokers” at the time of later cognitive assessments. If cessation is more likely with poor cognition, then, in general, the use of baseline smoking status remediates this source of bias. The same tools of inverse probability weighting used here to handle attrition could in principal also be applied to account for time-varying smoking status.31,32 This extension requires a separate model for the determinants of quitting (or initiating) smoking. Our study also did not evaluate the effect of former smoking at baseline on cognitive decline, because such an evaluation entails far more complex methodological considerations around factors influencing cessation, including those that may be causal intermediates. Our IPA-weighted results were premised on the assumption that, conditional on the covariates, people who remained in the study and those who did not were “exchangeable” with respect to cognitive outcomes, and, further, that the censoring models were correctly specified. Although the first assumption is not empirically testable, goodness-of-fit tests indicated that our models fit the data adequately. In fact, when we used IPAWs based on attrition models composed of other variables (eg, time instead of cycle, history of hypertension, cognitive activity, previous cognitive change), we obtained IPA-weighted estimates that were consistent with those reported here (examples in eAppendix, eTable 2,

Our analysis also has several important strengths. The underlying study has complete data on many variables for most participants, which permitted us to explore a wide range of predictors for our censoring models, and ultimately, to develop models that included many strong censoring predictors. In particular, the censoring models allowed us to consider important information on variables that we would otherwise avoid including in models of smoking and cognitive decline, specifically, previous cognitive function and potential intermediate factors such as self-rated health and disability. Finally, this analysis represents one of the first applications of inverse probability weighting to analyses of risk factors for cognitive decline and has demonstrated that accounting for differential attrition may unveil associations that are larger than those obtained from unweighted analyses, particularly when the risk factor of interest is strongly related to mortality.

Differential selection is likely to influence findings on other risk factors for cognitive aging, and, more generally, other aging-related outcomes. Risk factors for mortality such as smoking, elevated blood pressure, hypercholesterolemia, diabetes, and socioeconomic position have often been observed to have diminished impact in “older older” adults as compared with “younger older” adults.4951 This pattern appears in findings on risk factors for dementia, too, whereby a factor that predicts dementia risk among younger older cohorts more weakly predicts dementia risk—or fails to predict dementia risk at all—among the “oldest old” adults.48,5256 Although some age-dependent patterns have a hypothesized biologic basis,5658 determining which patterns are related to selection (and to what degree) has critical implications for extrapolating study findings to clinical practice and health policy. This study makes an advance in that direction by addressing the influence of differential attrition on the estimated relation between smoking and cognitive decline.


1. Linn RT, Wolf PA, Bachman DL, et al.. The “preclinical phase” of probable Alzheimer's disease. A 13-year prospective study of the Framingham cohort. Arch Neurol. 1995;52:485–490.
2. Bennett DA, Wilson RS, Schneider JA, et al.. Natural history of mild cognitive impairment in older persons. Neurology. 2002;59:198–205.
3. National Institutes of Health State-of-the-Science Conference Statement. Preventing Alzheimer's Disease and Cognitive Decline. Bethesda, MD: National Institute on Aging and Office of Medical Applications Research, of the National Institutes of Health; 2010.
4. Glymour MM, Greenland S. Chapter 12: Causal diagrams. In: Rothman KJ, Greenland S, Lash TL. eds Modern Epidemiology. 3rd ed. New York: Wolters Kluwer; 2008:183–209.
5. Hernán MA, Hernández-Diaz S, Robins JM. A structural approach to selection bias. Epidemiology. 2004;15:615–625.
6. Chodosh J, Seeman TE, Keeler E, et al.. Cognitive decline in high-functioning older persons is associated with an increased risk of hospitalization. J Am Geriat. 2004;52:1456–1462.
7. Welmerink DB, Longstreth WT Jr, Lyles MF, Fitzpatrick AL. Cognition and the risk of hospitalization for serious falls in the elderly: results from the Cardiovascular Health Study. J Gerontol A Biol Sci Med Sci. 2010.
8. Greiner PA, Snowdon DA, Schmitt FA. The loss of independence in activities of daily living: the role of low normal cognitive function in elderly nuns. Am J Public. 1996;86:62–66.
9. Raji MA, Al Snih S, Ray LA, Patel KV, Markides KS. Cognitive status and incident disability in older Mexican Americans: findings from the Hispanic established population for the epidemiological study of the elderly. Ethn Dis. 2004;14:26–31.
10. Yaffe K, Lindquist K, Vittinghoff E, et al.. The effect of maintaining cognition on risk of disability and death. J Am Geriatr Soc. 2010.
11. Bassuk SS, Wypij D, Berkman LF. Cognitive impairment and mortality in the community-dwelling elderly. Am J Epidemiol. 2000;151:676–688.
12. Euser SM, Schram MT, Hofman A, Westendorp RG, Breteler MM. Measuring cognitive function with age: the influence of selection by health and survival. Epidemiology. 2008;19:440–447.
13. Chatfield MD, Brayne CE, Matthews FE. A systematic literature review of attrition between waves in longitudinal studies in the elderly shows a consistent pattern of dropout between differing studies. J Clin Epid. 2005;58:13–19.
14. Matthews FE, Chatfield M, Brayne C. An investigation of whether factors associated with short-term attrition change or persist over ten years: data from the Medical Research Council Cognitive Function and Ageing Study (MRC CFAS). BMC Public Health. 2006;6:185.
15. Glymour MM, Weuve J, Chen JT. Methodological challenges in causal research on racial and ethnic patterns of cognitive trajectories: measurement, selection, and bias. Neuropsychol Rev. 2008;18:194–213.
16. Anstey KJ, von Sanden C, Salim A, O'Kearney R. Smoking as a risk factor for dementia and cognitive decline: a meta-analysis of prospective studies. Am J Epidemi. 2007;166:367–378.
17. Brayne C. Smoking and the brain. BMJ. 2000;320:1087–1088.
18. Sacco KA, Bannon KL, George TP. Nicotinic receptor mechanisms and cognition in normal states and neuropsychiatric disorders. J Psychopha. 2004;18:457–474.
19. Sabia S, Marmot M, Dufouil C, Singh-Manoux A. Smoking history and cognitive function in middle age from the Whitehall II study. Arch Intern Med. 2008;168:1165–1173.
20. Peters R, Poulter R, Warner J, Beckett N, Burch L, Bulpitt C. Smoking, dementia and cognitive decline in the elderly, a systematic review. BMC Geriatr. 2008;8:36.
21. Herbert LE, Scherr PA, Beckett LA, et al.. Relation of smoking and low-to-moderate alcohol consumption to change in cognitive function: a longitudinal study in a defined community of older persons. Am J Epidemi. 1993;137:881–891.
22. Knopman D, Boland LL, Mosley T, et al.. Cardiovascular risk factors and cognitive decline in middle-aged adults. Neurology. 2001;56:42–48.
23. Nooyens AC, van Gelder BM, Verschuren WM. Smoking and cognitive decline among middle-aged men and women: the Doetinchem Cohort Study. Am J Public. 2008;98:2244–2250.
24. Peters R, Beckett N, Geneva M, et al.. Sociodemographic and lifestyle risk factors for incident dementia and cognitive decline in the HYVET. Age Ageing. 2009;38:521–527.
25. Richards M, Jarvis MJ, Thompson N, Wadsworth ME. Cigarette smoking and cognitive decline in midlife: evidence from a prospective birth cohort study. Am J Public. 2003;93:994–998.
26. Yaffe K, Fiocco AJ, Lindquist K, et al.. Predictors of maintaining cognitive function in older adults: the Health ABC study. Neurology. 2009;72:2029–2035.
27. Launer LJ, Feskens EJ, Kalmijn S, Kromhout D. Smoking, drinking, and thinking. The Zutphen Elderly Study. Am J Epidemi. 1996;143:219–227.
28. Doll R, Peto R, Boreham J, Sutherland I. Mortality in relation to smoking: 50 years' observations on male British doctors. BMJ.2004;328:1519.
29. Greenland S. Chapter 19: Basic methods for sensitivity analysis and external adjustment. In: Rothman KJ, Greenland S eds. Modern Epidemiology. 2nd ed. Philadelphia: Lippincott-Raven; 1998:343–357.
30. Glymour MM, Weuve J, Berkman LF, Kawachi I, Robins JM. When is baseline adjustment useful in analyses of change? An example with education and cognitive change. Am J Epidemi. 2005;162:267–278.
31. Hernán MA, Brumback B, Robins JM. Marginal structural models to estimate the causal effect of Zidovudine on the survival of HIV-positive men. Epidemiology. 2000;11:561–570.
32. Cole SR, Hernán MA, Margolick JB, Cohen MH, Robins JM. Marginal structural models for estimating the effect of highly active antiretroviral therapy initiation on CD4 cell count. Am J Epidemi. 2005;162:471–478.
33. Bienias JL, Beckett LA, Bennett DA, Wilson RS, Evans DA. Design of the Chicago Health and Aging Project (CHAP). J Alzheimers Dis. 2003;5:349–355.
34. Albert M, Smith LA, Scherr PA, Taylor JO, Evans DA, Funkenstein HH. Use of brief cognitive tests to identify individuals in the community with clinically diagnosed Alzheimer's disease. Int J Neurosci. 1991;57:167–178.
35. Smith A. Symbol Digit Modalities Test Manual—Revised. Los Angeles, CA: Western Psychological Services; 1982.
36. Folstein MF, Folstein SE, McHugh PR. Mini-Mental State: a practical method for grading the state of patients for the clinician. J Psychiatr. 1975;12:189–198.
37. Wilson RS, Bennett DA, Bienias JL, Mendes de Leon CF, Morris MC, Evans DA. Cognitive activity and cognitive decline in a biracial community population. Neurology. 2003;61:812–816.
38. Nagi SZ. An epidemiology of disability among adults in the United States. Milbank Mem Fund Q Health Soc. 1976;54:439–467.
39. Barnes LL, Mendes de Leon CF, Bienias JL, Evans DA. A longitudinal study of black-white differences in social resources. J Gerontol B Psychol Sci Soc Sci. 2004;59:S146–S153.
40. Robins JM, Rotnitzky A. Semiparametric efficiency in multivariate regression models with missing data. J Am Stat A. 1995;90:122–129.
41. Hosmer DW, Lemeshow S. Chapter 5: assessing the fit of the model. Applied Logistic Regression. New York: John Wiley & Sons; 1989;135–175.
42. Cook NR. Use and misuse of the receiver operating characteristic curve in risk prediction. Circulation. 2007;115:928–935.
43. Cole SR, Hernán MA. Constructing inverse probability weights for marginal structural models. Am J Epidemi. 2008;168:656–664.
44. Rotnitzky A, Robins JM. Semiparametric regression estimation in the presence of dependent censoring. Biometrika. 1995;82:805–820.
45. Efron B, Tibshirani RJ. An Introduction to the Bootstrap. Boca Raton, FL: Chapman & Hall/CRC; 1993.
46. Hausman JA. Specification tests in econometrics. Econometrica. 1978;46:1251–1271.
47. Newey WK. Generalized method of moments specification testing. J Econom. 1985;29:229–256.
48. Hernán MA, Alonso A, Logroscino G. Cigarette smoking and dementia: potential selection bias in the elderly. Epidemiology. 2008;19:448–450.
49. Marang-van de Mheen PJ, Shipley MJ, Witteman JC, Marmot MG, Gunning-Schepers LJ. Decline of the relative risk of death associated with low employment grade at older age: the impact of age related differences in smoking, blood pressure and plasma cholesterol. J Epidemiol Community Health. 2001;55:24–28.
50. Nybo H, Petersen HC, Gaist D, et al.. Predictors of mortality in 2,249 nonagenarians—the Danish 1905-Cohort Survey. J Am Geriat. 2003;51:1365–1373.
51. Lewington S, Clarke R, Qizilbash N, Peto R, Collins R. Age-specific relevance of usual blood pressure to vascular mortality: a meta-analysis of individual data for one million adults in 61 prospective studies. Lancet. 2002;360:1903–1913.
52. Kennelly SP, Lawlor BA, Kenny RA. Blood pressure and the risk for dementia: a double edged sword. Ageing Res Rev. 2009;8:61–70.
53. Letenneur L, Gilleron V, Commenges D, Helmer C, Orgogozo JM, Dartigues JF. Are sex and educational level independent predictors of dementia and Alzheimer's disease? Incidence data from the PAQUID project. J Neurol Ne. 1999;66:177–183.
54. Luchsinger JA, Patel B, Tang MX, Schupf N, Mayeux R. Measures of adiposity and dementia risk in elderly persons. Arch Neurol. 2007;64:392–398.
55. Rocca WA, Bower JH, Maraganore DM, et al.. Increased risk of cognitive impairment or dementia in women who underwent oophorectomy before menopause. Neurology. 2007;69:1074–1083.
56. Li G, Rhew IC, Shofer JB, et al.. Age-varying association between blood pressure and risk of dementia in those aged 65 and older: a community-based prospective cohort study. J Am Geriat. 2007;55:1161–1167.
57. Rocca WA, Grossardt BR, Shuster LT. Oophorectomy, menopause, estrogen, and cognitive aging: the timing hypothesis. Neurodegener Dis. 2010;7:163–166.
58. Fitzpatrick AL, Kuller LH, Lopez OL, et al.. Midlife and late-life obesity and the risk of dementia: cardiovascular health study. Arch Neurol. 2009;66:336–342.

Supplemental Digital Content

Copyright © 2012 Wolters Kluwer Health, Inc. All rights reserved.