Obesity, which is characterized by an excess accumulation of body fat, is a risk factor for many chronic diseases (24,25,27,33). Anthropometric variables are commonly used in clinical and field settings to estimate laboratory-measured body composition (9,10,14,19,21,23). The generalized equations of Durnin and Wormsley (10), Jackson and Pollock (19), and Jackson et al. (21) were designed to estimate two-component percent fat (BF%) from combinations of skinfolds. The generalized skinfold equations were developed using cross-sectional samples of men and women who varied in age and BF%. These popular generalized equations have been cited more than 4300 times in the scientific literature.
Recent cross-validation research (2,17) showed that the Jackson-Pollock generalized equations were highly correlated with dual-energy x-ray absorptiometry (DXA; women r = 0.85, men r ≥ 0.93) but lacked accuracy, systematically underestimating DXA-BF%. In addition, body mass index (BMI) estimates of BF% determined using a variety of laboratory-based measures have been shown to be biased when applied to racially/ethnically diverse populations (4-9,11,12,16,23). Specifically, for the same BMI, the BF% of African-American (AA) men and women is overestimated, whereas the BF% of Hispanics and Asians is underestimated, relative to non-Hispanic white (NHW) men and women. Jackson et al. (17) showed that this race/ethnicity bias extended to skinfold BF% estimates. Generalized skinfold equations overestimate the BF% of AA men and underestimate BF% of Hispanic men and women. Two major limitations of generalized equations (10,19,21) are that they were developed with samples of nearly all NHW men and women and using two-component BF% as the referent standard. The contemporary US adult population has become more obese and racially/ethnically diverse (29), and multicomponent models have replaced two-component BF% as the referent standard (15).
BF% prediction models using anthropometric measures have used cross-sectional data to develop multiple regression equations to estimate laboratory-measured BF% from combinations of anthropometric variables (2,10,19,21). These equations have been shown to provide accurate estimates of BF% in cross-sectional samples of largely NHW men and women. What have not been developed to date are prediction models that can be applied to infer changes in BF% over time for diverse young men and women. The purpose of this study was to develop accurate anthropometric models to estimate DXA-BF% that generalize to diverse young adults in cross-sectional and longitudinal field settings. Anthropometric prediction models were developed using the sum of three skinfold measurements (∑3S) and BMI obtained on NHW, AA, and Hispanic adults who ranged in age from 17 to 35 yr. Because of the well-documented sex differences in body composition, separate models were developed for men and women.
Subjects were recruited from the Training Intervention and Genetics of Exercise Response (TIGER) study. Subjects from the TIGER study were students enrolled at the University of Houston (Houston, TX); the study protocol was approved by the university's Committee for the Protection of Human Subjects, and subjects volunteered to participate by signing informed consent. The target population consisted of sedentary women and men younger than 35 yr who exercised <30 min·wk−1 for the previous 6 months and who were not actively limiting caloric intake by dietary modification. Subjects were excluded if they had a known physical or physiological contraindication to aerobic exercise or a diagnosed metabolic disorder that may alter body composition.
The TIGER study subjects engaged in 30 wk (two consecutive semesters) of exercise training, 3 d·wk−1 for 30 min·d−1 at 65%-85% of HR-defined V˙O2max (22). The data came from five yearly cohorts. For this analysis, the subject sample consisted of 705 women and 428 men, aged 17-35 yr, and was limited to NHW, AA, and Hispanics. Although the TIGER study included young men and women who self-identified as Asian or Asian-Indian, the sample sizes for these groups were deemed too small to include in the development of the prediction equations. Each subject had one to three measurement visits during the 9-month study duration. The measurement visits were at baseline, after 15 wk, and at the end of the 30-wk exercise program. The race/ethnicity distribution of the subject sample was as follows: 423 NHW (37%), 324 Hispanic (29%), and 386 AA (34%); race/ethnicity was self-selected by the subjects.
Height was determined with a stadiometer (Road Rod; SECA, Hanover, MD), and weight was measured with a digital scale (Model 770; SECA). Subjects self-reported their birth date and sex and selected one category from a list of race/ethnicity groups using a standard, coded demographic form. Skinfold thicknesses at the triceps, iliac crest, and thigh for the women and at the chest, abdomen, and thigh for the men were measured using Lange (Beta Technology, Inc., Santa Cruz, CA) or Lafayette (Lafayette Instruments, Inc., Lafayette, LA) skinfold calipers by trained technicians following standardized procedures (3,20). The sex-specific sum of the three skinfold measurements (∑3S) was from the same sites used to develop the generalized equations of Jackson and Pollock (19) and Jackson et al. (21). Previous research (18) showed that the three skinfolds measured the same common source of body fat. Each year, skinfold measurements were obtained by three to five different individuals. Before collecting skinfold data, all testers were trained by one of the authors (A.S.J.). For quality control, reliability analyses were conducted. Each year, the intraclass (between-testers) reliability estimates were ≥0.95. This is further supported by our previously published TIGER study data (17). The correlation between skinfold estimated percent fat and DXA percent fat for more than 1100 subjects was 0.91, among the highest reported in the literature.
DXA was used to measure BF%. Whole-body DXA scans were completed on a Hologic Delphi-A unit (adult whole body software v.11.2, Bedford, MA) and a Hologic Discovery W instrument (adult whole body software QDR v.12.3). The same trained technicians administered the DXA scans throughout the study. The instruments were calibrated daily with a spine standard and weekly with a step calibrator as described by the manufacturer. The manufacturer's recommended procedures and software were used to calculate whole-body fat mass, lean mass, and bone mineral mass. Total DXA weight was computed by summing the DXA component measures and used to compute DXA-BF%. As recommended by Lohman and Chen (28), total DXA weight and scale-measured body weight were compared with linear regression. The R2 between measured and DXA weight was >0.99. The slope of the measured-versus-DXA weight regression line of 1.01 (95% confidence interval (CI) = 1.00-1.02) and the intercept of −0.08 (95% CI = −0.37 to 0.20) were within chance variation of 1.0 and 0, respectively. The SEE using DXA to estimate scale-measured weight was 1.5 kg. The test-retest measurement error of DXA-BF% in our sample was 1.3% for men and 1.2% for women, similar to those of previous reports (13).
The DXA data for the first two cohorts were only measured at baseline and 30-wk time points. DXA data were available for all three test visits for cohorts 3-5 if the subject completed the exercise program. All 1133 subjects had baseline data, 457 subjects (40%) had repeat tests at 15 wk, and 247 subjects (22%) had repeat tests at 30 wk. The total number of observations used for data analysis was 1837.
Maximum-likelihood linear mixed models regression with random intercepts were used to estimate the prediction equations (30) using the Stata xtmixed program (Stata 10.1; StataCorp, College Station, TX). Because of the well-documented sex differences in the relation of anthropometric measures to body composition (19,21,23), the analyses were stratified by sex. The dependent variable was DXA-BF%. The independent variables were the ∑3S, BMI, and race/ethnic group membership, which comprised the fixed part of the model. Race/ethnic group was coded as a categorical variable using NHW as the referent group. A top-down strategy (34) was used to build the field models. The relationship between the ∑3S and two-component BF% is known to be nonlinear (19,21), so both the linear (∑3S) and quadratic (∑3S2) skinfold terms were examined.
The starting model (full model) included all independent variables (∑3S, BMI, and race/ethnic group). Restricted models were then developed using only ∑3S and race/ethnic group. Random intercepts were included to accommodate the multiple measurements over time within subjects and comprised the random part of the model. A modified Bland-Altman (1) plot of the residuals was examined to evaluate the fit for both the fixed and random models. Regression coefficients were tested for statistical significance with a z-statistic (30). The accuracy of the field models was defined by the SEE for the fixed model, which was computed as the square root of the sum of the variances for the random intercepts and residuals (30,34).
Table 1 summarizes the sample's characteristics contrasted by sex and race/ethnicity. The mean ∑3S of the women did not differ by race/ethnic group (P = 0.523). Mean weight of AA women was significantly higher than that of NHW and Hispanic women (P = 0.001). The mean BMI of AA women was significantly higher than that of NHW women (P = 0.001). The DXA-BF% of Hispanic women was significantly higher than that of NHW and AA women (P < 0.001). Weight (P = 0.404) and BMI (P = 0.583) of the men did not differ significantly among the race/ethnic groups, whereas the ∑3S (P = 0.001) and DXA-BF% (P < 0.001) of AA men was significantly lower than those of NHW and Hispanic men. For men and women, height was significantly (P < 0.001) lower in Hispanics compared with the other two race/ethnic groups.
Table 2 shows the women and men's linear mixed model analysis for the full model, including the regression coefficient estimates (EST), SE of the coefficients, and P values for tests of statistical significance. The linear and quadratic skinfold regression coefficients for both men and women were statistically significant, confirming that the relation between skinfold fat and DXA-BF% was nonlinear. The coefficient for BMI for both men and women was significant (P < 0.001). In women, the coefficients for the AA and Hispanic race/ethnic groups were both significantly different from NHW. Controlling for skinfold fat and BMI, DXA-BF% was 1.34% (95% CI = −1.97% to −0.71%) lower for AA women and 1.89% (95% CI = 1.20%-2.57%) higher for Hispanic women relative to NHW women. In men, only the coefficient for the AA ethnic group was significantly different from zero. In men, after controlling for skinfold fat and BMI, the DXA-BF% for AA was 2.40% (95% CI = −3.15% to −1.66%) lower than that for NHW. Validation analyses using a jackknife ("leave-one-out") estimation, similar to the predicted residual sum of squares technique, resulted in the 95% CI for the SEE for the fixed model increasing only slightly from 3.40%-3.88% to 3.36%-3.93% for the women and from 2.81%-3.42% to 2.70%-3.54% for the men. This clinically inconsequential difference supports the validity of the equations in these populations.
Figure 1 shows the modified Bland-Altman bivariate plots of the residuals by estimated DXA-BF% with the residuals of the random model superimposed on those of the fixed model. A graphic analysis of these residuals showed that they were normally distributed (not shown). The difference in variability between the fixed and random residuals is due to the variance of the random intercept part of the models, which includes variation unique to the individual but not due to ∑3S, BMI, or race/ethnic group membership. This variance component is included in the fixed residuals but not in the random model residuals. The SEE for the random intercepts model of 1.2% (95% CI = 1.1%-1.3%) for women and 1.4% (95% CI = 1.2%-1.5%) for men were approximately equal to the test-retest precision of DXA-BF% estimation alone.
Table 3 provides the sex- and race/ethnic group-specific linear mixed model regression equations that include the ∑3S, BMI, and significant race/ethnic group terms as well as models that include just ∑3S and race/ethnic group effects. The significant race/ethnic group was accounted for by adjusting the NHW constant by the significant race/ethnic group regression coefficient. These data show that, although the regression coefficient for BMI was significant, it does not improve the accuracy for men, whereas it does for women. The accuracy of these equations for use in field and clinical settings is defined by the fixed-effects SEE (the square root of the sum of the variances for the random intercepts and residuals) and by the 95% CI of the SEE provided in Table 3.
These generalized field equations can be used to estimate accurately DXA-BF% of diverse young men and women in both cross-sectional and longitudinal field settings across the wide range of DXA-BF% (6%-53%) observed in this investigation. The finding of the nonlinear relationship between ∑3S and DXA-BF% is consistent with the results reported with generalized equation studies using cross-sectional data (2,10,19,21). A major difference found with these men and women from the TIGER study was the race/ethnic group bias. Compared with NHW individuals, the DXA-BF% of Hispanic women was underestimated by 1.9%, and AA women and men were overestimated by about 1.3% and 2.4%, respectively, as indicated by the regression coefficients for the respective ethnic groups in Table 2. The race/ethnic group bias is well documented for several BMI prediction models (4-9,11,16,23), and these results confirm that the race/ethnic group bias extends to skinfold fat prediction models.
The final fixed model SEE estimates of 3.64% (95% CI = 3.41%-3.89%) for women and 3.12% (95% CI = 2.82%-3.44%) for men were slightly lower but within the sampling error of those reported in original cross-sectional study (women = 3.9%, men = 3.4%) of Jackson and Pollock (19) and Jackson et al. (21). The error estimates for these new equations are higher than the SEE of 2.2% reported for men by Ball et al. (2). The equation of Ball et al. used the quadratic sum of seven skinfolds to estimate DXA-BF% with cross-sectional data and did not examine race/ethnic group bias. Gallagher et al. (12) reported a correlation of 0.95 between four-component BF% and DXA-BF% using a cross-sectional sample of 1626 NHW, AA, and Asian men and women. The SEE of the cross-sectional regression model of Gallagher et al. was 3.2%, and the four-component data provided an accurate fit of DXA-BF% (intercept = −1.7, slope = 1.06). The 95% CI of the women's field ∑3S model (3.62%-4.36%) in Table 3 confirmed that the women's equation was slightly less accurate than the laboratory four-component-measured BF%, whereas the men's ∑3S equation (95% CI = 2.77%-3.40%) was within chance variation. It is also important to note that this is the first body composition validation study that reports the 95% CI of the SE. The random errors of 1.2% and 1.4% in our model-the errors after accounting for subject-to-subject variability-are extremely low. These random error estimates are only slightly higher than the expected DXA-BF% test-retest error estimates. The low random estimates and the low fixed SEE estimates compared with the SEE of Gallagher et al. for the four-component BF% demonstrate that skinfold fat and BMI of women and men can be reliably measured. Future studies using independent samples need to cross-validate the accuracy of our models' estimates.
Hispanic and AA individuals are the two prominent minority groups in the United States. A strength of this study is that these racial/ethnic groups, along with NHW, were each represented with large samples that were all similar in size. Research has shown that, compared with NHW individuals, BMI underestimates the BF% of Asian and Asian-Indian men and women (4-8,16). To examine the potential bias of the equations, we applied the equations (Table 3) to the data of the self-selected 107 Asian and 57 Asian-Indian women and men from the TIGER study, none of whom provided data used to develop the equations. Provided in Table 4 are the total number of observations and means and SD of the residuals (i.e., measured − estimated DXA-BF%). All race-specific equations underestimated the measured DXA-BF% of Asian-Indian women and men, with substantial mean errors ranging from 2.2% to nearly 7%. In contrast, the equations of NHW and Hispanic men accurately predicted DXA-BF% for Asian men. The mean differences were near 0, and the error estimates of 2.80% and 2.91% were within the equations' 95% SEE CI. The Hispanic ∑3S and BMI equation provided a reasonable fit for Asian women. The mean error was just 0.07%, and the error estimate of 3.44% was within the 95% SEE CI of the women's full model. These results suggest that these equations can be accurately applied to Asian women and men but not to Asian-Indian young adults. These Asian-Indian cross-validation results are consistent with BMI research (6,9,16) showing that, compared with NHW, the BF% of Asian-Indian men and women is substantially underestimated from BMI. Asian-Indians need to be studied more closely in future investigations.
The cross-sectional data used to develop other generalized equations (2,10,19,21) limit statistical inference to just one measurement occasion or point in time; the lack of ethnic diversity in those data also limit inference to NHW men and women. A strength of this study was the inclusion of multiple measures of individuals involved in a longitudinal study of exercise. A major advantage of mixed effects regression modeling is that it accommodates unbalanced, unevenly spaced observations, making it an ideal tool for analyzing repeated-measures data when subjects do not have the same number of observations or observations that recur at different intervals (30,32,34). Accounting for the repeat tests in the random part of the model allows for the valid generalization of the error estimates over multiple measurement occasions. These equations provide prediction models that estimate BF% at one point in time and have the capacity to evaluate changes over time within individuals. These equations have the same limitations as all such equations that use anthropometric measures in that estimates of body fat may be inaccurate in individuals who have atypical body composition, such as weight lifters or persons who are nonambulatory.
A limitation of this study is that these subjects from the TIGER study ranged in age from 17 to 35 yr. The age range of the samples used to develop previous generalized prediction equations was from 17 to >60 yr. Age was included as an independent variable in these previous studies to account for the variation in age, and the age effect was statistically significant (2,19,21). These cross-sectional studies reported that the age effect ranged from 0.07%·yr−1 to 0.12%·yr−1. A stronger aging effect, 0.14%·yr−1 to 0.23%·yr−1, has been reported for equations estimating BF% from BMI (23). In our preliminary analyses, we examined the influence of age with the subjects from the TIGER study, found it was not statistically significant for either men (P = 0.328) or women (P = 0.752), and so did not include age in the final equations. These results suggest that these equations will likely underestimate DXA-BF% if applied to individuals older than 35 yr. Because the criterion for these models was DXA-BF% and not body density, the effect of aging needs to be examined with cross-validation of these equations with older men and women. NHANES data (29) suggest that young adults are an important group to consider. Ogden et al. (29) reported that 31.1% of men and 33.2% of women of all ages were obese (BMI ≥ 30 kg·m−2), and the origin of the obesity is linked to young adulthood. The NHANES data showed that 28.0% of men and 28.9% of women, aged 20-39 yr, were obese. Although these subjects from the TIGER study are not a random sample of the US population, a comparison with the NHANES BMI data shows that the subjects from the TIGER study are reasonably representative of the US population of young adults. The NHANES 2003-2004 data (29) showed that 51.7% (95% CI = 46.5%-56.9%) of women and 62.2% (95% CI = 58.0%-66.4%) of men aged 20-39 yr were overweight or obese (BMI ≥ 25 kg·m−2). The prevalence of a BMI ≥ 25 kg·m−2 was 56% and 59% in these TIGER women and men, respectively.
Self-selection of race/ethnic group membership was another study limitation. Self-report does not allow for determination of whether the observed effects on the prediction of BF% using anthropometric measures were due to race, which is considered primarily a biological factor, or ethnicity, which is influenced by culture and heritage (26). Despite these limitations, self-reported race/ethnicity was a significant predictor of BF% in our study after controlling for ∑3S and BMI; what remains unclear is how much of the variance attributed to this effect was produced by common biological and genetic factors or a result of shared sociocultural values and behaviors for subjects self-identifying in the same race/ethnic group. The genetic technique of admixture mapping provides an objective method of defining racial heritage, including individuals with multiple racial ancestries. Research has shown that admixture mapping can both localize and fine map a phenotypically important variant (31). Yaeger et al. (35) reported that self-identified race correlated well with ancestry clusters but could not predict individual admixture. Genetic examination of admixture in these TIGER data in our next level of analysis should provide a clearer understanding of the potential source of bias attributed to race/ethnicity in these diverse young men and women.
The generalized equations presented here provide accurate models for estimating DXA-BF% of diverse young women and men. The SE of the fixed model equations are among the lowest published in the literature. The mixed model results documented a race/ethnic group bias for men and women. When using common anthropometric measurements in prediction models, the DXA-BF% is overestimated for AA men and women and underestimated for Hispanic women. This bias is controlled in our new models by using race/ethnic group as an independent variable. A major strength of these prediction equations is that the accuracy can be inferred to diverse young adults in both cross-sectional and longitudinal settings. To our knowledge, these are the first published body composition equations with generalizability to multiple time points.
This work was supported by a grant from the National Institute of Diabetes and Digestive and Kidney Diseases (DK062148) and a grant from the USDA Agricultural Research Services (6250-51000-046).
None of the authors has a conflict of interest to declare.
The results of the present study do not constitute endorsement by the American College of Sports Medicine.
1. Altman DG, Bland JM. Statistical methods for assessing agreement between two methods of clinical measurement. Lancet
2. Ball SD, Altena TS, Swan PD. Comparison of anthropometry to DXA: a new prediction equation for men. Eur J Clin Nutr
3. Baumgartner TA, Jackson AS, Mahar MT, Rowe DA. Measurement for Evaluation in Physical Education and Exercise Science
. 8th ed. New York (NY): McGraw-Hill; 2007. 544 p.
4. Chang CJ, Wu CH, Chang CS, et al. Low body mass index
but high percent body fat
in Taiwanese subjects: implications of obesity cutoffs. Int J Obes
5. Chung S, Song J, Shin H, et al. Korean and Caucasian overweight premenopausal women have different relationship of body mass index
to percent body fat
wit age. J Appl Physiol
6. Deurenberg-Yap M, Schmidt G, van Staveren WA, Deurenberg P. The paradox of low body mass index
and high body fat percentage among Chinese, Malays and Indians in Singapore. Int J Obes
7. Deurenberg P. Invited commentary: universal cut-off BMI points for obesity are not appropriate. Br J Nutr
8. Deurenberg P, Deurenberg-Yap M. Validity of body composition
methods across ethnic population groups. Forum Nutr
9. Deurenberg P, Weststrate JA, Seidell JC. Body mass index
as a measure of body fatness: age- and sex-specific prediction formulas. Br J Nutr
10. Durnin JVGA, Wormsley J. Body fat assessed from total body density and its estimation from skinfold thickness: measurements on 481 men and women aged from 16 to 72 years. Br J Nutr
11. Fernandez JR, Heo M, Heymsfield SV, et al. Is percentage body fat differentially related to body mass index
in Hispanic Americans, African Americans, and European Americans. Am J Clin Nutr
12. Gallagher D, Heymsfield SB, Heo M, Jebb SA, Murgatroyd PR, Sakamoto Y. Healthy percentage body fat ranges: an approach for developing guidelines based on body mass index
. Am J Clin Nutr
13. Haarbo J, Gotfredsen A, Hassager C, Christiansen C. Validation of body composition
by dual energy x-ray absorptiometry (DEXA). Clin Physiol
14. Hannan WJ, Wrate RM, Cowen SJ, Freeman CP. Body mass index
as an estimate of body fat. Int J Eat Disord
15. Heymsfield SB, Lohnman TG, Wang Z, Going SB. Human Body Composition
. 2nd ed. Champaign (IL): Human Kinetics; 2005. p. 17-34.
16. Jackson AS, Ellis KJ, McFarlin BK, Sailors MH, Bray MS. Body mass index
bias in defining obesity of diverse young adults: the Training Intervention and Genetics of Exercise Response (TIGER) study. Br J Nutr
17. Jackson AS, Ellis KJ, McFarlin BK, Sailors MH, Bray MS. Cross-validation of generalised body composition
equations with diverse young men and women: the Training Intervention and Genetics of Exercise Response (TIGER) study. Br J Nutr
18. Jackson AS, Pollock ML. Factor analysis and multivariate scaling of anthropometric variables for the assessment of body composition
. Med Sci Sports Exerc
19. Jackson AS, Pollock ML. Generalized equations for predicting body density of men. Br J Nutr
20. Jackson AS, Pollock ML. A practical approach for assessing body composition
of men, women, and athletes. Phys Sportsmed
21. Jackson AS, Pollock ML, Ward A. Generalized equations for predicting body density of women. Med Sci Sports Exerc
22. Jackson AS, Ross RM. Understanding Exercise for Health and Fitness
. Dubuque (IA): Kendall Hunt; 1997. p. 182.
23. Jackson AS, Stanforth PR, Gagnon J, et al. The effect of sex, age, and race on estimating percent body fat
from BMI: the HERITAGE Family Study. Int J Obes
24. Janssen I, Katzmarzyk PT, Ross R. Body mass index
, waist circumference, and health risk: evidence in support of current National Institutes of Health guidelines. Arch Intern Med
25. Janssen I, Katzmarzyk PT, Ross R. Waist circumference and not body mass index
explains obesity-related health risk. Am J Clin Nutr
26. Kaplan JB, Bennett T. Use of race and ethnicity in biomedical publication. JAMA
27. Lean ME. Pathophysiology of obesity. Proc Nutr Soc
28. Lohman TG, Chen Z. Dual energy x-ray absorptiometry. In: Heymsfield SB, Lohnman TG, Wang Z, Going SB, editors. Human Body Composition
. Champaign (IL): Human Kinetics; 2005. p. 63-78.
29. Ogden CL, Carrol MD, Curtin LR, McDowell MA, Tabak CJ, Flegal KM. Prevalence of overweight and obesity in the United States, 1999-2004. JAMA
30. Rabe-Hesketh S, Skrondal A. Multilevel and Longitudinal Modeling Using Stata
. 2nd ed. College Station (TX): Stata Press; 2008. p. 553.
31. Reich D, Patterson N, Ramesh V, et al. Admixture mapping of an allele affecting interleukin 6 soluble receptor and interleukin 6 levels. Am J Hum Genet
32. Twisk J. Applied Longitudinal Data Analysis for Epidemiology: A Practical Guide
. Cambridge (UK): Cambridge University Press; 2003. p. 320.
33. Welborn TA, Dhaliwal SS. Preferred clinical measures of central obesity for predicting mortality. Eur J Clin Nutr
34. West BT, Welch KB, Galecki AT. Linear Mixed Models
. New York (NY): Chapman & Hall/CRC; 2007. p. 353.
35. Yaeger R, Avila-Bront A, Abdul K, et al. Comparing genetic ancestry and self-described race in African Americans born in the United States and in Africa. Cancer Epidemiol Biomarkers Prev