In the past three decades, HIV sentinel surveillance has been conducted on the basis of unlinked anonymous testing among pregnant women at a selection of antenatal clinics (ANC-UAT). ANCs were selected as sentinel sites for convenience and geographic spread . Since the early 2000s, ANC-UAT data have often been complemented in settings with the highest burden of HIV by prevalence data from nationally representative population-based surveys (NPS) . Both data sources are important inputs for HIV prevalence models in the Estimation and Projection Package (EPP) of the Joint United Nations Programme on AIDS (UNAIDS) supported Spectrum (Avenir Health, Glastonbury, CT) software tool (the software and documentation are online at http://www.avenirhealth.org/software-spectrum and http://www.unaids.org/en/dataanalysis/datatools/spectrumepp) .
In August 2015, the WHO and UNAIDS released new guidelines recommending that countries transition from conducting ANC-UAT to using ANC routine testing (ANC-RT) data [4,5]. In the ANC-RT approach, HIV prevalence among pregnant women attending ANCs is drawn from data routinely reported on a monthly or quarterly basis through existing HIV program monitoring systems. These data typically include the number of women attending their first antenatal visit, the number who are already known to be HIV positive, the number who are tested for HIV, and the number of those who test HIV positive. ANC-RT data have the potential to improve the representativeness of HIV prevalence estimates over ANC-UAT, if coverage of HIV testing services at ANC and the quality and completeness of available data are consistently high. In this setting, prevalence trends are representative of all pregnant women, rather than among those from a nonrepresentative sample of health facilities.
In anticipation of the 2015 recommendations, WHO and UNAIDS published guidelines in 2013 on how to assess and increase the use of ANC-RT data for surveillance purposes . The primary focus of the assessment was to determine the quality of facility-based HIV testing services at ANCs and to provide additional support to improve services where required. The guidelines also recommended reviewing and improving processes for recording and reporting a minimum set of demographic and testing information required to interpret the estimates. Finally, countries were encouraged to explore how transitioning from ANC-UAT to ANC-RT data might change HIV prevalence estimates after the transition period by comparing facility-level and national-level ANC-UAT and ANC-RT data. With this final recommendation, it was recognized that in addition to the usual caveat of using ANC-UAT data to monitor general population trends, specifically, that HIV prevalence trends among pregnant women might reflect changes in the fertility patterns of infected women rather than true population trends [7–10], use of ANC-RT data poses some further considerations:
- The ‘routine’ nature of the data could lead to more errors in recording or reporting, delays or incomplete reporting by some facilities, and variation in the coverage or quality of data ascertainment over time or location.
- HIV status ascertainment among pregnant women does not necessarily entail conducting an HIV test for every woman, but is a combination of routine HIV testing outcomes and some women self-reporting their HIV status (and ideally providing documentation).
- Women can opt out of routine testing, which may create participation bias.
- Inclusion of all pregnant women attending ANCs results in large number of women tested and statistically precise estimates of prevalence, whereas nonsampling error arising from other sources of variability (described above) are likely to be much more important sources of uncertainty.
- Many health facilities provide ANC-RT data, and statistically modeling individual time series from each facility (as with existing ANC-UAT ) is computationally intractable.
In practice, countries have made considerable progress since 2013 in improving the coverage of testing and the availability and quality of ANC-RT data. At the same time, they have encountered challenges in quantifying some of the other possible biases described above; in particular, there is little information about how the inclusion of ANC-RT estimates of HIV prevalence might influence our ability to consistently monitor trends in HIV prevalence among pregnant women. Responding to these opportunities and needs, the main objective of this article is to describe the conceptual considerations and underlying methods used to incorporate ANC-RT data into EPP. As a secondary objective, synthetic ANC-RT data are used to illustrate how the availability of ANC-RT data affects the accuracy of various parameter estimates. In the synthetic data analysis, the availability of ANC-RT data varies in four dimensions that are census-level data vs site-level data, the number of sites, the overlap with ANC-UAT sites, and the number of years of data available.
Within Spectrum, the key element for estimating general population HIV prevalence and incidence trends is the EPP. This package is a set of mathematical equations based on a susceptible-infected epidemiological model with CD4+ progression patterns and ART utilization used to inform survival [12–14]. Model outputs of HIV prevalence are derived for people aged 15–49 years. Previous versions of the EPP software jointly fit the model to ANC-UAT and NPS data in a Bayesian statistical framework .
We propose two approaches to incorporating ANC-RT data. First, a ‘site-level’ approach in which ANC-RT data from the same clinics that historically participated in ANC-UAT are used as an extension of the prevalence time series for each clinic, and possibly expanding the number of sites to a larger and more representative selection of clinics. Second, a ‘census’ approach in which ANC-RT data are aggregated across all health facilities in which HIV testing is offered to pregnant women within the geographic region being represented by the EPP model fit; in this approach, a single time series of HIV prevalence from ANC-RT is calculated and used in the model.
Existing statistical models for antenatal clinics unlinked anonymous testing and nationally representative population-based surveys data
The existing EPP model estimation utilizes separate likelihood functions to relate model predictions to HIV prevalence observations from NPS and ANC-UAT. Technical details of these likelihood functions are described elsewhere [11,14]. Briefly, we assume that NPS provide an unbiased estimate of the true general population HIV prevalence ρt, with appropriately estimated standard errors accounting for the complex survey design . For ANC-UAT prevalence, we model the probit-transformed prevalence trend within each sentinel site, with site-level random effects [bs ∼ N(0, τ2)] to capture differences in epidemic levels across sites . A bias term αUAT is incorporated to allow for a systematic bias (on the probit scale) between pregnant women HIV prevalence and the mean prevalence at ANC-UAT sites; this bias term captures the potential nonrepresentative selection of ANC-UAT site locations. The model prediction for ANC-UAT prevalence is adjusted to account for the expected change in the ratio ϕt of HIV prevalence among pregnant women to the general population as the epidemic matures . Elsewhere in this supplement, Eaton and Bao  describe another innovation to this likelihood to quantify and account for additional nonsampling error observed in ANC-UAT prevalence trends; this approach is also useful for modeling ANC-RT prevalence.
Data from routine testing of pregnant women
For the purposes of this analysis, we assumed that data were available on an annual basis from each health facility for (1) the number of women attending their first antenatal visit, (2) the number already known to be HIV positive, (3) the number tested for HIV, and (4) the number testing HIV positive. Prevalence among pregnant women from routine data is calculated as
We note some key assumptions underlying this calculation. First, the prevalence calculation includes self-reported status and hence relies on the accuracy of this self-reporting. However, these women cannot be excluded from the calculation because this will systematically bias the resulting prevalence estimate. Second, some women might not have their HIV status ascertained [the difference between (1) number attending ANC first visit and the denominator], and this calculation assumes that prevalence is similar regardless whether women had their status ascertained. In well functioning systems in which ANC-RT would be reliable, this number should be low, but may be an additional source of uncertainty.
In some settings, women may also be recorded as ‘previously known HIV-negative’ based on presenting documentation of a recent HIV test; in this case, the prevalence calculation is revised accordingly. Reporting systems also typically capture HIV testing of women at labor and delivery and in post-partum care. Capturing these diagnoses is important for programmatic monitoring, but for surveillance of epidemic trends, we recommend restricting the prevalence calculation to HIV status ascertainment at the first ANC visit for continuity of the previous ANC-UAT sentinel population, because facility attendance for labor and delivery is often lower than ANC attendance and potentially a more selected and biased population, and because women may attend different facilities for delivery than for ANC, increasing the potential for selective double counting of some women.
Statistical model for site-level antenatal clinics routine testing data
In countries that are currently expanding or improving ANC-RT services available to pregnant women, it may be advantageous to focus on collecting data from a limited number of health facilities in which high-quality data can be assured, as opposed to taking a census approach in which data quality could be more variable and prevalence trends may be reflective of changes in the composition of testing sites as the program expands.
The models of site-level ANC-RT data include many of the same terms from the model of ANC-UAT data. In addition, we introduce the calibration parameter βRT between ANC-UAT prevalence and ANC-RT prevalence from the same sites; this allows for the potential of a systematic bias between ANC-UAT and ANC-RT due to testing procedure differences and nonconsent. The error term of site-level ANC-RT
includes the sampling error with variance
and the nonsampling error. We assume that the nonsampling errors of the site-level ANC-RT and the site-level ANC-UAT have the same variance
because both use preselected sites and have approaches that incorporate extensive quality assurance checks. This leads to the following site-level ANC-RT data model:
where the free parameters are ρt, αUAT, βRT, τ2, and
Statistical model for census-level antenatal clinics routine testing data
Census-level ANC-RT data are an aggregation of routine data from all facilities that offer ANC-RT within the geographic region represented by an EPP model fit. If ANC-RT data have sufficiently high quality and HIV testing coverage is consistently high across facilities, then this approach overcomes the nonrepresentativeness of ANC-UAT sites. Considering the routine nature of the data, some sites may have lower quality data due to increased errors in testing or recording; however, with the large number of sites, the impact on aggregate prevalence may be small. Let αRT be the calibration parameter between pregnant women HIV prevalence and census-level ANC-RT prevalence, and
be the error term that includes sampling error
and nonsampling error from census-level routine testing
. The model of census-level ANC-RT data includes some terms from the ANC-UAT model, but these data are no longer at site-level
where the free parameters are ρt, αRT, and
We created synthetic datasets to understand how the incorporation of ANC-RT data in EPP affects estimates of the HIV prevalence trend and other parameters. We used the EPP r-spline model to simulate a single epidemic curve representative of a high HIV prevalence setting in sub-Saharan Africa (SSA). Taking this epidemic prevalence to be the ‘true’ prevalence trend, we simulated synthetic datasets consisting of two NPS occurring in 2004 and 2010; ANC-UAT at 11 sentinel sites in 1994–1996, 1998–1999, 2001, 2003, 2005, 2007, and 2010 (6 ANC-UAT sites only have data in 2007 and 2010); and ANC-RT data at 500 health facilities starting from 2011. This pattern of NPS and ANC-UAT data availability is typical of many countries in SSA.
To generate the synthetic data, we use ‘true’ parameter values. We have ‘true’ site effects bs for the 11 ANC-UAT sites and ‘true’ site effect variance τ2 = 0.1167. The ‘true’ value of the ANC-UAT calibration parameter αUAT is set to be 0.2402 (this corresponds to a prevalence increase from 6.00% to 9.43%) reflecting an assumption that ANC-UAT prevalence tends to be higher than general population prevalence because of the selection bias in sentinel site locations . For βRT, the calibration parameter between ANC-UAT prevalence and ANC-RT prevalence from the same sites, we set the ‘true’ value at −0.1, reflective of an assumption of a systematic difference between routine testing prevalence and that which would be obtained through ANC-UAT at the same facility; the value βRT = −0.1 corresponds to, for example, a prevalence decrease from 6.00% to 4.90%. For the site-level nonsampling variance parameter
, the ‘true’ value is set to 0.01301. To generate synthetic census-level ANC-RT data, we aggregate the simulated site-level ANC-RT data from 500 sites.
In our synthetic data analysis, the availability of ANC-RT data varies in four dimensions, which are census-level vs site-level, the number of sites, the overlap with ANC-UAT sites, and the number of data years. From the first three dimensions, we have five different scenarios for the availability of ANC-RT data based on realistic cases of ANC-RT usage. These scenarios include no ANC-RT data, ANC-RT sites solely as a continuation of existing ANC-UAT sites, additional ANC-RT sites with continuation of existing ANC-UAT sites, entirely new ANC-RT sites chosen to be more geographically representative, and census-level ANC-RT data.
- ‘No ANC-RT’ – no ANC-RT (ANC-UAT and NPS only),
- ‘11 Original’ – 11 ANC-RT sites as a continuation of ANC-UAT sites,
- ‘50 with Continuity’ – 50 ANC-RT sites with 11 sites as a continuation of ANC-UAT sites,
- ‘50 Resampled’ – 50 ANC-RT sites with no continuation of ANC-UAT sites,
- ‘Census’ – 500 ANC-RT sites aggregated to census level.
For each scenario except ‘No ANC-RT’, we vary the number of ANC-RT data years at 1, 3, and 5 years. Further details of the simulation procedure are provided in Appendix A1, http://links.lww.com/QAD/B50.
After simulating ANC-RT, ANC-UAT, and NPS data, we fit the EPP r-spline and r-trend models to the synthetic datasets and estimate the HIV prevalence and additional parameters. During estimation, we used prior distributions including αUAT ∼ N(0.15, 1.0) , βRT ∼ N(0, 1.0), αRT ∼ N(0, 1.0),
. To evaluate the prediction accuracy, we calculate the mean absolute error (MAE) for population HIV prevalence in 2011 and 2016 (ρ2011, ρ2016); the year-on-year change in prevalence between 2010–2011 and 2015–2016 (ρ2011–ρ2010, ρ2016–ρ2015); and the additional model parameters αUAT, βRT, and
. MAE is defined as the absolute difference averaged over 50 simulations between the median fitted value (of 3000 posterior samples) and the ‘true’ value.
Under the various settings of the EPP model, ANC-RT scenario, and number of ANC-RT data years, we compare the estimated and ‘true’ values of certain quantities. We present only the results for the r-spline model  in this section, but many findings remain the same when we analyze the synthetic data by using the r-trend model  (refer to r-trend results in Appendix A2, http://links.lww.com/QAD/B50).
For adults, the ‘true’ values of HIV prevalence in 2011 (ρ2011) and 2016 (ρ2016) were 6.62% and 5.74%, respectively. For HIV prevalence change, the ‘true’ value of the change from 2010 to 2011 was −0.193%, and the ‘true’ value of the change from 2015 to 2016 was −0.179%.
Site-level antenatal clinics routine testing with continuation from antenatal clinics unlinked anonymous testing sites
Table 1 presents results for ‘No ANC-RT’ and the two site-level ANC-RT scenarios (11 and 50 ANC-RT sites) with continuation from ANC-UAT sites. With more years of ANC-RT data, for prevalence, prevalence change, and
, the MAE generally decreases, but the mean difference did not always improve. For example, for estimation of ρ2016 with 11 ANC-RT sites, the MAE decreases from 0.378% for ‘No ANC-RT’ and 0.378% for 1 ANC-RT data year to 0.364% for 5 ANC-RT data years; however, the mean difference (bias) increases from −0.052% for ‘No ANC-RT’ and −0.079% for 1 ANC-RT data year to 0.116% for 5 ANC-RT data years. With more years, the MAE may decrease for βRT and increase for αUAT.
Comparing the two site-level ANC-RT scenarios, the scenario with 50 ANC-RT sites has lower MAE and mean difference for βRT. Increasing from 11 to 50 ANC-RT sites, the MAE decreases from 0.0364 to 0.0331 for 1 ANC-RT data year, from 0.0286 to 0.0262 for 3 ANC-RT data years, and from 0.0309 to 0.0276 for 5 ANC-RT data years; the mean difference decreases in magnitude from −0.0205 to −0.0098 for 1 ANC-RT data year, from −0.0229 to −0.0191 for 3 ANC-RT data years, and from −0.0242 to −0.0211 for 5 ANC-RT data years. The scenario with 50 ANC-RT sites has higher MAE for αUAT, and generally lower MAE for
(except for 1 ANC-RT data year). For prevalence and prevalence change, with more ANC-RT sites, the MAE generally does not change much.
Census-level antenatal clinics routine testing
Table 2 presents results for the ‘No ANC-RT’ scenario and the census-level ANC-RT scenario. With more years of ANC-RT data, for HIV prevalence and prevalence change, the MAE generally decreases, but the mean difference did not always improve. For example, for estimation of ρ2016, the MAE decreases from 0.378% for ‘No ANC-RT’ and 0.382% for 1 ANC-RT data year to 0.360% for 5 ANC-RT data years; however, the mean difference increases from −0.052% for ‘No ANC-RT’ and −0.057% for 1 ANC-RT data year to 0.113% for 5 ANC-RT data years. For αUAT and
, the MAE generally increases with more years of ANC-RT data.
Comparing site-level antenatal clinics routine testing with and without continuation from antenatal clinics unlinked anonymous testing sites
Table 3 presents results for the two site-level ANC-RT scenarios with 50 ANC-RT sites. With more years of ANC-RT data, for HIV prevalence, prevalence change, and
, the MAE generally decreases, but the mean difference did not always improve. For example, for estimation of ρ2016 without continuation from ANC-UAT sites, the MAE decreases from 0.391% for 1 ANC-RT data year to 0.365% for 5 ANC-RT data years; however, the mean difference changes from −0.093% for 1 ANC-RT data year to 0.110% for 5 ANC-RT data years. With more years of ANC-RT data, the MAE may decrease for βRT and increase for αUAT.
Having continuation from ANC-UAT sites does not significantly impact MAE for prevalence change. For prevalence, continuation may increase MAE for 1 and 3 ANC-RT data years, but slightly decrease MAE for 5 ANC-RT data years. For βRT, continuation improves the MAE and mean difference. The MAE decreases from 0.0602 to 0.0331 for 1 ANC-RT data year, from 0.0539 to 0.0262 for 3 ANC-RT data years, and from 0.0494 to 0.0276 for 5 ANC-RT data years; the mean difference decreases in magnitude from 0.0438 to −0.0098 for 1 ANC-RT data year, from 0.0352 to −0.0191 for 3 ANC-RT data years, and from 0.0281 to −0.0211 for 5 ANC-RT data years. However, for αUAT, continuation increases the MAE. For
, having continuation from ANC-UAT sites reduces MAE for 1 ANC-RT data year, but not necessarily for 3 and 5 ANC-RT data years.
In this article, we propose statistical models that incorporate the newly recommended ANC-RT data into Spectrum software. The new set of models allows joint analysis of HIV prevalence data from NPS, ANC-UAT, and ANC-RT, so that Spectrum can better inform estimates of prevalence and other measures of the epidemic. We also applied these models to synthetic datasets and examined how availability of ANC-RT data affects the accuracy of various parameter estimates. Based on the synthetic data results, our conclusions are as follows.
Fitting HIV prevalence trends using synthetic data generally gives precise estimates (low MAE) of the underlying trend and other parameters. Based on our simulation results, the proposed models for ANC-RT data are appropriate for use with either the r-spline or the r-trend model in EPP.
With more years of ANC-RT data, our estimates of HIV prevalence, prevalence change, and site-level nonsampling variance became more precise as represented by the lower MAE values. (In the census-level ANC-RT scenario, the estimate of site-level variance did not improve because, in that scenario, this parameter was the variance of ANC-UAT data, not ANC-RT data.) Although the mean difference sometimes improved with more years of ANC-RT data, in many cases, the reduction in MAE was likely due to reduced variance of the prediction when more recent ANC-RT data were incorporated. In terms of the calibration parameters, we generally obtain good estimates from only 1 year of ANC-RT data.
Increasing the number of ANC-RT sites (but keeping the same number of continuation sites from ANC-UAT) reduces the MAE and the mean difference for βRT; perhaps more ANC-RT sites lead to a more accurate average of ANC-RT prevalence and hence a better estimate of the ANC-RT calibration. However, the MAE for αUAT generally increases; as the mean difference for αUAT generally improves, the MAE increase is due to additional uncertainty in αUAT.
Having ANC-RT continuation from ANC-UAT sites gives a more precise estimate of the site-level ANC-RT calibration parameter βRT; this makes intuitive sense because βRT measures the systematic difference between ANC-RT and ANC-UAT prevalence at the same sites. However, continuation generally increases the MAE for αUAT; as the mean difference for αUAT improves, the MAE increase is due to additional uncertainty in αUAT.
When we only have a few years of ANC-RT data, the variance of nonsampling errors may not be accurately estimated. An overestimated variance parameter will make the ANC-RT data contribution negligible. Considering this, we assume that the variance of site-level ANC-RT nonsampling errors is the same as the variance of ANC-UAT nonsampling errors as they are both collected from pregnant women. Users of EPP may choose to estimate site-level ANC-RT and ANC-UAT variances separately, if there are sufficient years of site-level ANC-RT data available, and Appendix A3, http://links.lww.com/QAD/B50 further investigates this.
As more high-quality ANC-RT data become available, it is essential to apply the proposed model to real datasets and evaluate the many as-yet unvalidated assumptions underlying our understanding of ANC-RT prevalence data. Some of these key assumptions include that trends in ANC-RT prevalence accurately reflect the HIV prevalence trends among pregnant women rather than changes in patterns of service provision or utilization; self-reported HIV status by known HIV-positive women are reliable and when combined with routine testing prevalence result in accurate estimates of the true prevalence among pregnant women; we are able to adequately account for changes in fertility among HIV-positive women to distinguish between true epidemic changes vs changes in fertility patterns (in particular the effects of ART on fertility and fertility intentions ); and that our proposed approaches for accounting for additional uncertainty about routine prevalence data are appropriate and can be estimated from available data. Overall, routine prevalence data provide an opportunity to substantially improve the precision and granularity of HIV epidemic estimates. However, just as programs, surveillance, and reporting systems must be continually evaluated and improved, the interpretation and modeling of these must also be reviewed and validated.
The research was supported by the Joint United Nations Programme on HIV/AIDS, NIH – R56 AI120812-01A1, NIH – UL1 TR000127 and TR002014, NSF IGERT – DGE-1144860, and NSF – BCS-0941553. The authors are grateful to Tim Hallett, Kelsey Case, Sabrina Lamour, Jacob Dee, Peter Young, Ray Shiraishi, Peter Ghys, Tim Brown, John Stover, Andreas Jahn, and Thoko Kalua.
Conflicts of interest
There are no conflicts of interest.