In an ongoing emerging infectious disease outbreak, contact tracing can provide accurate real-time information on relevant parameters, which is needed by public health professionals to control an outbreak.1–4 Especially, in an emerging infectious disease outbreak, active daily (or even hourly) monitoring of contacts provides a wealth of information. First, data on how many contacted individuals (hereafter: contacts) remain under surveillance can indicate what control efforts are possibly required in the short term. Second, data on exposure type–specific attack rates (ARs) can help in targeting control efforts to contacts at high risk. We defined the exposure type–specific AR as the proportion of infected individuals with a certain exposure type among the total number of traced individuals with that exposure type. Third, estimation of the effective reproduction number R(t), defined as the expected number of secondary cases per primary case infected at time t,5 allows evaluation of infection transmissibility and the effect of current control measures.
However, real-time use of contact tracing data by public health professionals is limited, despite publication of several methods for inferring key parameters from outbreak data.6–13 One problem is that contact tracing data contains a lot of information on confirmed cases and their contacts through various exposure routes at different time points or intervals. Using all that information requires an overview. In addition, many methods focus on just a single parameter,7–10 often the effective reproduction number. To obtain all key outbreak parameters, a public health professional has to search through a wide range of methods, which is quite cumbersome and time-consuming. A third issue is that real-time analysis should account for the fact that the data are typically censored: at the time of analysis, the outcome (whether a contact is infected or not) is unknown for many exposed persons.9 Censoring has a consequence for accurate estimation of outbreak parameters, such as ARs, which may be underestimated if traced contacts later appear to be cases. As far as we know, only few studies take such censoring into account.9,10 A final issue is that published methods require specific software that is either hard to obtain or requires advanced programming skills.8,12,13
To improve public health decision-making during an emerging infectious disease outbreak, we developed a tool that uses contact tracing information to estimate outbreak parameters in real time while correcting for right censoring of the data (i.e., taking into account that contacts in follow-up may become cases in the near future in the real-time estimation of outbreak parameters). Our tool provides a comprehensive visualization of the outbreak, including the course of the outbreak and contacts under surveillance. The tool is easy to use by public health professionals as it is available in a user-friendly web interface. The tool is specifically aimed at analyzing contact tracing data of emerging infectious disease outbreaks, as contact tracing is most relevant during such outbreaks where there is the greatest need for such knowledge about the ongoing outbreak. Our tool is aimed at serious infectious disease outbreaks, for which active daily contact tracing is in place, whereby all cases are identified. In this article, we demonstrate this tool using contact tracing data from a well-documented smallpox outbreak in The Netherlands in 1951.
METHODS
Data
Data for the tool should be entered in a spreadsheet, with a row for each case or contact and the following information in the columns:
- –unique identifier (ID)
- –gender
- –infection status: is it a case or a contact?
- –ID of source
- –exposure type
- –exposure date (first day)
- –exposure duration
- –date of disease onset
In this article, we use the word “contact” to mean an individual who has come into contact with a case and has not yet been shown to be a case himself and “exposure” to mean the event of contact between the two individuals. Additional information that should be obtained is the incubation period distribution (mean and standard deviation) and the generation interval distribution (mean and standard deviation). The generation interval refers to the time between symptom onset of successive cases in a chain of transmission. The data saved as a csv-file can be uploaded into a user-friendly web interface for easy use of the tool (https://github.com/lsoetens/Contactviz) or loaded via the R code provided at the same place, using the R statistical software.
For illustration of the tool, we used data from a smallpox outbreak in Tilburg, The Netherlands, in 1951.14,15 In this outbreak, 52 cases were identified, with a date of symptom onset ranging between 23 February and 25 May 1951. The outbreak was detected on 24 April, and by 29 April, control measures, such as additional mass vaccination and isolation of cases and contacts, had been implemented. No individual details on vaccination status were available, but the vaccination coverage in Tilburg in 1951 was approximately 88%.14,15 The cases occurring before 29 April were traced back and their contacts identified. These efforts led to a very complete documentation of this outbreak. For almost all cases (n = 50), it is known by whom or how they were most likely infected. Contacts of cases were also registered and monitored. The data are given in eTable 1; https://links.lww.com/EDE/B290. The exposure type was categorized as several types: family/household, friend/neighbor, work-related, and medical/religious setting. For some family/household cases or contacts, the date and duration of exposure were not known exactly. We therefore assumed that the duration was 21 days forward from the symptom-onset date of the related case. If the related case was put in isolation, we assumed contact exposure from the symptom-onset date of the related case to the isolation date. To demonstrate our tool in this article, we analyzed the data as they would have been observed on 30 May 1951. For the remainder of the article, we will call this date “the current date.” Ethical approval was deemed not necessary, as the data were already publicly available and do not allow identification of individual persons.
Monitoring Period
An important decision in outbreak control concerns which contacts need follow-up and which can be set aside. For each contact i, we define the monitoring period as the interval from the five-percentile time point to the 95-percentile time point of the effective incubation period. The effective incubation period feff,i(t) is defined as the total period from a first exposure to symptom onset. For any infected individual, the effective incubation period is the sum of the time from first exposure to effective transmission and from effective transmission to symptom onset. The distribution of the effective incubation period is the convolution of the uniform distribution from first exposure to last exposure with the lognormal distribution that describes the incubation period distribution. In our method, repeated exposures are merged to one exposure interval, which is assumed to be uniformly distributed. For smallpox, the lognormal distribution that best fits the observed incubation periods has a median of 13.0 days and a dispersion of 1.13 days.16 We denote the corresponding cumulative probability distribution function as Feff,i(t) and the corresponding survival distribution as Seff,i(t)= 1 − Feff,i(t). In the visual output, the monitoring period is displayed as a horizontal black line, and it is shown only for those contacts that are still monitored, that is, with Seff,i(t) (current date) > 0.05.
Probability of Infection for a Contact
Each contact of exposure type x has a probability πx of being infected and a probability of (1 − πx) of not being infected. If not infected, the contact does not develop any symptoms. If infected, the contact will develop symptoms, and the time from first exposure to symptom onset will follow the effective incubation period. This defines a mixture distribution of a contact i of type x for its time to infection: fx,i(t) = πxfeff,i(t). We denote the corresponding cumulative probability distribution function as Fx,i(t) = πxFeff,i(t) and the corresponding survival distribution as Sx,i(t) = 1 − πxFeff,i(t). For each contact i who has a link of exposure type x to its linked case j and has not developed any symptoms up to time t since exposure, the probability that it is infected is calculated by Bayes’ rule as
;)
which is, based on the definitions above, given by
;)
We used maximum likelihood estimation to estimate πx. More details can be found in eAppendix A; https://links.lww.com/EDE/B290.
Attack Rate by Exposure Type
The exposure type–specific AR, πx, is defined as the probability of a contact being infected, among contacts of exposure type x. In general, the exposure type–specific ARs will tell public health professionals the type of exposure through which contacts are at the highest risk of developing symptoms. In the context of contact tracing, the AR measures the true positive rate, or precision, of the contact definition. A higher value indicates a more efficient contact tracing. The AR is separately estimated for every exposure type x having a total number of cases and contacts greater than 5.
Reproduction Number
By evaluating the course of the effective reproduction number R(t) during the outbreak, public health professionals can assess changes in the effective transmissibility of the disease.7,11 There are various ways of estimating R(t) over time,7,11,17 but most assume that the links between the cases, who infected whom, are not known. The R(t) then needs to be inferred indirectly, whereas when using contact tracing data, the R(t) can be estimated directly. For each case j, we calculate the expected number of secondary cases. The secondary cases consist of confirmed secondary cases, contacts that are or might be infected but have not yet shown symptoms, and orphans. Orphans are infected individuals for whom the infector’s identity is unknown, though by assumption the infector is one of the known cases. To calculate the first term on the right hand side, we count the number of confirmed secondary cases of case j (Dj(t)). Confirmed secondary cases are defined as contacts that were followed up as a result of exposure to a case and eventually developed symptoms within their monitoring period. To calculate the second term, we sum over the contacts of case j and weigh each contact by his/her probability of being infected by case j (pi,j,x(t)). To calculate the third term, we sum over the number of orphans and weigh each by his/her probability of being infected by case j.17
;)
Here, the index i refers to all contacts, the index k to all orphan cases, and the index l to all contacts that could have infected the kth orphan case. The function w gives the generation interval distribution, where the generation interval is defined as the duration between symptom onset of an infected individual and symptom onset of his/her infector. For smallpox, we take the generation interval to follow a gamma distribution with a mean and variance of 17.7 days.18
For any given period of time, we can calculate the average value of the reproduction number for cases who developed symptoms within that period. Of interest here are the period from start of the outbreak to the time the first case was detected (including all primary cases with a date of onset of disease before 24 April 1951 and all secondary cases with an exposure date before 24 April 1951) and the period from implementation of control measures to the current date (including all cases with an exposure date between 29 April 1951 and 30 May 1951). To visualize the time course of the average reproduction number, we calculate for each time t a running average over a time window (t – τ, t) of length τ. We choose τ equal to the generation interval for the specific infection. For smallpox, it is 17.7 days.18 We estimate 95% nonparametric bootstrap percentile confidence intervals (CIs)11 around the reproduction numbers, which reflect the uncertainty in whether contacts in follow-up will become cases and the uncertainty regarding possible links to orphan cases. The intervals are estimated by repeating the following steps a thousand times: first, draw from a binomial distribution with probability pi,j.x for every contact in follow-up whether it is going to be a case or not and for every possible link to an orphan case if this a true link or not; second, sum the total number of secondary cases per case (this includes confirmed secondary cases, contacts in follow-up, and orphan cases); and third, compute the mean of the total number of secondary cases per case. By repeating this a thousand times, a thousand means are estimated, from which the 2.5% and 97.5% percentiles define the 95% CI around the R(t) in a certain time interval.
Algorithm for Layout of the Time-oriented Transmission Tree
To provide an overview of the links between cases and contacts and the evolution of the outbreak over time, we adapted the Reingold–Tilford tree-drawing algorithm19 to obtain a time-oriented transmission tree (see further details in eAppendix B; https://links.lww.com/EDE/B290). In this node-link tree, cases and contacts are represented as nodes, and the exposures between cases and contacts are represented as links. The nodes of the cases are placed according to the first day of symptom onset, and the nodes of the contacts are placed according to the lower level of the monitoring period.
Model Assumptions
Underreporting
The model assumes that all cases and contacts have been observed. However, this is most often not the case. This underreporting might therefore introduce bias to our model. As intensive daily contact tracing is in place during an emerging infectious disease outbreak, we assume that the number of missed secondary cases is minimal and subsequently that this underreporting has a limited effect on the estimation of the effective reproduction number. Underreporting does not play a role in the estimation of the attack rates, as we have defined the attack rate as the proportion of infected individuals among the total number of traced individuals, instead of susceptibles.
Reporting Delay
The model does not take into account reporting delay, which may introduce bias in our estimates of the effective reproduction number and the attack rates. As intensive contact tracing is in place, we assumed that reporting delays are minimal as all cases and contacts are under strict monitoring.
Introductions Versus Orphans
The model assumes there was only one introduction (index case) in the community and that cases not reported as a contact by anyone else (i.e., orphans) have their infector among the observed cases. We considered it more likely that a case with an unknown source originates from another observed case than a new introduction of the emerging infectious disease occurred in the community.
Simulation Study
To allow assessing the accuracy of the estimated attack rates and effective reproduction number and to assess the performance of our tool for a larger outbreak, we performed a simulation study. We used the outbreaker package in R to simulate an outbreak three times as large as the smallpox outbreak in Tilburg, with similar parameter settings. We compared the estimates of our model that use information on cases in follow-up to correct for right censoring with reference estimates obtained by including all present and future information on the infections and their contacts (no right censoring) and with naïve estimates obtained by ignoring any current information on the contacts (no correction for right censoring). A detailed description of the simulation study can be found in eAppendix C; https://links.lww.com/EDE/B290.
Visualization
To provide an overview of the outbreak, we display the contact tracing data in a multipanel plot (Figure 1) using the above-mentioned parameters and algorithms. It includes a large plot with three horizontal panels plus a fourth panel at lower right.
FIGURE: Screenshot of the contact tracing visualization tool with an overview of contact tracing information during an outbreak of smallpox in Tilburg, The Netherlands, using information as of 30 May 1951. Top, Node-link diagram of the (possible) transmission events between cases and contacts over time. The cases are shown as symbols (square, men; diamond, women; and circle, unknown) by date of onset of symptoms. The contacts with S eff,i(current date) > 0.05 are displayed as a horizontal dark red bar indicating their monitoring period. The links between cases (case–case links, solid) and between cases and contacts in follow-up (case–contact links, dashed) are depicted as lines and colored by exposure type. Middle, Course of R(t) during the outbreak and observed number of secondary cases per primary case. The course of the average case reproduction number, R(t) (with 95% nonparametric bootstrap percentile confidence intervals), was estimated using a left-sided moving average with a time-window of 18 days (the generation interval). The observed numbers of secondary cases per primary case are displayed as black dots, with the size of the dots being proportional to the frequency of the number of secondary cases. Bottom, The epidemic curve of the outbreak per 3 days by date of onset of symptoms. Bar chart, The exposure type–specific attack rates with their confidence intervals. Only exposure types with >5 cases/contacts are displayed.
In the large plot, the bottom panel contains an epidemic curve of the outbreak so far. The epidemic curve is displayed as a histogram of the number of cases per three days from the date of symptom onset of the first case to the current date. We take as a rule of thumb: the width of the bars in the epidemic curve is usually one quarter of the length of the incubation period.
The middle panel, which is vertically aligned with the epidemic curve, contains a line graph of the course of R(t) with its 95% CI during the outbreak. In addition, for every case, the observed number of secondary cases Dj(t) is displayed by date of symptom onset of case j as a scatterplot, in which the size of the dots is proportional to the frequency of the number of secondary cases.
The top and main panel, vertically aligned with those below, contains the time-oriented transmission tree. We have used the algorithm described above to place the nodes and links. The nodes representing cases in the tree are displayed as squares (men) or as diamonds (women) at the date of symptom onset. The nodes representing contacts in follow-up are displayed as dark red bars reflecting their monitoring period. In this way, it can easily be seen which contacts are still at risk of developing symptoms, that is, where the period overlaps the current date. Nodes are labeled with the case or contact ID number. The links between nodes are depicted as colored lines reflecting the various exposure types (family/household, friend/neighbor, medical/religious, work); the line types differ for links between cases (solid) and case–contact links (dashed).
A solid vertical bar intersects these three panels to indicate the current date. The time period after the current date is colored gray to indicate that the outcomes are still unknown for that period. Finally, the plot is annotated with dashed vertical lines that indicate important dates, such as the day when the first case was discovered and the day when control measures were installed. With the addition of such annotations, it can easily be seen if the situation has changed after a certain time point.
The fourth panel, lower right, contains a bar chart representing the exposure type–specific ARs and their 95% CIs. The bar colors match the line colors in the network plot. Exposure types are displayed only when their total number of cases and contacts is greater than five.
Finally, above the fourth panel is information containing some summary statistics of the outbreak so far (number of cases/contacts, number of contacts in follow-up or discarded, and maximum number of secondary cases per case) as well as important dates and explanation of the above-mentioned features. The R 3.1.0 statistical software is used to construct this multipanel plot; main packages used are igraph, ggplot, and shiny.
RESULTS
An overview of the outbreak situation at 30 May 1951 in a multipanel plot is given in the Figure. The run time of the analysis of this outbreak at this date was 10.92 seconds. An overview of the outbreak at an earlier time, the 29 April 1951, can be found in eAppendix D; https://links.lww.com/EDE/B290.
Between 23 February and 30 May 1951, 52 smallpox cases including 28 men and 24 women were identified, with the highest number of cases reported at the beginning of May, as seen in the epidemic curve (bottom left panel, Figure). For three cases (ID 1, 34, and 50), no source could be identified. By 30 May, known as the current date, 22 contacts were in follow-up (i.e., within their surveillance period) and 174 contacts were discarded as their surveillance was over. The latter are not shown to prevent crowding of the plot. If no new cases and contacts present, follow-up of all contacts was to end by mid-June 1951.
The time-oriented transmission tree (top left panel, Figure) shows that most cases were infected through household/family contact (31/52; red lines), followed by friend/neighbor contact (9/52; blue lines). The exposure type–specific ARs are depicted in the bar chart in the lower right panel of Figure 1. The AR was highest among those infected by work-related contact (AR = 42%; 95% CI = 9%, 81%) and by medical contact (AR = 31%; 95% CI = 5%, 71%).
The evolution of R(t) during the outbreak (left middle panel, Figure) shows several peaks in March and April, after which the R(t) declines to zero at the beginning of May. At the end of May, a slight increase in the R(t) is observed, indicating probable secondary cases among the contacts still in follow-up. The maximum number of secondary cases per case was 13 by case 35, who could be identified as the main spreader in this outbreak thus far. The transmission tree and the course of R(t) over time indicate that before the first case was discovered and additional control measures were implemented, the R(t) was decreasing. The R(t) before the first case was detected on 24 April was 2.42 (95% CI = 2.4, 2.53), and the R(t) after additional control measures were installed was 0.83 (95% CI = 0.43, 1.29). The R(t) before outbreak detection is slightly lower than the reproduction numbers reported in previous studies20–22; however, when taking into account that the population in this outbreak was only partially susceptible because of childhood vaccination, the findings are quite comparable.
The simulation study in eAppendix C; https://links.lww.com/EDE/B290 shows that a larger outbreak can be studied in a similar way as the outbreak used in this study. The proposed estimators for the attack rate and effective reproduction number that include information on contact tracing to account for right censoring of the data come closer to the actual values in comparison with another (naïve) estimator that does not correct for right censoring. The run time of the analysis of the simulated outbreak was 27 seconds on a standard laptop computer (Intel Core i5 processor).
DISCUSSION
We have proposed a tool and methods to accurately infer outbreak parameters in real time from contact tracing data to enhance decision-making and subsequently the efficacy and efficiency of outbreak control. The tool provides an indication of the workload of outbreak investigators (the number of contacts in follow-up) and gives an impression of the effective transmissibility of the disease during the outbreak (R(t)). Visualization of this information shows clearly which contacts are still at risk of developing symptoms and which are no longer at risk and can therefore be discarded, reducing the workload. The estimation and visualization of exposure type–specific ARs helps public health practitioners to identify the exposure types through which contacts are at the highest risk of developing symptoms. Contact definitions can then be made more specific during the outbreak. In our example of smallpox, we surprisingly found that work-related and medical contacts yielded the highest AR, although family/household contact is usually more intense. Finally, the information available through contact tracing allows assessment of interventions. In the smallpox outbreak, it was observed that before the first case was discovered and the implementation of additional control measures, the R(t) was decreasing. This decrease might be attributed to the intervention as cases identified just before its installation (up to 18 days, the generation interval) were less likely to cause secondary cases. The course of R(t) is likely to be consistent with effectiveness of the control measure.
Our approach has three main strengths. First and foremost is its ability to analyze the data in real time. Incubation period distributions are used to estimate when a contact is most likely to develop symptoms, providing an up-to-date overview of the contacts in follow-up and the related workload for outbreak investigators. We added to the real-timeliness of the tool by taking the censored data into account. This involved estimating the probability of infection for a contact using maximum likelihood estimation for those contacts still in follow-up to estimate the AR and R(t) during the course of the outbreak. The AR is often obtained by using aggregated numbers for total number of cases and contacts at a single time point. Contacts in follow-up are usually neglected and left out of the analysis. However, the contacts in follow-up can still become cases, and the ARs might therefore be underestimated. Also, R(t) is often underestimated as potential secondary cases are not taken into account. By accounting for the censoring of contacts still in follow-up, we could make our AR and R(t) estimations more accurate (as we have shown in our simulation study in eAppendix C; https://links.lww.com/EDE/B290).
The second strength of our approach is that it provides a comprehensive overview of the outbreak at a glance. This graph is easily brought to table when outbreak management decisions are discussed, making it easier to use by outbreak investigators compared to other more software-oriented tools.8,12,13 The tool is provided with a user-friendly interface, making the use of the tool relatively straightforward. Value has been shown for all elements of the visual, such as the epidemic curve, the contact network, and the statistics, but having them readily available in one visual output with the latest results is novel. Our combination of elements offers a great advantage over previous methods focused on only one element of the outbreak analysis.7–9
The third strength is that the output is presented visually. We specifically aimed for this because the human mind can acquire more information through vision than through all of the other senses combined.23 Besides fast and efficient processing, visualization has several other advantages,23 allowing the viewer to comprehend large amounts of data and perceive patterns. Problems with the data can become more easily apparent. In our smallpox example, it is immediately clear that the generation time between case 6 and 12 is improbably long and most likely because of a registration error or missed case in between. Finally, visualization facilitates hypothesis generation. For these reasons, we support the use of visuals instead of tables and text to communicate contact tracing information for decision-making. In addition, the visual output bridges the gap between the more software-oriented tools8,12,13 and public health professionals whose programming skills are not always advanced.
Several issues regarding use of our tool in daily practice should be kept in mind. First, the visual is only as good as the input information. For this study, we used historical data of a well-documented outbreak. We are aware that during an outbreak, registration of all cases and contacts in a tidy way might not be the first priority. However, we expect that our approach to visualize the valuable information for decision-making provides an incentive for outbreak investigators to organize the data in the required format, as it will be time-saving in the end. On the other hand, since this method is so dependent on data quality, it can be used to trace errors of data entry as well. Second, the visual layout can be adapted to deal with very large outbreaks with numerous contacts, such as the pertussis outbreak in The Netherlands in 2012,24 infectious diseases with a potentially long incubation time, such as tuberculosis,25 or other than human-to-human transmissible diseases. For very large outbreaks, the tool could be adapted by splitting the outbreak into smaller clusters. For other than human-to-human transmissible diseases, like environmental point source outbreaks, the tool could be adapted by estimating the attack rates and transmission tree by environmental source. The estimates of the effective reproduction number are not relevant in such a situation.
There are various ways in which the tool can be improved upon in future studies. We chose to use the incubation period distribution and generation time distribution from the literature. One could also try to estimate these parameters from the data itself, with as a major advantage a higher accuracy of the estimates for that specific outbreak. However, the estimates would be very unstable at the beginning of the outbreak because only limited data is available. It could be a possible direction for future development of the tool to combine prior information from literature with a growing amount of data from an ongoing outbreak. Second, outbreaks are typically described by the time, pathogen type, place, and person characteristics. This tool was mainly focused on the dimensions of time and person. Future work on how to add the dimensions of pathogen type and place to this layout would be useful. A final minor point for improvement is the tree plotting algorithm, for which we adapted a proven algorithm to optimize the use of available space (eAppendix B; https://links.lww.com/EDE/B290), but which in some cases results in crossing lines.
To conclude, we have proposed a tool to visualize contact tracing information to increase timely access to information on the characteristics of an emerging outbreak. The tool obtains and visualizes accurate outbreak parameters by taking into account the censoring of the data. The resulting information can help public health professionals to get a real-time overview of the outbreak and subsequently make better decisions to control the outbreak. Further research should focus on improving the network layout algorithm and enhancing the visual with data on pathogen type and place that are obtained during the outbreak.
REFERENCES
1. Eames KT, Keeling MJContact tracing and disease control. Proc Biol Sci. 2003;270:2565–2571.
2. Klinkenberg D, Fraser C, Heesterbeek HThe effectiveness of contact tracing in emerging epidemics. PLoS One. 2006;1:e12.
3. Kiss IZ, Green DM, Kao RRDisease contact tracing in random and clustered networks. Proc Biol Sci. 2005;272:1407–1414.
4. Bonačić Marinović A, Swaan C, van Steenbergen J, Kretzschmar MQuantifying reporting timeliness to improve outbreak control. Emerg Infect Dis. 2015;21:209–216.
5. Fraser CEstimating individual and household reproduction numbers in an emerging epidemic. PLoS One. 2007;2:e758.
6. Jewell CP, Roberts GOEnhancing Bayesian risk prediction for epidemics using contact tracing. Biostatistics. 2012;13:567–579.
7. Cori A, Ferguson NM, Fraser C, Cauchemez SA new framework and software to estimate time-varying reproduction numbers during epidemics. Am J Epidemiol. 2013;178:1505–1512.
8. Obadia T, Haneef R, Boëlle PYThe R0 package: a toolbox to estimate reproduction numbers for epidemic outbreaks. BMC Med Inform Decis Mak. 2012;12:147.
9. Ghani AC, Donnelly CA, Cox DR, et alMethods for estimating the case fatality ratio for a novel, emerging infectious disease. Am J Epidemiol. 2005;162:479–486.
10. Cauchemez S, Boelle PY, Donnelly CA, et alReal-time estimates in early detection of SARS. Emerg Infect Dis. 2006;12:110–113.
11. Hens N, Calatayud L, Kurkela S, Tamme T, Wallinga JRobust reconstruction and analysis of outbreak data: influenza A(H1N1)v transmission in a school-based population. Am J Epidemiol. 2012;176:196–203.
12. Jombart T, Aanensen DM, Baguelin M, et alOutbreakTools: a new platform for disease outbreak analysis using the R software. Epidemics. 2014;7:28–34.
13. Höhle MAn R package for the monitoring of infectious diseases. Comput Stat. 2007;22(4):571–582.
14. Sas GJKlinische en epidemiologische waarnemingen tijdens de pokkenepidemie te Tilburg in 1951. 1954.Leiden, The Netherland: Universiteit Leiden;
15. Ministerie van sociale zaken en volksgezondheid. Rapport omtrent de pokkenepidemie te Tilburg in 1951. Staatsdrukkerij- en uitgeverijbedrijf, ‘ 1953:s-Gravenhage; 33565.
16. Sartwell PEThe incubation period and the dynamics of infectious disease. Am J Epidemiol. 1966;83:204–206.
17. Wallinga J, Teunis PDifferent epidemic curves for severe acute respiratory syndrome reveal similar impacts of control measures. Am J Epidemiol. 2004;160:509–516.
18. Vink MA, Bootsma MC, Wallinga JSerial intervals of respiratory infectious diseases: a systematic review and analysis. Am J Epidemiol. 2014;180:865–875.
19. Reingold EM, Tilford JSTidier drawings of trees. IEEE Trans Software Eng. 1981;SE-7:223–228.
20. Gani R, Leach STransmission potential of smallpox in contemporary populations. Nature. 2001;414:748–751.
21. Eichner M, Dietz KTransmission potential of smallpox: estimates based on detailed data from an outbreak. Am J Epidemiol. 2003;158:110–117.
22. Riley S, Ferguson NMSmallpox transmission and control: spatial dynamics in Great Britain. Proc Natl Acad Sci USA. 2006;103:12637–12642.
23. Ware CInformation Visualization: Perception for Design. 2004.Amsterdam: Elsevier Science Morgan Kaufmann;
24. van der Maas NA, Mooi FR, de Greeff SC, Berbers GA, Spaendonck MA, de Melker HEPertussis in the Netherlands, is the current vaccination strategy sufficient to reduce disease burden in young infants? Vaccine. 2013;31:4541–4547.
25. Borgdorff MW, Sebek M, Geskus RB, Kremer K, Kalisvaart N, van Soolingen DThe incubation period distribution of tuberculosis estimated with a molecular epidemiological approach. Int J Epidemiol. 2011;40:964–970.