 Research article
 Open Access
 Open Peer Review
 Published:
The role of superspreading events in Mycobacterium tuberculosis transmission: evidence from contact tracing
BMC Infectious Diseasesvolume 19, Article number: 244 (2019)
Abstract
Background
In current epidemiology of tuberculosis (TB), heterogeneity in infectiousness among TB patients is a challenge, which is not well studied. We aimed to quantify this heterogeneity and the presence of “superspreading” events that can assist in designing optimal public health interventions.
Methods
TB epidemiologic investigation data notified between 1 January 2005 and 31 December 2015 from Victoria, Australia were used to quantify TB patients’ heterogeneity in infectiousness and superspreading events. We fitted a negative binomial offspring distribution (NBD) for the number of secondary infections and secondary active TB disease each TB patient produced. The dispersion parameter, k, of the NBD measures the level of heterogeneity, where low values of k (e.g. k < 1) indicate overdispersion. Superspreading was defined as patients causing as many or more secondary infections as the 99th centile of an equivalent homogeneous distribution. Contact infection was determined based on a tuberculin skin test (TST) result of ≥10 mm. A NBD model was fitted to identify index characteristics that were associated with the number of contacts infected and risk ratios (RRs) were used to quantify the strength of this association.
Results
There were 4190 (2312 pulmonary and 1878 extrapulmonary) index TB patients and 18,030 contacts. A total of 15,522 contacts were tested with TST, of whom 3213 had a result of ≥10 mm. The dispersion parameter, k for secondary infections was estimated at 0.16 (95%CI 0.14–0.17) and there were 414 (9.9%) superspreading events. From the 3213 secondary infections, 2415 (75.2%) were due to superspreading events. There were 226 contacts who developed active TB disease and a higher level of heterogeneity was found for this outcome than for secondary infection, with k estimated at 0.036 (95%CI 0.025–0.046). In regression analyses, we found that infectiousness was greater among index patients found by clinical presentation and those with bacteriological confirmation.
Conclusion
TB transmission is highly over dispersed and superspreading events are responsible for a substantial majority of secondary infections. Heterogeneity of transmission and superspreading are critical issues to consider in the design of interventions and models of TB transmission dynamics.
Background
The Global Tuberculosis (TB) Strategy looks towards the ultimate vision of elimination of the TB epidemic, although the disease still causes more than 10 million cases and 1.8 million deaths each year [1, 2]. The epidemic is not homogeneously distributed, but is a collection of heterogeneous local microepidemics [3]. The existence of heterogeneity in transmission has the potential to disrupt elimination strategies, many of which assume broadly similar transmission potential of infectious people. Hence, it is essential to understand and quantify the degree of heterogeneity that exists in TB transmission.
Several studies have reported heterogeneity in the capacity of individual source patients to transmit various pathogens to their contacts [4,5,6,7]. Variation between infectious individuals in their capacity to transmit infectious agents is well described, with some superspreaders infecting large number of contacts while others may only infect very few or none [4, 5, 7,8,9,10]. The contribution of superspreading has previously been quantified for directly transmitted infections such as SARS, measles, smallpox, monkeypox and pneumonic plague [4].
TB patients are diverse in their capacity to transmit infection to their contacts, with a systematic review that included several contact tracing studies showing that clinical, demographic and behavioural characteristics of TB patients were associated with their ability to transmit Mycobacterium tuberculosis (M. tb) infection [11]. Quantification of heterogeneity in M. tb transmission will help to understand better how its transmission is sustained. A study in the Netherlands using genotypic clustering data quantified M. tb transmission heterogeneity and reported signs of superspreading [10]. However, how TB patients vary with respect to their capacity to produce secondary infection is not well understood, including the extent to which superspreading events exist and are responsible for driving transmission. Understanding transmission heterogeneity and characterising those with greater capacity to spread the infection is critical to better target interventions and predict their likely impact. As the global TB response transitions towards ending TB and the epidemic becomes more localised [3], understanding heterogeneities of transmission is increasingly important. We aimed to characterise transmission heterogeneity in a wellresourced setting using highquality surveillance data including detailed information on contacts and their infection status.
Methods
Setting
Victoria is a state of Australia with approximately 5.6 million people and a single centralised tuberculosis program (the Victorian Tuberculosis Program; VTP). Notification of all confirmed or suspected cases of TB disease is mandatory for both laboratories and clinicians and culture confirmation of M. tb is routine in this setting. While hospitalisation of cases is not mandatory, those with pulmonary disease are typically maintained in isolation until they are considered noninfectious (> 2 weeks of effective therapy and/or smearnegativity) [12, 13]. On receipt of a notification, a public health nurse from the VTP is allocated to the patient to provide support, assist with treatment compliance, and assess the extent of contact tracing required. Household contacts and others with greater than an estimated 8 h of contact are considered eligible for screening, with individualised assessment of screening recommendations for higherrisk contacts performed (e.g. immunosuppressed or those with highintensity exposure).
Contact investigation initially consists of clinical assessment and serial testing for M. tb infection. Testing of contacts is conducted by either tuberculin skin testing (TST) using the Mantoux procedure or an interferon gamma release assay (IGRA), although during the study period the large majority of testing was undertaken with TST. Those negative on initial testing are tested again 8–12 weeks following exposure. Contacts with either symptoms suggestive of active disease or a positive test for TB infection undergo chest xray (CXR) and further clinical assessment, with isoniazid preventive treatment offered for those where active disease is excluded [12].
Data
Data from the VTP which are stored by the Victorian Department of Health and Human Services (DHHS) were used for the following analysis. Index patients were classified as confirmed cases of TB notified from 1 January 2005 to 31 December 2015 in residents of Victoria. The data set includes contact tracing information and results of testing for M. tb infection, with cases of subsequent active TB disease linked to these contact episodes now extending to March 2017 (see [14] for earlier publication of linkage process). We constructed empirical offspring distributions from the detailed contact tracing data set of the VTP.
Ethical approval was obtained from Monash University, Human Research Ethics Committee (Project Number: 7776) and permission was given by the VTP and DHHS.
Definitions
A microbiologically confirmed case of TB requires culture or polymerase chain reaction (PCR) diagnosis of Mycobacterium tuberculosis, while clinical/radiological diagnosis may also be made by a medical practitioner experienced in TB management. Approximately 90% of TB cases in Victoria are bacteriologically confirmed [15]. All cases diagnosed with active TB in this dataset also underwent secondary case review by a TB specialist to ensure that guidelines for confirming TB disease were met.
Contacts were individuals identified as having had personal contact with index patients by the VTP, through school, workplace, household and other settings. Contact latent TB infection (LTBI) was defined as a TST result of ≥10 mm in an identified contact. Where contacts had had multiple TSTs, we used the value of the latest TST result performed within three months of exposure. Any contact developing active TB during the stated period until 21 March 2017 was considered to have secondary TB. We define a superspreading event as the number of secondary infections per index case that was greater than the 99th centile of the equivalent Poisson distribution (with distribution mean equal to the mean number of infections per index).
Fitting distributions to data
We were primarily interested in the distribution of the number of secondarily infected contacts from each index patient. Although superspreading is usually defined in terms of the number of secondary cases of active disease produced by each index patient, we wished to estimate parameters in the absence of preventive therapy. As isoniazid prophylaxis is used widely in our setting and its use may be clustered according to index patients (e.g. family members electing together to undertake preventive treatment), we anticipated this could artefactually inflate our estimates of overdispersion. Although there are contact factors that are likely to affect progression to active disease after infection, these factors may not be differentially distributed by index patient and the distribution of infections would be unaffected by use of preventive therapy. Therefore, the number of secondarily infected cases is our primary analysis throughout the remainder of the paper, although analogous analyses are presented for the distribution of contacts (close contacts and all contacts) and secondary cases of active disease to facilitate epidemiological interpretation.
Our primary outcome can be described using a probability distribution termed an offspring distribution, defined as the probability of the number of transmission events across the range of index TB patients. This process can be modelled by a negative binomial distribution (NBD), which has the advantage of being able to accommodate overdispersed count data [16,17,18]. The NBD permits sufficient flexibility with only two parameters (the shape parameter and the mean) [4] and subsumes the Poisson distribution while also allowing for “overdispersion”, where the variance (of the offspring distribution) may be greater than the mean [18].
We denote the individual reproductive number by v and, the distribution of individual reproductive numbers (offspring distribution) by Z. To incorporate individual infectious histories, v follows a negative binomial offspring distribution with dispersion parameter k and mean m, such that Z~NegB(m, k). The dispersion parameter k quantifies the extent of overdispersion in the count data. If there is extraheterogeneity between index patients in the number of secondary infections produced, dispersion increases and the parameter k approaches zero (k → 0). In the absence of overdispersion, k → ∞ and the mean and the variance approach parity, with the negative binomial distribution reducing to the Poisson distribution. If k = 1, the negative binomial distribution reduces to the geometric distribution, such that the negative binomial model can accommodate Poisson, geometric and overdispersed distributions [16].
The probability of observing index patients with v number of infected contacts is given by:
As the variance m(1 + (m/k)) approaches the mean (m), overdispersion decreases, i.e. k → ∞.
Interpretation of transmission parameters
The parameters, k and m were estimated by maximum likelihood estimation (MLE), which provides unbiased estimates, especially for large sample sizes [16]. The MLE of the mean of the offspring distribution, m is the sample mean of Z or the mean number of secondary infections. The dispersion parameter, k, was estimated after fitting the data to the negative binomial distribution using the MASS package [19] of the R environment for statistical computing [20], with a value of k less than one interpreted as evidence of superspreading [10].
Superspreading events
We used the protocol proposed by LloydSmith et al [4] in which: first, we calculated the mean number of secondary infections per index or effective reproductive number, R_{n}; second, we constructed a Poisson distribution with mean R_{n}, representing the range of Z (offspring distribution) due to stochasticity without individual variation; third, we define a superspreading event as any patient who infected more than Z(i) contacts, where Z(i) is i^{th} centile of the offspring Poisson distribution. Arbitrarily but as in this previous study for SARS, we considered the 99th percentile of this distribution as the cutpoint to determine superspreading events. For prediction of the expected proportion of superspreading event, we produce a negative binomial distribution with dispersion parameter, k, and the mean number of secondary infections per index, R_{n} [4, 21].
Identifying associations with index characteristics
The outcome variable was the number of secondary infections per index patient, which was found to be overdispersed as described below. Therefore, we fitted a negative binomial regression model with both bivariate and multivariate regression with the MASS package [19] of the R environment for statistical computing [20]. The logarithmic scale coefficients were exponentiated to give ratios and are presented with their 95% confidence intervals (CI).
We evaluated the need for a negative binomial regression model (because of inequality of the conditional mean and conditional variance) with the likelihood ratio test. We also evaluated the predictive accuracy of the model with rootograms [22].
Results
The Victorian TB program data had a total of 4190 confirmed TB index patients and 18,030 contacts within the period from 1 January 2005 to 31 December 2015. The mean age of index patients was 33.0 years and 54.9% were male. Among index patients, 1878 (44.8%) were extrapulmonary, while 1757 (42.0%) were pulmonary and 555 (13.2%) had both pulmonary plus other site involvement.
The average age of contacts was 28.4 years, 9276 (51.4%) were female and 8510 (47.2%) were male (with 244 (1.4%) contacts sex not stated). The majority of contacts (15,031; 83.4%) were contacts of pulmonary only TB patients, while (2988; 16.6%) were contacts of patients of TB at pulmonary plus other sites and only 11 contacts were identified from EPTB patients (although all EPTB were considered to have produced zero secondary infections and secondary cases). Henceforward we use the term “pulmonary” to refer to any patient with pulmonary involvement, i.e. both the “pulmonary only” and the “pulmonary plus other sites” categories. There were five categories for the types of contacts, 8059 close contacts, 5484 school contacts, 2366 work contacts, 1286 casual contacts and 824 contacts from other congregate setting such as hospitals, nursing homes, airlines and childcare facilities.
Secondary infection distribution
A total of 15,522 of 18,019 contacts of PTB index patients were tested with TST. Based on our cutoff for diagnosing infection as those with a TST result of ≥10 mm as positive, 3213 of contacts were infected and 12,309 were not. Of 3213 infected contacts 2050 (63.8%) were close contacts. Of the 4190 index patients (1878 extrapulmonary and 2312 pulmonary) 3166 (75.6%) did not produce any secondary infection, with all extrapulmonary patients assumed not to have produced any secondary infections (Additional file 1). There were 26 cluster sizes of secondary infection, ranging from zero infections to 41 infections per index. The mean and variance of the number of secondary infections was 0.77 and 5.06 infections per index respectively. The median number of secondary infections per index patient was zero, the 95th centile was two and the 99th centile was 10 infections per index. By fitting the NBD to the observed distribution of secondary infection, we found evidence of overdispersion (k = 0.16, 95%CI 0.14–0.17) for all types of contacts.
Restricting our analysis to index patients with pulmonary involvement only, the mean and variance of the number of infections per index was 1.4 and 8.3 infections per index, respectively with evidence of overdispersion (k= 0.36, 95%CI 0.33–0.40) (Fig. 1a). In another restricted analysis of close contacts only, there were 2042 secondary infections, of which 771 (37.8%) were superspreading events, with less dispersion compared to all contact types (k= 0.98, 95%CI 0.84–1.12).
Superspreading events
We constructed a Poisson distribution with the mean number of infections per index (i.e. 0.77) to establish a cutoff number of secondary infections per index for defining superspreading events. The 99th centile was three infections per index. Therefore, we classified transmission events where index patients produced three or more secondary infections as superspreading events. Accordingly, there were 414 (9.9% index patients) associated with superspreading events, which accounted for a total of 2415 (75.2%) of the 3213 secondary infections. For predicting the number of superspreading events in TB transmission, we estimated the expected proportion of index patients, with confidence intervals considering the dispersion parameter, k = 0.16 and the effective reproductive number,R_{n} = 0.77. With this approach, the expected proportion of TB superspreading events was 9.8% (95% CI: 8.9–10.6%).
Secondary active TB disease distribution
We further analysed infectiousness heterogeneity from the number of secondary active TB cases per index TB patient. There were 226 secondary active TB cases identified among 18,030 contacts. Among these secondary TB cases, approximately half (116; 51.3%) were the sole secondary case identified among the contacts of a specific index patient. Among 4190 index patients, only 137 (3.3%) were responsible for all 226 secondary TB cases. Two of the secondary cases were contacts of extrapulmonary index patients (maternal to foetal transmission in utero). The largest cluster of secondary TB disease was 12 cases. The distribution of secondary TB cases per index, was over dispersed with the dispersion parameter estimated at k = 0.036 (95%CI 0.025–0.046) (Fig. 1b).
Contact distribution
We also investigated the individual variation among index patients with respect to the number of contacts identified to determine how the transmission heterogeneity could be related to contact patterns. There was evidence of heterogeneity (k= 0.38, 95%CI 0.36–0.41) for the distribution all contact types (Additional file 2). Similar but lesser heterogeneity was found (k = 0.63, 95%CI 0.59–0.68) after restricting our analysis to close contacts only (Additional file 3).
Associations of index characteristics with number of infections
Because of the extent of heterogeneity described above, we fitted a negative binomial regression model for index patients with pulmonary involvement to determine the characteristics of index patients that were associated with the number of secondary infections. The index characteristics included were age, sex, site of disease, patient detection pathway, CXR result, method of diagnosis, whether the patient was new or relapse and number of contacts per index.
From this multivariate model, TB in pulmonary and additional sites (compared to pulmonary only) was independently associated with a 42% decrease in the number of secondary infections. Identification through contact tracing and the Australian postmigration follow up program (“health undertakings” [23]) (compared to clinical presentation) was associated with a lower number of secondary infections (71 and 46% respectively). Diagnosis by PCR, histology or clinical signs (compared to culture) was associated with a 70% decrease in secondary infections, while diagnosis by radiological techniques was associated with a 55% decrease. The number of contacts identified for each index patient showed a positive association with the number of secondary infections produced, with the identification of one additional contact associated with an increase in the number of secondary infections by 4 % (Table 1).
The likelihood ratio test showed that assuming equality of the conditional mean and variance was not safe (Pvalue < 0.001), whereas model evaluation indicated that the negative binomial model fitted well to the observed data. Compared to the equivalent Poisson model, the negative binomial model was wellfitted and predicted counts well with minimal residuals (Additional file 4).
Discussion
To our knowledge, this study is the first to use programmatic epidemiological observations to formally quantify M. tb transmission heterogeneity. We found evidence of superspreading events as constituting the large majority of M. tb transmission events. We also demonstrated considerable variability between index TB patients in three important respects: in the number of contacts identified, the number of contacts infected and the number of cases of secondary active TB subsequently occurring. Therefore, assuming a homogeneous population of infective patients in TB transmission modelling may be highly unrealistic. From a programmatic viewpoint, although the effective reproduction number is less than one in our setting, significant transmission may still occur due to a very small number of superspreading events (Fig. 2).
We found that M. tb transmission is heterogeneous, with the distribution of secondary infections per index varying by more than simple random variation between individuals. The level of overdispersion was comparable with the estimate from a previous study that employed genotypic data (k=0.1) [10]. Although the distribution of secondarily infected contacts per index patient is probably a better marker for M. tb transmission than genotypic clusters, our approach may even underestimate heterogeneity (overestimate k) due to dilution of differences between patients from more homogeneously distributed distant past infection (i.e. prior to the index exposure identified). Our estimate of k was slightly higher (i.e. less heterogeneous) than an estimate for SARS transmission heterogeneity (k=0.1) [4], though our finding may underestimate the true level of heterogeneity. The distribution of secondary cases was even more heterogeneous (k< 0.04) than secondary infections. This finding was also more heterogeneous compared to the previous TB estimate from genotypic data (k = 0.1) [10] and other infectious diseases such as SARS [4]. However, the effective use of preventive therapy in this setting and long incubation period could lead to an underestimate of the risk of active TB, as there is a risk of missing late reactivations of TB, although late reactivation are relatively rare in Victoria [24]. The effective use of preventive therapy could also overestimate the true level of heterogeneity in cases of secondary active TB, by differentially reducing the number of secondary cases produced by some index cases. The analyses of contacts demonstrated that these heterogeneities are not solely driven by heterogeneity in contact patterns. Therefore, we believe that the true value of the M. tb transmission dispersion parameter is likely to fall somewhere between the k estimate for secondary infections and secondary active TB distributions.
Based on our definition [4], threequarters of all secondary M. tb infections occurred as a result of superspreading events. Although the average number of secondary infections per index (R_{n}) was less than one (0.77), TB rates may not decline as expected in the population due to this high heterogeneity. Moreover, TB transmission goes well beyond the 20/80 ruleofthumb for infectious disease transmission which states that 80% of transmission is due to only 20% of the population [25], since in our results 20% of index patients produce 90% of secondary infections (Fig. 3).
Our regression analysis identified several important associations between index patients’ characteristics and the number of secondary infections they produced, rather than considering just the proportion of contacts infected as typically done in previous studies [26, 27]. Compared to index patients with pulmonary TB only, index patients with TB involving pulmonary and nonpulmonary sites were less infectious. This may be explained by those with extrapulmonary TB but minor CXR abnormalities often being classified as “pulmonary plus other sites”, with these patients tending to have a low bacillary load and smearnegative pulmonary disease (given that it is wellestablished that pulmonary patients are more infectious than extrapulmonary who have virtually zero infectiousness [28, 29]). The method of patient identification was also an important predictor of infectiousness, with index patients found by clinical presentation being more infectious than those found through contact tracing or postmigration followup. This could be explained by the fact that those patients identified through the passive process of relying upon clinical presentation spend longer infectious, whereas those identified through the more active approaches of contact tracing and postmigration followup allows earlier identification and treatment. This explanation is consistent with delayed diagnosis and treatment being a major predictor of TB patients’ infectiousness [30,31,32,33]. Similarly, index patients diagnosed by culture produced a higher number of secondary infections, which is consistent with patients with culturepositive results having a higher bacillary load [34].
The most important limitation of our study is that some secondary infections might be the result of distant past infections, rather than relating to the contact episode. Our definition of superspreading events is based on contact infection rather than active disease in contacts, since there is no standard definition in the case of TB and the difficulty in interpreting active disease in a setting of widespread use of preventive therapy. However, we argue that contact infection is the best available measure of true M. tb transmission in our setting and this definition could be used for future studies on the disease.
Conclusions
We conclude that M. tb transmission is a highly heterogeneous process in our population and superspreading events are a major driver of transmission. Therefore, it is essential to consider this heterogeneity when modelling TB transmission dynamics and considering control strategies. Future observational studies should characterise superspreading in different epidemiological settings to further characterise this phenomenon.
Abbreviations
 BCG:

Bacillus Calmette–Guérin
 CI:

Confidence Interval
 CXR:

Chest XRay
 DHHS:

Department of Health and Human Services
 IGRA:

Interferon Gamma Release Assay
 LTBI:

Latent Tuberculosis Infection
 M. tb :

Mycobacterium tuberculosis
 PCR/NAT:

Polymerase Chain Reaction/Nucleic Acid Test
 RR:

Rate Ratio
 TB:

Tuberculosis
 TST:

Tuberculin Skin Test
 VTP:

Victorian Tuberculosis Program
 WHO:

World Health Organization
References
 1.
WHO. Global tuberculosis report 2016. Switzerland: World Health Organization; 2016.
 2.
WHO. The End TB Strategy: Global strategy and targets for tuberculosis prevention, care and control after 2015. Geneva: World Health Organization; 2014.
 3.
Pai M, Behr MA, Dowdy D, Dheda K, Divangahi M, Boehme CC, Ginsberg A, Swaminathan S, Spigelman M, Getahun H, et al. Tuberculosis. Nat Rev Dis Primers. 2016;2:16076.
 4.
LloydSmith JO, Schreiber SJ, Kopp PE, Getz WM. Superspreading and the effect of individual variation on disease emergence. Nature. 2005;438(7066):355–9.
 5.
Shen Z, Ning F, Zhou W, He X, Lin C, Chin DP, Zhu Z, Schuchat A. Superspreading SARS events, Beijing, 2003. Emerg Infect Dis. 2004;10(2):256–60.
 6.
Small M, Tse C, Walker DM. Superspreaders and the rate of transmission of the SARS virus. Physica D: Nonlinear Phenomena. 2006;215(2):146–58.
 7.
Stein RA. Superspreaders in infectious diseases. Int J Infect Dis. 2011;15(8):e510–3.
 8.
Curtis AB, Ridzon R, Vogel R, McDonough S, Hargreaves J, Ferry J, Valway S, Onorato IM. Extensive transmission of mycobacterium tuberculosis from a child. N Engl J Med. 1999;341(20):1491–5.
 9.
Galvani AP, May RM. Epidemiology: dimensions of superspreading. Nature. 2005;438(7066):293–5.
 10.
Ypma RJ, Altes HK, van Soolingen D, Wallinga J, van Ballegooijen WM. A sign of superspreading in tuberculosis: highly skewed distribution of genotypic cluster sizes. Epidemiology. 2013;24(3):395–400.
 11.
Melsew YA, Doan TN, Gambhir M, Cheng AC, McBryde E, Trauer JM. Risk factors for infectiousness of patients with tuberculosis: a systematic review and metaanalysis. Epidemiology & Infection. 2018;146(3):34553.
 12.
Department of Health & Human Services. Management, control and prevention of tuberculosis: Guidelines for health care providers; Victorian Government; 2015.
 13.
Mycobacterial infections (tuberculosis) [https://www2.health.vic.gov.au/publichealth/infectiousdiseases/diseaseinformationadvice/tuberculosis]. Accessed 7 Apr 2017.
 14.
Moyo N, Tay E, Denholm J. Evaluation of tuberculin skin testing in tuberculosis contacts in Victoria, Australia, 2005–2013. Public Health Action. 2015;5(3):188–93.
 15.
Dale K, Tay E, Trevan P, Denholm J. Mortality among tuberculosis cases in Victoria, 2002–2013: case fatality and factors associated with death. Int J Tuberc Lung Dis. 2016;20(4):515–23.
 16.
LloydSmith JO. Maximum likelihood estimation of the negative binomial dispersion parameter for highly overdispersed data, with applications to infectious diseases. PLoS One. 2007;2(2):e180.
 17.
Fisher RA. The negative binomial distribution. Ann Eugenics. 1941;11(1):182–7.
 18.
Bliss CI, Fisher RA. Fitting the negative binomial distribution to biological data. Biometrics. 1953;9(2):176–200.
 19.
Venables WN, Ripley BD. Modern Applied Statistics with S, Fourth edn. New York: Springer; 2002.
 20.
RCoreTeam. R: A Language and Environment for Statistical Computing. Vienna: R Foundation for Statistical Computing; 2014.
 21.
LloydSmith JO, Schreiber SJ, Getz WM. Moving beyond averages: Individuallevel variation in. In: Mathematical Studies on Human Disease Dynamics: Emerging Paradigms and Challenges: AMSIMSSIAM Joint Summer Research Conference on Modeling the Dynamics of Human Diseases: Emerging Paradigms and Challenges, July 1721, 2005, Snowbird, Utah: 2006: American Mathematical Soc; 2006. p. 235.
 22.
Kleiber C, Zeileis A. Visualizing count data regressions using rootograms. Am Stat. 2016;70(3):296–303.
 23.
Flynn M, Brown L, Tesfai A, Lauer T. Postmigration screening for active tuberculosis in Victoria, Australia. Int J Tuberc Lung Dis. 2012;16(1):50–4.
 24.
Trauer JM, Moyo N, Tay EL, Dale K, Ragonnet R, McBryde ES, Denholm JT. Risk of active tuberculosis in the five years following infection... 15%? Chest J. 2016;149(2):516–25.
 25.
Woolhouse ME, Dye C, Etard JF, Smith T, Charlwood J, Garnett G, Hagan P, Hii J, Ndhlovu P, Quinnell R. Heterogeneities in the transmission of infectious agents: implications for the design of control programs. Proc Natl Acad Sci. 1997;94(1):338–42.
 26.
Carvalho AC, Deriemer K, Nunes ZB, Martins M, Comelli M, Marinoni A, KRITSKI AL. Transmission of mycobacterium tuberculosis to contacts of HIVinfected tuberculosis patients. Am J Respir Crit Care Med. 2001;164(12):2166–71.
 27.
Faksri K, Reechaipichitkul W, Pimrin W, Bourpoern J, Prompinij S. Transmission and risk factors for latent tuberculosis infections among index casematched household contacts. Southeast Asian J Trop Med Public Health. 2015;46(3):486.
 28.
Godoy P, Cayla JA, Carmona G, Camps N, Alvarez J, Rodes A, Altet N, Pina JM, Barrabeig I, Orcau A, et al. Immigrants do not transmit tuberculosis more than indigenous patients in Catalonia (Spain). Tuberculosis. 2013;93(4):456–60.
 29.
Lohmann EM, Koster BFPJ, Le Cessie S, Kamstvan Agterveld MP, Van Soolingen D, Arend SM. Grading of a positive sputum smear and the risk of mycobacterium tuberculosis transmission. Int J Tuberc Lung Dis. 2012;16(11):1477–84.
 30.
Tornee S, Kaewkungwal J, Fungladda W, Silachamroon U, Akarasewi P, Sunakorn P. Risk factors for tuberculosis infection among household contacts in Bangkok, Thailand. Southeast Asian J Trop Med Public Health. 2004;35(2):375–83.
 31.
Golub J, Bur S, Cronin W, Gange S, Baruch N, Comstock G, Chaisson R. Delayed tuberculosis diagnosis and tuberculosis transmission. Int J Tuberc Lung Dis. 2006;10(1):24–30.
 32.
Lin X, Chongsuvivatwong V, Lin L, Geater A, Lijuan R. Dose–response relationship between treatment delay of smearpositive tuberculosis patients and intrahousehold transmission: a crosssectional study. Trans R Soc Trop Med Hyg. 2008;102(8):797–804.
 33.
Mendes MA, Gaio R, Reis R, Duarte R. Contact screening in tuberculosis: can we identify those with higher risk? Eur Respir J. 2013;41(3):758–60.
 34.
O'Shea MK, Koh GC, Munang M, Smith G, Banerjee A, Dedicoat M. Timetodetection in culture predicts risk of mycobacterium tuberculosis transmission: a cohort study. Clin Infect Dis. 2014;59(2):177–85.
Acknowledgments
The authors would like to thank Monash University for providing the PhD scholarship to YAM. We are also thankful the Victorian Department of Health and Human Services for providing data.
Funding
Y. A. Melsew is a recipient of a Monash Graduate Scholarship for his PhD study. J. M. Trauer is a recipient of an Early Career Fellowship from the National Health and Medical Research Council. No specific funding was received for this study.
Availability of data and materials
The data that support the findings of this study are available from the Victorian Department of Health and Human Services but restrictions apply to the availability of these data, which were used under license for the current study, and so are not publicly available. Data are however available from the authors upon reasonable request and with permission of the Victorian Department of Health and Human Services.
Author information
Affiliations
Contributions
YAM, MG and JMT conceptualised the study. YAM developed the protocol and, JMT and MG reviewed it. ET and YAM performed data extraction and cleaning. YAM, ACC, ESM, JTD and JMT undertook the analysis. YAM drafted the manuscript and all authors involved in consecutive revisions of the manuscript. All authors reviewed and approved the final manuscript.
Corresponding author
Correspondence to Yayehirad A. Melsew.
Ethics declarations
Ethics approval and consent to participate
Ethical approval was obtained from Monash University, Human Research Ethics Committee (Project Number: 7776) and permission was given by the Victorian Department of Health and Human Services and the Victorian Tuberculosis Program. Since we used secondary data that were stored by the Victorian Department of Health and Human Services, we did not seek individual consent.
Consent for publication
Not applicable.
Competing interests
The authors declare that they have no competing interests.
Publisher’s Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Additional files
Additional file 1:
Figure S1. Schematic presentation of number of index TB patient and contacts in Victoria, for the period 2005–2015. (DOCX 52 kb)
Additional file 2:
Figure S2. Distribution of number of contacts per index TB patient in Victoria, for the period 2005–2015. A. All contacts (with negative binomial distribution fitted to count data) strategies, the number of index patients with zero contacts was 639 (beyond limit of vertical axis). B. Subset of contacts (0–40 contacts per index only). (DOCX 34 kb)
Additional file 3:
Figure S3. Distribution of number of Close contacts per index TB patient in Victoria, for the period 2005–2015. A. All Close contacts (with negative binomial distribution fitted to count data). B. Subset of close contacts (0–40 close contacts per index), the number of index patients with zero close contacts was 653 (beyond limit of vertical axis). (DOCX 34 kb)
Additional file 4:
Figure S4. Hanging rootograms for a Poisson model (upper panel) and negative binomial model (lower panel) count data. (DOCX 37 kb)
Rights and permissions
Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
About this article
Received
Accepted
Published
DOI
Keywords
 Tuberculosis
 Superspreading
 Negative binomial distribution
 Victoria