# Estimating the impact of school closure on social mixing behaviour and the transmission of close contact infections in eight European countries

*BMC Infectious Diseases* **volume 9**, Article number: 187 (2009)

## Abstract

### Background

Mathematical modelling of infectious disease is increasingly used to help guide public health policy. As directly transmitted infections, such as influenza and tuberculosis, require contact between individuals, knowledge about contact patterns is a necessary pre-requisite of accurate model predictions. Of particular interest is the potential impact of school closure as a means of controlling pandemic influenza (and potentially other pathogens).

### Methods

This paper uses a population-based prospective survey of mixing patterns in eight European countries to study the relative change in the basic reproduction number ($R_0$ - the average number of secondary cases from a typical primary case in a fully susceptible population) on weekdays versus weekends and during regular versus holiday periods. The relative change in $R_0$ during holiday periods and weekends gives an indication of the impact collective school closures (and prophylactic absenteeism) may have during a pandemic.

### Results

Social contact patterns differ substantially when comparing weekdays to the weekend and regular to holiday periods, mainly due to the reduction in work and/or school contacts. For most countries, the basic reproduction number decreases from the week to weekends and from regular to holiday periods by about 21% and 17%, respectively. However, for other countries no significant decrease was observed.

### Conclusion

We use a large-scale social contact survey in eight different European countries to gain insight into the relative change in the basic reproduction number on weekdays versus weekends and during regular versus holiday periods. The resulting estimates indicate that school closure can have a substantial impact on the spread of a newly emerging infectious disease that is transmitted via close (non-sexual) contacts.

## Background

Mathematical models of how infectious diseases spread from person to person through close contacts rely on assumptions regarding the underlying transmission process. These assumptions are often summarized in the so-called 'Who Acquires Infection from Whom' matrix (WAIFW). The WAIFW matrix expresses the rate at which a susceptible individual is infected by an infectious individual and is a determinant of the basic reproduction number. Since the structure of the WAIFW matrix is both very uncertain and influential for quantitative model projections, several authors have tried to obtain direct information on social mixing behaviour using social contact surveys [1–7] or alternatively time use surveys and social network analysis [8, 9]. Whereas most of these studies were based on small and unrepresentative samples, Mossong et al. [6] published the results of large and representative population based surveys on social contacts recorded on a randomly assigned day in eight European countries. Hens et al. [7] provided an in-depth analysis for one of these country surveys (Belgium), which collected information on two randomly assigned days per participant. From these studies and subsequent work, it has become clear that social contact data provide crucial information for dynamic models, aiming to simulate person to person transmission of close-contact infections [3, 8, 10, 11] and (Melegaro, A., Jit, M., Gay, N., Zagheni, E., Edmunds, W.J. What types of contacts are important for the spread of infectious diseases? Using contact survey data to explore European mixing patterns, submitted).

In this paper, we revisit the data of Mossong et al. [6] and provide a more in-depth discussion of the change in mixing behaviour from the week to weekends and from regular to holiday periods by estimating the social contact matrices for the different countries for both a day during the week and a day on the weekend. If available, we also compared holiday with non-holiday ('regular') periods. Throughout this paper we define 'the week' as Monday until Sunday and 'weekday' or 'working week' as Monday until Friday. When we state that we compare the week with the weekend, we actually compare an average day of the week with an average day of the weekend.

By comparing these period-specific contact matrices we estimate the associated change in basic reproduction numbers $R_0$. As schools are closed during holiday and weekend periods, the relative change in $R_0$ provides an indication of the impact collective school closures may have (see e.g. [12]).

Since such data were collected in each of the countries, we can study the differences in mixing behaviour between countries and assess the differential impact of holidays and weekends on 'regular' mixing and $R_0$ in the various countries. This comparison would reflect the change in the way people mix at school and work since school activities are reduced to a minimum and most people do not attend work on the weekend. Additionally, for some countries, contacts were reported during either school or public holidays. We believe school holiday periods to be a better proxy for school closure than public holidays (during the (pre-summer) school holidays it is likely that most adults continue working, whereas this is unlikely on public holidays). However, the comparison between school and public holidays was not made because of small sample sizes for either of the two for several of the different countries. Still, by comparing the regular (non-holiday) and holiday periods, the impact on mixing behaviour - especially due to changes in contact behaviour for children and adolescents - can be studied. Note that during the holiday periods childcare may very well substitute school attendance for young children, implying that mixing behaviour is modified in more than one way (e.g. grandparents taking more care of children, see [7]).

In the next section, we briefly introduce the data. In the subsequent section, we introduce the regression models used to study the effects of participant characteristics on the number of contacts people make. We show how social contact matrices can be estimated and how this relates to the estimation of the next-generation operator and the basic reproduction number. Note that the reader may wish to skip this more technical part, which is non-essential to understand the remainder of the paper. In the results section we report our findings and we end with a discussion.

## Methods

### Data

A population-based prospective survey of mixing patterns in eight European countries (Belgium (BE), Great Britain (GB), Finland (FI), Germany (DE), Italy (IT), Luxembourg (LU), Poland (PL) and The Netherlands (NL)) using a common paper diary methodology was conducted as part of the POLYMOD project [6]. The study covered all age groups. A total of 7290 participants recorded characteristics of 97904 contacts during one day. The surveys were conducted between May 2005 and September 2006. A contact was defined as either a non-physical contact (a two-way conversation of three or more words in the physical presence of another person without physical contact) or a physical contact (a two-way conversation with skin-to-skin touching).

Survey participants were recruited in such a way as to be broadly representative of the whole population in terms of geographical spread, age and sex. In BE, IT and LU survey participants were recruited by random digit dialling using land lines; in GB, DE and PL survey participants were recruited through a face-to-face interview; survey participants in NL and FI were recruited via population registers. Children and adolescents were deliberately oversampled, because of their important role in the spread of infectious agents. Only one person in each household was asked to participate in the study. Paper diaries were sent by mail or given face to face. Participants were instructed by telephone or in person on how to complete the diary. They were asked to provide contextual information about the age, sex, location and 'usual contact' frequency of each contacted person. Diaries were translated into local languages. For more information on these surveys we refer to Mossong et al. [6].

We highlight two aspects mentioned by these authors. First, contacts at work were reported differently in the different surveys due to between-country differences in survey design (see Table 1). These differences were ignored in the analyses as presented by Mossong et al. [6]. Second, the sample period for some of the countries included at least one local holiday period (Table 1). Since schools and child care centres are typically closed during these periods, we investigate the relative impact of holiday periods on social contact patterns (see [7] for such an analysis focused on BE). Moreover, we also compare contact patterns during the weekend and the week. The latter analysis could not be conducted in the regular-holiday strata because of the small sample sizes and thus warrants a marginal interpretation. In the analyses, we define a weekend to be regular when it falls in between two regular weeks and as a holiday otherwise.

### Methodology

In this section the methodology used to identify the factors that influence the number of reported contacts is explained. We start from the model proposed by Mossong et al. [6] and then show how we included work contacts. We then show how the relative impact of looking at various types of contacts on the basic reproduction number can be established.

### Modelling the number of contacts

The response of interest, i.e. the participant's number of contacts within a day, is a count and a Poisson distribution seems a plausible assumption. However, the Poisson distribution assumes the equality of mean and variance, a property that is rarely fulfilled in practice. Therefore, we consider the negative binomial distribution which explicitly models overdispersion, i.e. the variance is allowed to be larger than the mean. Often, overdispersion is caused by an excess variation between response probabilities or counts, possibly originating from omitting important explanatory predictors [13]. Denoting by $\mu$ the mean parameter of the negative binomial distribution, the variance is given by $\mu + \alpha\mu^{2}$, where $\alpha \geq 0$ is the overdispersion parameter. When $\alpha = 0$, the negative binomial distribution simplifies to the Poisson distribution.

Since for some of the surveys the number of possible contact entries was limited, the number of contacts is right censored. Although we could take the country-specific censoring count, for uniformity we opted to take the minimum of these limits, i.e. 29 contacts for the survey in GB (Table 1). To accommodate post-stratification with respect to age and household size in each country, i.e. factors known to influence contact behaviour, we weight the individual contributions. The log-likelihood function for the weighted censored negative binomial is

$$
\ell(\beta, \alpha) = \sum_{i} u_i \left\{ \delta_i \log P(Y = y_i \mid X_i) + (1 - \delta_i) \log\left[ 1 - \sum_{y=0}^{28} P(Y = y \mid X_i) \right] \right\}, \tag{i}
$$

where $\delta_i = 1$ if $y_i < 29$ and 0 otherwise, $u_i$ is the post-stratification weight of observation $i$, $y_i$ is the number of contacts (including work contacts) for observation $i$, $X_i$ is the vector of explanatory variables and $P$ is the density function for the negative binomial distribution:

$$
P(Y = y \mid X_i) = \frac{\Gamma(y + \alpha^{-1})}{\Gamma(y + 1)\,\Gamma(\alpha^{-1})} \left( \frac{\alpha\mu}{1 + \alpha\mu} \right)^{y} \left( \frac{1}{1 + \alpha\mu} \right)^{\alpha^{-1}}, \tag{ii}
$$

where $\mu = \mu(X_i) = \exp(X_i \beta)$ is the mean parameter with $\beta$ the vector of coefficients.
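The weighted, right-censored negative binomial log-likelihood described above can be sketched in a few lines. This is only an illustration under stated assumptions: the function names (`nb_logpmf`, `censored_nb_loglik`) and the list-based data layout are hypothetical, the density uses the standard mean/overdispersion parameterisation with variance $\mu + \alpha\mu^2$, and no claim is made that it matches the software the authors used.

```python
import math

CENSOR = 29  # censoring limit named in the text (smallest diary capacity, GB)

def nb_logpmf(y, mu, alpha):
    """Log density of a negative binomial with mean mu and variance mu + alpha*mu**2."""
    if alpha == 0:
        # Poisson limit of the negative binomial
        return y * math.log(mu) - mu - math.lgamma(y + 1)
    r = 1.0 / alpha
    return (math.lgamma(y + r) - math.lgamma(y + 1) - math.lgamma(r)
            + y * math.log(alpha * mu / (1.0 + alpha * mu))
            - r * math.log(1.0 + alpha * mu))

def censored_nb_loglik(y, X, u, beta, alpha, censor=CENSOR):
    """Weighted log-likelihood; counts >= censor contribute log P(Y >= censor)."""
    total = 0.0
    for y_i, x_i, u_i in zip(y, X, u):
        # log link: mu = exp(X_i beta)
        mu = math.exp(sum(b * x for b, x in zip(beta, x_i)))
        if y_i < censor:
            contrib = nb_logpmf(y_i, mu, alpha)
        else:
            below = sum(math.exp(nb_logpmf(k, mu, alpha)) for k in range(censor))
            contrib = math.log(max(1.0 - below, 1e-300))  # guard against log(0)
        total += u_i * contrib
    return total
```

In practice this function would be handed to a numerical optimiser to estimate $\beta$ and $\alpha$; that step is omitted here.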

Empirical count data are frequently not only characterized by overdispersion but also excess zeros. Zero-inflated count models provide a parsimonious yet powerful way to model this type of situation. Such models assume that the data are a mixture of two separate data generation processes: one generates only zeros, and the other is either a Poisson or a negative binomial data-generating process. The result of a Bernoulli trial is used to determine which of the two processes generates an observation. A standard negative binomial model would not distinguish between these two processes, but a zero-inflated model allows for this complication. We contrasted the weighted censored negative binomial regression in (i) and (ii) with its zero-inflated version. The latter is found by replacing (ii) by

$$
P^{*}(Y = y_i \mid X_i, Z_i) =
\begin{cases}
\pi + (1 - \pi)\, P(Y = 0 \mid X_i), & y_i = 0, \\
(1 - \pi)\, P(Y = y_i \mid X_i), & y_i > 0,
\end{cases} \tag{iii}
$$

where $\pi = \pi(Z_i)$ denotes the probability of the zeros-governing process and $P(Y = y_i \mid X_i)$ denotes the negative binomial density function in (ii). Note that the covariate vector $Z_i$ is used to allow this probability to depend on covariates which may differ from $X_i$. If $\pi = 0$, the zero-inflated negative binomial model simplifies to the negative binomial model. Comparing the different models can be done using the likelihood ratio test [14].
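To make the mixture structure concrete, the following minimal sketch evaluates a zero-inflated negative binomial probability for a fixed zero-inflation probability. The names `nb_pmf` and `zinb_pmf` are hypothetical, `pi` is passed as a plain number rather than modelled through covariates, and $\alpha > 0$ is assumed.

```python
import math

def nb_pmf(y, mu, alpha):
    """Negative binomial probability with mean mu and variance mu + alpha*mu**2 (alpha > 0)."""
    r = 1.0 / alpha
    log_p = (math.lgamma(y + r) - math.lgamma(y + 1) - math.lgamma(r)
             + y * math.log(alpha * mu / (1.0 + alpha * mu))
             - r * math.log(1.0 + alpha * mu))
    return math.exp(log_p)

def zinb_pmf(y, mu, alpha, pi):
    """Mixture: with probability pi a structural zero, otherwise a negative binomial draw."""
    base = (1.0 - pi) * nb_pmf(y, mu, alpha)
    return pi + base if y == 0 else base
```

Setting `pi = 0` recovers the ordinary negative binomial probabilities, which is the nesting that makes a likelihood ratio comparison possible.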

Since professional contacts were not systematically surveyed in the same way for the different countries, the aforementioned methodology cannot be applied directly. Indeed, in the diary for some countries (BE, DE, FI and NL) participants were instructed not to list their professional contacts, if the number of professional contacts was greater than 20 (for participants from BE) or greater than 10 (for participants from DE, FI and NL, see Table 1). Whereas Mossong et al. [6] used the weighted censored negative binomial model from the recorded individual contact data only, in the current paper we extend their model by taking these extra professional contacts into account, thus improving the comparability of the results between countries.

### Estimating Social Contact Matrices

In this section, we outline how the country-specific social contact matrices have been estimated. We arrange the weighted average number of counts by age classes in a "social contact matrix" $M$. Each matrix element $m_{ij} = E(Y_{ij})$ gives the mean number of contacts per day by a participant of age class $j$ with persons in age class $i$. Consider the random variable $Y_{ij}$, the number of contacts in age class $i$ during one day as reported by a respondent in age class $j$ ($i = 1, \ldots, I$, $j = 1, \ldots, J$), which has observed values $Y_{ijk}$, $k = 1, \ldots, n_j$, where $n_j$ denotes the number of participants in the contact survey belonging to age class $j$. We considered 5 year age bands. The contact rates $c_{ij}$ are related to the social contact matrix by $c_{ij} = m_{ij}/w_i$, where $w_i$ denotes the country-specific population size in age class $i$, obtained from demographical data (EUROSTAT, 2006). We use a generalized linear model with negative binomial response distribution and bivariate smoothing approach [15] to estimate the number of contacts during a day in age class $i$ by participants in age class $j$ [6, 7, 10, 11]. For the estimation of the matrix elements $m_{ij}$, we take the reciprocal nature of conversational contacts into account by imposing $c_{ij} = c_{ji}$.
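The reciprocity constraint can be illustrated with a simple post-hoc symmetrisation. Note the assumption: the paper imposes $c_{ij} = c_{ji}$ within the smoothing model itself, whereas the sketch below merely averages the two reported totals for each pair of age classes after estimation. The function name and the toy numbers in the usage note are hypothetical.

```python
def reciprocal_contact_rates(M, w):
    """Return symmetric contact rates C from a contact matrix M and population sizes w.

    M[i][j]: mean contacts per day by a participant in class j with persons in class i.
    w[i]:    population size of age class i.
    """
    n = len(w)
    C = [[0.0] * n for _ in range(n)]
    for i in range(n):
        for j in range(n):
            total_ij = M[i][j] * w[j]  # contacts with class i reported by class j
            total_ji = M[j][i] * w[i]  # contacts with class j reported by class i
            # average the two totals, then divide by both population sizes
            C[i][j] = (total_ij + total_ji) / (2.0 * w[i] * w[j])
    return C
```

For example, `reciprocal_contact_rates([[5.0, 1.2], [2.0, 4.0]], [1000.0, 2000.0])` returns a matrix whose off-diagonal entries are equal. When the input already satisfies $m_{ij} w_j = m_{ji} w_i$, the result reduces to $c_{ij} = m_{ij}/w_i$.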

### Estimation of Next-Generation Matrices

Consider the next generation matrix $G$ with elements $g_{ij}$, denoting the average number of secondary infections in age class $i$ through the introduction of a single infectious individual of age class $j$ into a fully susceptible population. The next generation matrix determines how the risk of infection varies over age classes and is defined by

$$
G = \frac{ND}{L}\,\beta, \tag{iv}
$$

with population size $N$, mean duration of infectiousness $D$ and life expectancy $L$ [16]. $\beta$ denotes the matrix of per capita rates $\beta_{ij}$ at which an individual of age class $i$ makes effective contact, i.e. transferring the infection, with a person of age class $j$. In the literature, this matrix is often called the 'Who Acquires Infection From Whom' or WAIFW-matrix. Assuming individuals are contacted at random within age classes, we introduce a proportionality factor $q$ measuring the disease-specific infectivity and susceptibility and stipulate $\beta_{ij} = q \times c_{ij}$ or $\beta = q \times C$. This so-called social contact hypothesis is tenable only under the reasonable assumption that the contacts from which $C$ is estimated are good proxies for those contacts responsible for disease transmission [3, 10, 11] and (Melegaro, A., Jit, M., Gay, N., Zagheni, E., Edmunds, W.J. What types of contacts are important for the spread of infectious diseases? Using contact survey data to explore European mixing patterns, submitted).

The basic reproduction number $R_0$ (sometimes called basic reproductive rate or basic reproductive ratio), i.e. the mean number of secondary cases a typical single infected case will cause in a population with no immunity to the disease, is the largest eigenvalue of the next generation operator defined in (iv) [16]:

$$
R_0 = \lambda_{\max}(G). \tag{v}
$$

$R_0$ has threshold value 1, in the sense that an epidemic will result from introduction of the infective agent when $R_0 > 1$, while the number of new infections per day declines right after the introduction when $R_0 \leq 1$.

To determine the relative change in $R_0$ from the week to weekends and from regular to holiday periods, we calculate

$$
\frac{R_{0,1}}{R_{0,2}} = \frac{\lambda_{\max}(G_1)}{\lambda_{\max}(G_2)} = \frac{\lambda_{\max}(C_1)}{\lambda_{\max}(C_2)}, \tag{vi}
$$

where $\lambda_{\max}(\cdot)$ denotes the largest eigenvalue and indices 1 and 2 refer to the contacts registered during the weekend and week (Monday to Sunday) or holiday and regular period, respectively. It is straightforward to show that the normalizing constants cancel and thus the ratio relates only to contact data. Using a nonparametric bootstrap on the contact data by participant, 95% percentile confidence intervals for the relative change in $R_0$ can be obtained.
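The ratio of dominant eigenvalues and its percentile bootstrap interval can be sketched as follows. Everything here is illustrative: `dominant_eigenvalue` uses plain power iteration (adequate for non-negative contact matrices with a single dominant eigenvalue), and `build_matrix` is a hypothetical callable standing in for whatever procedure turns a sample of participants into a contact rate matrix, which in the paper involves weighting and bivariate smoothing.

```python
import random

def dominant_eigenvalue(A, iters=1000, tol=1e-12):
    """Largest eigenvalue of a non-negative square matrix by power iteration."""
    n = len(A)
    v = [1.0] * n
    lam = 0.0
    for _ in range(iters):
        w = [sum(A[i][j] * v[j] for j in range(n)) for i in range(n)]
        new_lam = max(abs(x) for x in w)
        if new_lam == 0.0:
            return 0.0
        v = [x / new_lam for x in w]  # rescale so the largest entry is 1
        if abs(new_lam - lam) < tol:
            return new_lam
        lam = new_lam
    return lam

def relative_r0(C1, C2):
    """Ratio of dominant eigenvalues; shared scaling constants cancel."""
    return dominant_eigenvalue(C1) / dominant_eigenvalue(C2)

def bootstrap_ratio_ci(part1, part2, build_matrix, n_boot=1000, seed=1):
    """95% percentile interval, resampling participants with replacement."""
    rng = random.Random(seed)
    ratios = []
    for _ in range(n_boot):
        s1 = [rng.choice(part1) for _ in part1]
        s2 = [rng.choice(part2) for _ in part2]
        ratios.append(relative_r0(build_matrix(s1), build_matrix(s2)))
    ratios.sort()
    return ratios[int(0.025 * (n_boot - 1))], ratios[int(0.975 * (n_boot - 1))]
```

Resampling whole participants, rather than individual contacts, keeps each person's contacts together, in line with the "by participant" bootstrap mentioned in the text.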

## Results

We first describe the results for the number of contacts per participant and then the results for the relative change in basic reproduction number when comparing the different periods.

### Modelling the number of contacts

The results of the weighted, censored, negative binomial regression analysis using participant's age, gender, household size, day of the week, period (holiday or not) and country as explanatory variables are summarized in Table 2.

The dispersion parameter was estimated at 0.41 (95% CI: (0.40, 0.43)), indicating the necessity of taking overdispersion into account. We contrasted the aforementioned model with its zero-inflated version and found that zero-inflation was non-significant (P-value 0.3173). The more parsimonious model was therefore used in further analyses.
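For readers unfamiliar with the mechanics of such a comparison, here is a minimal sketch of a likelihood ratio test with one degree of freedom. The assumptions are loud ones: the source reports neither the test statistic nor the reference distribution used, the log-likelihood values in the usage note are hypothetical, and the function names are made up for illustration.

```python
import math

def chi2_sf_1df(x):
    """Upper tail probability of a chi-square variable with 1 degree of freedom."""
    return math.erfc(math.sqrt(x / 2.0))

def likelihood_ratio_test(loglik_null, loglik_alt):
    """Return the statistic 2*(l_alt - l_null) and its chi-square(1) p-value."""
    stat = 2.0 * (loglik_alt - loglik_null)
    return stat, chi2_sf_1df(max(stat, 0.0))
```

With hypothetical values, `likelihood_ratio_test(-100.5, -100.0)` gives a statistic of 1.0. Under a chi-square distribution with one degree of freedom, a statistic of 1.0 corresponds to a p-value of about 0.3173, the same figure as reported above, although whether the authors' test took this form is not stated.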

Participants in the 10-49 years age-category had the highest number of contacts, while participants above the age of 70 years had the lowest number of contacts followed by children younger than 5 years. There was no difference in the number of contacts made between males and females. Participants living in larger households had a higher number of contacts. Participants have a greater number of contacts during the week than over the weekend, and significantly fewer contacts on Sunday during the weekend. IT and NL have a relatively high number of contacts compared to BE, LU and PL whereas DE, FI and GB have a relatively low number of contacts. The results for DE, GB, IT, LU and PL remained similar as published by Mossong et al. [6]. However inclusion of work contacts proved to be important for BE, FI and NL with a significant rise in the number of contacts made.

The differences between the sample estimates (Mean and Std Dev in Table 2) and the model-based relative number of reported contacts indicate that it is important to control for the different participant characteristics.

### Estimation of Social Contact Matrices and Relative Change in $R_0$

A negative binomial model with bivariate smoothing approach was used to model the number of contacts per day with age class *i* made by a participant in age class *j*. We illustrate this approach for close contacts on weekdays for the eight different countries as shown in Figure 1. The country-specific patterns are very similar and show a clear assortative structure indicating people most often mix with people of similar age. The non-assortative mixing patterns originate mostly from professional contacts between people of various age-classes. The off-diagonals show mixing between age groups and can be seen to indicate social contacts between generations (e.g. in families between children-parents-grandparents).

From the estimated $M$-matrix, we derived the relative change in $R_0$ as outlined in the methods section. The relative changes in $R_0$, comparing the week to weekends on the one hand and regular to holiday periods on the other hand, are summarized in Tables 3 and 4, together with their 95% bootstrap-based confidence intervals based on 1000 bootstrap samples. Note that, whenever necessary, weights were adjusted to make the sample representative for the population at hand. Extra professional contacts were not taken into account in this analysis due to the shortage of additional information for these contacts. Omitting these extra work contacts has shown moderate impact on $R_0$ since the most influential part of the contact surface determining $R_0$ is contacts between children.

Table 3 shows a significant decrease of at least 12% up to 26% in $R_0$ due to all contacts in all countries except DE and FI, in which no significant changes in contact patterns during the weekend were recorded. For close contacts, which are believed to be better proxies for those contacts responsible for the spread of airborne infections (see [10–12]), these differences are less pronounced and the significantly lower $R_0$ are again observed for BE, GB, IT, LU, NL and PL, ranging from 5% to 21%.

The comparison of holiday with regular periods was only possible for BE, GB, LU and NL, because only in these countries the survey was partly carried out during a holiday period. For GB and NL there were regional differences in the dates of holiday periods (Table 1). Since exact information by participant is not available, a sensitivity analysis was conducted, resulting in multiple versions of what can be interpreted as a holiday period: (1) the period encompassing all region-specific holidays (indicated by †) or (2) the holiday period of one or two of the regions only (indicated by ˠ). Although for DE holiday periods were observed, we don't wish to compare them since these periods were state-specific and scattered over the whole sampling period (Table 1). The results in Table 4 show that for BE, GBˠ and NL (NL^{†} and both NLˠ), there is a significant decrease in $R_0$ by 17%, 13% and 40%, respectively. When focusing on close contacts, we estimate a significant decrease in $R_0$ for BE (10%), GBˠ (17%), and NL (45%) whereas no significant difference was observed for LU.

Since $R_0$ is a summary measure of the next generation matrix and thus the contact surface, we zoom in on the relative ratios between the close contact surfaces on weekends and weekdays, and holiday and regular periods, respectively, for countries in which we observed a significant difference. We use a three-category scale based on the 95% bootstrap-based confidence intervals for the cell-specific contact ratios:

where LCL and UCL refer to the lower and upper confidence limit of the 95% bootstrap-based confidence intervals for the cell-specific contact ratios, respectively. Figure 2 and Figure 3 show the resulting score matrices.
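One plausible way to code a three-category scale from the cell-specific confidence limits is sketched below. The category labels (-1, 0, 1) and the exact cut-offs are assumptions made for illustration; the source text here does not preserve the authors' own definition, and the function names are hypothetical.

```python
def score_cell(lcl, ucl):
    """Classify a contact ratio from its 95% confidence limits (assumed coding)."""
    if ucl < 1.0:
        return -1  # ratio significantly below 1: fewer contacts in period 1
    if lcl > 1.0:
        return 1   # ratio significantly above 1: more contacts in period 1
    return 0       # interval contains 1: no significant difference

def score_matrix(LCL, UCL):
    """Apply score_cell element-wise to matrices of lower and upper limits."""
    return [[score_cell(l, u) for l, u in zip(l_row, u_row)]
            for l_row, u_row in zip(LCL, UCL)]
```

Because each cell is classified on its own interval, the resulting matrix is a collection of pointwise comparisons with no adjustment for multiple testing.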

The score matrices show greater off-diagonal mixing (less assortative) and lower (grand)parent-child components for weekends compared to the week (Monday to Sunday). That is, during the week, many contacts occur between individuals of similar age, or between parents and their children. During the weekend, more contact is made between other age groups. Clearly the rates of contact between persons of about 20-50 years are lower for weekends compared to the week due to greater professional activity during the week. A similar observation can be made when comparing holiday to regular periods although the professional contact component is less obvious for BE. Note that the red component in the score matrices is less assortative in children/adolescents for the relative ratio between holiday and regular period when compared to the relative ratio between weekend and weekdays. The result for NL relies on relatively few participants and therefore should not be overinterpreted. In general, these score matrices should be interpreted with caution since sample sizes for higher age-values are small. Moreover, because the scores are obtained from a pointwise comparison of the ratio and the bootstrap samples, multiple testing is not accounted for and the score surface should not be interpreted as a whole.

## Discussion

For a newly emerging infectious disease that is transmitted via close (non sexual) contacts, the range of prevention and control options is often limited, as specific pharmaceutical interventions (such as vaccination) are typically not (yet) available. Instead, mitigation strategies are used that focus on isolating known infectious cases, or - more generally - on reducing contacts between potentially infectious and susceptible persons. School closure is one of the strategies often considered, as children are important spreaders of many close contact pathogens, due to their frequent and intimate social contacts, their general hygiene, and perhaps their increased shedding. In this paper we assessed the impact of social distancing as a consequence of school closure and of work interruption by comparing recorded social contact behaviour during weekends and holiday periods versus the week and regular working periods, respectively. We defined a weekend to be regular when it falls in between two regular weeks and as part of a holiday period otherwise. Note that due to small sample sizes, we could not compare contact patterns between the week and the weekend in the regular/holiday strata. Therefore the results warrant a marginal interpretation.

In general, we observed a lower number of contacts during weekends compared to working weekdays (about 30% difference) and during holiday periods compared to regular periods (9% difference). We quantified the reduction in transmission by comparing the country-specific basic reproduction number for these different periods. Focusing on close contacts, believed to be most predictive for contacts enabling transmission, comparing the week to the weekends, we observed no significant difference in $R_0$ for DE, FI and a significant decrease of 12% to 26% for BE, GB, IT, LU, NL and PL. Comparing holiday to regular periods no significant difference was observed for LU whereas a significant decrease in $R_0$ of 10%, 17% and 45% was found for BE, GB and NL, respectively. On weekends it appears that between-generation mixing becomes more frequent (e.g. through family gatherings), and same age mixing becomes relatively less frequent, particularly in BE, GB, IT, LU, NL and PL. When comparing the relative change in $R_0$ from a working weekday (Monday-Friday) to the weekend (results not shown), we observed an even larger reduction of up to 45%. This finding again indicates a change in mixing behaviour between weekdays and the weekend and consequently the week and the weekend. During holiday periods too, BE, GB and NL show an increase in intergenerational mixing compared to the regular periods, and a decrease in same-age mixing. The Belgian data show that 25 to 35 year olds mix more frequently during holidays within their own age group (presumably because their age does not imply intense mixing in a class room type situation during a regular period, while it may imply that they spend the holidays with their friends rather than within an intergenerational family-type setting).

If we can assume that school closure in a pandemic situation resembles school closure during holiday periods, then our results show that such a strategy would have significant impact on the basic reproduction number. Similarly the additional effect of social distancing in terms of reducing work-related contacts might be observed through social contact information on weekend days. During a pandemic presumably also typical weekend activities with a strong social component such as team sports competition, and cultural outings may not take place, and therefore our estimated reductions in $R_0$ are conservative. Similarly, typical holiday activities such as youth camps may not take place during a pandemic.

In other words, $R_0$ potentially decreases by about 21% when considering these comparisons with weekends and holidays as proxies for school closure and associated work interruptions. Since the latter occur mostly during the weekend (and to a lesser extent during the holidays documented in the periods over which the surveys were carried out), the comparison based on holiday mixing may best approximate the impact of school closure, and the comparison based on weekend mixing may best approximate the impact of a combined school closure and work interruption strategy.

Clearly, care has to be taken when interpreting the results of this study since its design did not aim at a direct comparison of weekdays/weekends and regular/holiday periods. Using post-stratification with population-specific weights we believe we addressed this issue as much as possible. Bearing these caveats in mind, we believe that the current paper produces interesting results in that it directly uses the changes in contact patterns that occur during periods of school and/or work closure. Previous modelling studies of the potential impact of school closure for mitigating a pandemic have relied on assumptions for the reduction in contacts (see e.g. [17–19]), or have relied on assumptions for the redistribution of contacts (compensatory behaviour) [13] during periods of school closure. Several other studies estimated the impact of social distancing for the 1918 pandemic (see e.g. [20–22]) or related settings [23, 24] from incidence data. We have estimated the reduction in contacts that may occur, including the compensatory behaviours. That is, our results are more driven by directly observed data than previous studies.

In summary, these results indicate that school closure would have a substantial impact for several countries whereas for some countries this would have a moderate and for one country (DE) potentially even negative impact (although non-significant here). It is noteworthy that the data collection approach in the German study (DE) digressed substantially from the other countries [6], to the extent that we believe the results based on DE to be subject to markedly more bias compared to the other countries. If transmission occurs via this route, as studies of other close-contact viruses suggest [3, 10, 11] and (Melegaro, A., Jit, M., Gay, N., Zagheni, E., Edmunds, W.J. What types of contacts are important for the spread of infectious diseases? Using contact survey data to explore European mixing patterns, submitted), there is potential for the emergence of complex epidemiological patterns with a decreased incidence in children partly offset by an increase in incidence in adults. A number of economic models have shown that school closure and prophylactic absenteeism have a considerable macroeconomic impact [25, 26] and (Keogh-Brown, M.R., Smith, R.D., Edmunds, W.J., Beutels, P. The macroeconomic impact of pandemic influenza: estimates from models of the UK, France, Belgium and The Netherlands, submitted).

Therefore, these mitigation strategies would have to balance the effects of school closure and prophylactic absenteeism versus the macroeconomic cost of these measures.

## Conclusion

We used a large-scale social contact survey in eight different European countries to gain insight into the relative change in the basic reproduction number on weekdays versus weekends and during regular versus holiday periods. The resulting estimates indicate that school closure can have a substantial impact on the spread of a newly emerging infectious disease that is transmitted via close (non-sexual) contacts.

## References

- 1.
Edmunds W, O'Callaghan C, Nokes D: Who mixes with whom? A method to determine the contact patterns of adults that may lead to the spread of airborne infections. Proceedings of the Royal Society B: Biological Sciences. 1997, 264: 949-957. 10.1098/rspb.1997.0131.

- 2.
Edmunds W, Kafatos G, Wallinga J, Mossong J: Mixing patterns and the spread of close-contact infectious diseases. Emerging Themes in Epidemiology. 2006, 3: 10-10.1186/1742-7622-3-10.

- 3.
Wallinga J, Teunis P, Kretzschmar M: Using data on social contacts to estimate age-specific transmission parameters for respiratory-spread infectious agents. American Journal of Epidemiology. 2006, 164: 936-944. 10.1093/aje/kwj317.

- 4.
Beutels P, Shkedy Z, Aerts M, Van Damme P: Social mixing patterns for transmission models of close contact infections: exploring self-evaluation and diary-based data collection through a web-based interface. Epidemiology and Infection. 2006, 134: 1158-1166. 10.1017/S0950268806006418.

- 5.
Mikolajczyk RT, Akmatov MK, Rastin S, Kretzschmar M: Social contacts of school children and the transmission of respiratory-spread pathogens. Epidemiology and Infection. 2007, 1-10.

- 6.
Mossong J, Hens N, Jit M, Beutels P, Auranen K, Mikolajczyk R, Massari M, Salmaso S, Scalia Tomba G, Wallinga J, Heijne J, Sadkowska-Todys M, Rosinska M, Edmunds J: Social Contacts and Mixing Patterns Relevant to the Spread of Infectious Diseases. PLoS Medicine. 2008, 5: 381-391. 10.1371/journal.pmed.0050074.

- 7.
Hens N, Goeyvaerts N, Aerts M, Shkedy Z, Van Damme P, Beutels P: Mining social mixing patterns for infectious disease models based on a two-day population survey in Belgium. BMC Infectious Diseases. 2009, 9: 5-10.1186/1471-2334-9-5.

- 8.
Zagheni E, Billari FC, Manfredi P, Melegaro A, Mossong J, Edmunds J: Using time-use data to parametrize models for the spread of close-contact infectious diseases. American Journal of Epidemiology. 2008, 168 (9): 1082-1090. 10.1093/aje/kwn220.

- 9.
Del Valle SY, Hyman JM, Hethcote HW, Eubank SG: Mixing patterns between age groups in social networks. Social Networks. 2007, 29: 539-554. 10.1016/j.socnet.2007.04.005.

- 10.
Goeyvaerts N, Hens N, Ogunjimi B, Aerts M, Shkedy Z, Van Damme P, Beutels P: Estimating infectious disease parameters from data on social contacts and serological status. Journal of the Royal Statistical Society, Series C. 2010.

- 11.
Ogunjimi B, Hens N, Goeyvaerts N, Aerts M, Beutels P: Using empirical social contact data to model person to person infectious disease transmission: an illustration for varicella. Mathematical Biosciences. 2009, 218 (2): 80-87. 10.1016/j.mbs.2008.12.009.

- 12.
Cauchemez S, Valleron AJ, Boelle PY, Flahault A, Ferguson NM: Estimating the impact of school closure on influenza transmission from sentinel data. Nature. 2008, 452: 750-754. 10.1038/nature06732.

- 13.
Hilbe J: Negative Binomial Regression. 2007, Cambridge University Press

- 14.
Erdman D, Jackson L, Sinko A: Zero-Inflated Poisson and Zero-Inflated Negative Binomial Models Using the COUNTREG Procedure. SAS Institute Inc., Cary, NC, Paper 322

- 15.
Wood S: Generalized Additive Models: an Introduction with R. 2006, Chapman and Hall/CRC Press

- 16.
Diekmann O, Heesterbeek J, Metz J: On the definition and the computation of the basic reproduction ratio *R*_{0} in models for infectious diseases in heterogeneous populations. Journal of Mathematical Biology. 1990, 28: 365-382. 10.1007/BF00178324.

- 17.
Ferguson NM, Cummings DA, Fraser C, Cajka JC, Cooley PC, Burke DS: Strategies for mitigating an influenza pandemic. Nature. 2006, 442: 448-452. 10.1038/nature04795.

- 18.
Glass RJ, Glass LM, Beyeler WE, Min HJ: Targeted social distancing design for pandemic influenza. Emerging Infectious Diseases. 2006, 12: 1671-1681.

- 19.
Germann TC, Kadau K, Longini IM, Macken CA: Mitigation strategies for pandemic influenza in the United States. Proceedings of the National Academy of Sciences of the United States of America. 2006, 103 (15): 5935-5940. 10.1073/pnas.0601266103.

- 20.
Hatchett RJ: Public health interventions and epidemic intensity during the 1918 influenza pandemic. Proceedings of the National Academy of Sciences of the United States of America. 2007, 104: 7582-7587. 10.1073/pnas.0610941104.

- 21.
Markel H, Lipman HB, Navarro JA, Sloan A, Michalsen JR, Stern AM, Cetron MS: Nonpharmaceutical interventions implemented by US cities during the 1918-1919 influenza pandemic. Journal of the American Medical Association. 2007, 298: 644-654. 10.1001/jama.298.6.644.

- 22.
Cowling BJ, Ho LM, Leung GM: Effectiveness of control measures during the SARS epidemic in Beijing - a comparison of the Rt curve and the epidemic curve. Epidemiology and Infection. 2008, 136: 562-566. 10.1017/S0950268807008722.

- 23.
Heymann A, Chodick G, Reichman B, Kokia E, Laufer J: Influence of school closure on the incidence of viral respiratory diseases among children and on health care utilization. Pediatric Infectious Disease Journal. 2004, 23: 675-677. 10.1097/01.inf.0000128778.54105.06.

- 24.
Heymann AD, Hoch I, Valinsky L, Kokia E, Steinberg DM: School closure may be effective in reducing transmission of respiratory viruses in the community. Epidemiology and Infection. 2009, 137: 1369-1376. 10.1017/S0950268809002556.

- 25.
Keogh-Brown MR, Wren-Lewis S, Edmunds WJ, Beutels P, Smith RD: Calculating the macroeconomic effects on the UK of an influenza pandemic. Health Economics.

- 26.
Keogh-Brown MR, McDonald S, Edmunds WJ, Beutels P, Smith RD: The macroeconomic costs of a global influenza pandemic using the GLOBE model. GTAP conference paper. [https://www.gtap.agecon.purdue.edu/resources/download/3828.pdf]

### Pre-publication history

The pre-publication history for this paper can be accessed here: http://www.biomedcentral.com/1471-2334/9/187/prepub

## Acknowledgements

We thank both referees for their comments, which have led to an improved version of the manuscript. This work has been funded by 'SIMID', a strategic basic research project funded by the Institute for the Promotion of Innovation by Science and Technology in Flanders (IWT), project number 060081; by POLYMOD, a European Commission project funded within the Sixth Framework Programme, Contract number: SSP22-CT-2004-502084; and by the IAP research network nr P6/03 of the Belgian Government (Belgian Science Policy).

## Additional information

### Competing interests

The authors declare that they have no competing interests.

### Authors' contributions

NH drafted the manuscript in consultation with PB, NG, MA, JM and JE; GMA conducted the analyses in consultation with NH, NG and MA. All authors read and approved the final manuscript.

## About this article

### Keywords

- Social Contact
- Negative Binomial Distribution
- Negative Binomial Model
- Basic Reproduction Number
- Contact Pattern