Using sequence data to identify alternative routes and risk of infection: a case-study of campylobacter in Scotland
© Bessell et al; licensee BioMed Central Ltd. 2012
Received: 29 August 2011
Accepted: 1 April 2012
Published: 1 April 2012
Genetic typing data are a potentially powerful resource for determining how infection is acquired. In this paper MLST typing was used to distinguish the routes and risks of infection of humans with Campylobacter jejuni from poultry and ruminant sources
C. jejuni samples from animal and environmental sources and from reported human cases confirmed between June 2005 and September 2006 were typed using MLST. The STRUCTURE software was used to assign the specific sequence types of the sporadic human cases to a particular source. We then used mixed case-case logistic regression analysis to compare the risk factors for being infected with C. jejuni from different sources.
A total of 1,599 (46.3%) cases were assigned to poultry, 1,070 (31.0%) to ruminant and 67 (1.9%) to wild bird sources; the remaining 715 (20.7%) did not have a source that could be assigned with a probability of greater than 0.95. Compared to ruminant sources, cases attributed to poultry sources were typically among adults (odds ratio (OR) = 1.497, 95% confidence intervals (CIs) = 1.211, 1.852), not among males (OR = 0.834, 95% CIs = 0.712, 0.977), in areas with population density of greater than 500 people/km2 (OR = 1.213, 95% CIs = 1.030, 1.431), reported in the winter (OR = 1.272, 95% CIs = 1.067, 1.517) and had undertaken recent overseas travel (OR = 1.618, 95% CIs = 1.056, 2.481). The poultry assigned strains had a similar epidemiology to the unassigned strains, with the exception of a significantly higher likelihood of reporting overseas travel in unassigned strains.
Rather than estimate relative risks for acquiring infection, our analyses show that individuals acquire C. jejuni infection from different sources have different associated risk factors. By enhancing our ability to identify at-risk groups and the times at which these groups are likely to be at risk, this work allows public health messages to be targeted more effectively. The rapidly increasing capacity to conduct genetic typing of pathogens makes such traced epidemiological analysis more accessible and has the potential to substantially enhance epidemiological risk factor studies.
Epidemiological risk factor analyses are used to identify factors that influence the risk of individuals acquiring a particular infection. Such risk factor analyses commonly assume that the risk factors associated with different sources of exposure to infection are homogeneous [1–3]. However, in many cases there are multiple sources of infection and different risk factors may be associated with the different sources. Backward-tracing data on the sources of infection could be used to ascribe different risks to different sources of exposure.
Infection with C. jejuni can be acquired from consumption of contaminated food as well as through direct and indirect contact with animal faeces and has multiple hosts including poultry, ruminants and wild birds [4, 5]. Recent developments in the typing of Campylobacter bacteria permits the tracing of sources of infection for human cases of Campylobacteriosis . Campylobacter can be classified by their allelic profile using Multi-Locus-Sequence-Type (MLST) typing techniques , which places isolates into specific Sequence Type (ST) profiles. Using STRUCTURE software  it is possible to calculate a probability of the ST originating from a particular species [6, 9].
Previous studies have identified an association between human C. jejuni infection in Scotland and lower social deprivation score (indicating lower social deprivation) and being a child living in an area of lower population density . A recent study in New Zealand  typed C. jejuni isolates using MLST and used the Asymmetric Island probabilistic genetic attribution model  to divide these types into ruminant and poultry origin types. Logistic regression analysis of the two types demonstrated that cases of ruminant origin were more likely to occur in rural areas relative to those of poultry origin . A similar methodology will be used in this paper to build on the risk factor analysis of Bessell et al.  by differentiating between the risks associated with different sources of infection. For example, one potential explanation for the association found by Bessell et al.  with lower deprivation could be differences in access to outdoor leisure activities. If this were the case, it might result in the less deprived being more exposed to ruminant strains should there be greater exposure to ruminant types in the environment.
Infection with ruminant strains is more common in rural areas with a large ruminant population.
Infection with ruminant types is more associated with lower deprivation than infection with poultry types.
Infection with ruminant types is more common in summer relative to poultry types.
Infection with ruminant types is more common among children rather than adults relative to poultry types.
Infection with ruminant types is associated with domestic exposures whilst poultry attributed infections more commonly result from exposure to exotic types overseas.
Anonymised reports of laboratory confirmed, passively reported C. jejuni infections were collected by staff at Health Protection Scotland (HPS) from the Public Health Teams at the 12 mainland NHS Health Boards that existed in Scotland prior to 2006. Ethical approval for the collection and use of the data was obtained from the Multi-Centre Research Ethics Committee (MREC) in Scotland; additionally, approval for the research was obtained from the Research and Development Committee in each of the NHS Health Boards. Cases that were confirmed between June 2005 and September 2006 were typed using MLST [6, 7]. Typing data was linked to epidemiological and demographic data, where available. The data included the postcode sector of the main residence of the case and either the date of onset or more commonly the date of laboratory report. Cases that were part of an outbreak were excluded and of the remainder, 101 cases were missing a verifiable postcode; these were excluded, leaving 3,834 cases. A further 2 cases had no data on gender and 9 had no record of age; these were also removed leaving 3,823 cases.
In a recent study, we collected samples of C. jejuni from food and environmental sources including chicken, pate and liver, farms with ruminant livestock, livestock faeces, wild bird faeces and urban areas where animal faeces and humans coincide, such as parks . C. jejuni were isolated from these samples and typed using MLST. Subsequently each isolated ST was assigned a probability of originating from a particular source - either poultry, cattle, sheep, wild birds, water and environmental based on their occurrence in each source . The probabilities were assigned using the STRUCTURE software . Each of 441 STs isolated from the 3,451 human cases of C. jejuni (372 cases that were infected with C. coli were removed from the analysis) was assigned a probability that the ST originated from poultry, cattle, sheep, wild bird and environmental sources as described in Sheppard et al. . STs were assigned to ruminant (cattle and sheep), poultry or wild bird whenever the probability for that species was greater than 0.95; otherwise the case remained unassigned. Very few cases were assigned to environmental or swine origin, so these sources were excluded . Cattle and sheep were merged to form a single ruminant category because Ogden et al.  demonstrated that there are no significant differences between probabilities assigned to cattle compared to probabilities assigned to sheep and therefore the two sources are indistinguishable in terms of their C. jejuni sequence types.
Individuals infected with a poultry assigned type (cases) versus individuals infected with a ruminant assigned type (controls).
Individuals infected with an unassigned type (cases) versus individuals infected with a ruminant assigned type (controls).
Individuals infected with a poultry assigned type (cases) versus individuals infected with an unassigned type (controls).
As the data points are individual cases, case-specific data could be included. Such data include the age, gender and time of year of laboratory reports. The following putative risk factors were included in these analyses:
Easting and northing of the postcode sector centroid.
Population density (people/km2) of the postcode sector using population data from the 2001 Scottish census . This was split to a binary predictor based around a cut-off of 500 people/km2.
Density of cattle, sheep and poultry (head/km2) in the postcode sector from the June 2004 agricultural census (EDINA, http://edina.ac.uk/agcensus 2004 estimates).
Gender (Female reference level)
Age: Adult/Child (Adult reference level). Children defined as being 18 and under.
Season in which infection reported: Summer/Winter (Summer reference level). Summer 15 April to 15 October.
Reporting of recent overseas travel.
To allow for the clustering of certain predictors at the level of 749 postcode sectors, the postcode sector is entered as a random effect. Furthermore, the data were gathered by the 12 mainland NHS Health Boards, so this was entered as a second random effect. Following univariate screening all predictors that were significant at p < 0.25 were entered into a multivariable model which was subsequently reduced by excluding the least significant predictors in turn until only those which were significant at p < 0.05 remained. The effect of removing predictors on the remaining p-values was monitored. Sensitivity analysis checked for the effect of the source assignment cut -off probability by repeating the analysis for a range of cut-off probabilities from 0.5 to 1 and testing for significant change in the risk factors in the final reduced model. Multilevel logistic regression analysis was carried out using the lme4 package  in the R statistical environment .
The numbers of cases and STs assigned to different sources based upon a probability of greater than 0.95
Number of cases (%)
Number of STs (%)
Cases per ST
Logistic regression analysis
Logistic regression comparing risk factors for being infected by a ruminant attributed type (control) with those for a poultry attributed type (case)
OR (95% CIs)
1.497 (1.211, 1.852)
1.272 (1.067, 1.517)
0.834 (0.712, 0.977)
1.618 (1.056, 2.481)
< = 500/km2
1.213 (1.030, 1.431)
Logistic regression comparing risk factors for being infected by a ruminant attributed type (control) with those for an unassigned type (case).
OR (95% CIs)
1.524 (1.156, 2.008)
1.919 (1.399, 2.632)
4.808 (3.165, 7.299)
< = 500/km2
1.359 (1.071, 1.724)
Season * pop.dens
Summer * < = 500/km2
winter * > 500/km2
0.605 (0.395, 0.926)
By using the MLST technique to attribute isolates from C. jejuni cases to host sources , this paper has demonstrated that risk factors for infection depend upon the source of the pathogen. Whilst there is a range of potential sources of C. jejuni infections, this paper has demonstrated that human infections of C. jejuni that are attributable to sources in ruminants are more seasonal and occur more in rural areas than those assigned to poultry sources. Those that were unassigned had very similar epidemiologies to the poultry attributable types.
The work of Sheppard et al.  on assigning source probabilities to individual STs has made this analysis possible and it demonstrates that the majority of human cases were attributable to sources in poultry and ruminants or were unassigned (Table 1). However, the majority of STs were not assigned to a source of infection with a probability of greater than 95%. This is in part reflects the large number of STs that represented a small proportion of human infections (Table 1 Figure 1), and suggests that there are either a large number of C. jejuni to which humans have low susceptibility or to which humans are rarely exposed. Consequently, changes in human behaviour or environmental exposures could result in exposure to a large additional pool of bacteria. Twenty-two C. jejuni STs were assigned to wild bird origins, but there were only in 67 reported human cases assigned to an origin in wild birds. This suggests that whilst wild birds are a reservoir there is little mechanism for human exposure, although exposure to preschool children in playgrounds has been suggested elsewhere .
The comparisons of poultry attributed cases, ruminant attributed cases and unassigned cases (Tables 2 and 3) showed that ruminant assigned types were more common in children in rural areas in summertime. This may reflect a tendency to play outdoors in the summertime coupled with poor hygiene after playing outdoors. Strachan et al.  find similar results and attribute the differences to the consumption of contaminated chicken in urban areas and playing outdoors in rural areas. These findings are similar to those from New Zealand , although our larger sample size has enabled us to show that younger age groups in rural areas are at greater risk of infection with a ruminant types in addition to the effect of season. Thus, the heterogeneities in exposure to infection of C. jejuni are consistent across different countries, with similar mechanisms of infection occurring in all, despite the fact that the most common ST in New Zealand that is associated with poultry (ST474) differs from that in Scotland (ST257).
Previous studies [1, 10] have identified an association with increased incidence in younger individuals that live in more rural settings. This paper suggests that this is likely to be the result of infection with ruminant types, thus underlining the importance of identifying different sources of infections. Here, the density of the human population rather than the density of cattle and sheep has been identified as the measure of risk for infection with ruminant strains. This suggests that either population density is a better measure of exposure to ruminant sources or that it is some property of rural areas that determines the risk. One such property has been demonstrated to be consumption of water from untreated sources . It is likely that consumption of water from private water sources will be greater in rural areas with lower population densities. ST45 was identified as a type that was associated with surface water sources during a period in the summer , however, in this study ST45 was attributed to sources in poultry.
This study did not demonstrate any difference in the risk associated with deprivation for different sources of infection. The relationship between campylobacteriosis and deprivation has been noted in Scotland , Denmark  and New Zealand , but the non-significance of deprivation in this study suggests that deprivation does not influence exposure to environmental sources.
The unassigned types had similar epidemiologies to the poultry types with the consequence that the only significant risk factors for being infected with a poultry rather than an unassigned type was overseas travel. This suggests that the majority of these unassigned types had a similar epidemiology to the poultry types, but insufficient isolates were found in the source assignment to demonstrate their origin and the association with overseas travel suggests that these may be exotic types. Bessell et al.  describe a higher likelihood of reporting infection in areas of lower deprivation and lower population density. These analyses show that the effect of rurality may be the signature of the ruminant origin cases.
By using a case-case approach this study did not seek to estimate population level risk of exposure. Rather this study analysed the subgroup of the population that has already been infected, with the principal risk factor being social deprivation . Case-case analysis is a means of comparing risk factors within this sub-group of the population that has acquired infection  and has been employed elsewhere for comparing risk factors for infection between sources of C. jejuni . As such, social deprivation remains the principal population level determinant of infection with C. jejuni but these analyses demonstrate that this does not vary between sources of infection.
Our results have demonstrated that over and above the previously demonstrated risk factors for infection at the population level , there are different risk factors for infection depending upon the sources of exposure to infection. Therefore, it is important to account for the source of infection in public health planning. The individuals that report infection depend upon the source of C. jejuni, with ruminant exposures more common among the young males in rural areas. For common genetic types, this analysis could be expanded to examine transmission routes that are specific to individual strains. By enhancing our ability to identify at-risk groups and the likely times at which these groups are at risk, public health messages can be targeted more effectively. The rapidly increasing capacity to conduct genetic typing of pathogens makes such traced epidemiological analysis more accessible and has the potential to substantially enhance epidemiological risk factor studies.
The authors are grateful to the Food Standards Agency of Scotland for funding this study.
- Ethelberg S, Simonsen J, Gerner-Smidt P, Olsen KE, Molbak K: Spatial distribution and registry-based case-control analysis of Campylobacter infections in Denmark, 1991-2001. Am J Epidemiol. 2005, 162 (10): 1008-1015. 10.1093/aje/kwi316.View ArticlePubMedGoogle Scholar
- Halliday JE, Chase-Topping ME, Pearce MC, McKendrick IJ, Allison L, Fenlon D, Low C, Mellor DJ, Gunn GJ, Woolhouse ME: Herd-level risk factors associated with the presence of Phage type 21/28 E. coli O157 on Scottish cattle farms. BMC Microbiol. 2006, 6: 99-10.1186/1471-2180-6-99.View ArticlePubMedPubMed CentralGoogle Scholar
- Bessell PR, Shaw DJ, Savill NJ, Woolhouse ME: Statistical modeling of holding level susceptibility to infection during the 2001 foot and mouth disease epidemic in Great Britain. Int J Infect Dis. 2010, 14 (3): e210-e215. 10.1016/j.ijid.2009.05.003.View ArticlePubMedGoogle Scholar
- Horrocks SM, Anderson RC, Nisbet DJ, Ricke SC: Incidence and ecology of Campylobacter jejuni and coli in animals. Anaerobe. 2009, 15 (1-2): 18-25. 10.1016/j.anaerobe.2008.09.001.View ArticlePubMedGoogle Scholar
- Mullner P, Spencer SE, Wilson DJ, Jones G, Noble AD, Midwinter AC, Collins-Emerson JM, Carter P, Hathaway S, French NP: Assigning the source of human campylobacteriosis in New Zealand: a comparative genetic and epidemiological approach. Infect Genet Evol. 2009, 9 (6): 1311-1319. 10.1016/j.meegid.2009.09.003.View ArticlePubMedGoogle Scholar
- Sheppard SK, Dallas JF, Strachan NJ, MacRae M, McCarthy ND, Wilson DJ, Gormley FJ, Falush D, Ogden ID, Maiden MC, et al: Campylobacter genotyping to determine the source of human infection. Clin Infect Dis. 2009, 48 (8): 1072-1078. 10.1086/597402.View ArticlePubMedPubMed CentralGoogle Scholar
- Maiden MC, Bygraves JA, Feil E, Morelli G, Russell JE, Urwin R, Zhang Q, Zhou J, Zurth K, Caugant DA, et al: Multilocus sequence typing: a portable approach to the identification of clones within populations of pathogenic microorganisms. Proc Natl Acad Sci USA. 1998, 95 (6): 3140-3145. 10.1073/pnas.95.6.3140.View ArticlePubMedPubMed CentralGoogle Scholar
- Pritchard JK, Stephens M, Donnelly P: Inference of population structure using multilocus genotype data. Genetics. 2000, 155 (2): 945-959.PubMedPubMed CentralGoogle Scholar
- Sheppard SK, Dallas JF, Macrae M, McCarthy ND, Sproston EL, Gormley FJ, Strachan NJ, Ogden ID, Maiden MC, Forbes KJ: Campylobacter genotypes from food animals, environmental sources and clinical disease in Scotland 2005/6. Int J Food Microbiol. 2009, 134 (1-2): 96-103. 10.1016/j.ijfoodmicro.2009.02.010.View ArticlePubMedPubMed CentralGoogle Scholar
- Bessell PR, Matthews L, Smith-Palmer A, Rotariu O, Strachan NJ, Forbes KJ, Cowden JM, Reid SW, Innocent GT: Geographic determinants of reported human Campylobacter infections in Scotland. BMC Public Health. 2010, 10: 423-10.1186/1471-2458-10-423.View ArticlePubMedPubMed CentralGoogle Scholar
- Mullner P, Shadbolt T, Collins-Emerson JM, Midwinter AC, Spencer SE, Marshall J, Carter PE, Campbell DM, Wilson DJ, Hathaway S, et al: Molecular and spatial epidemiology of human campylobacteriosis: source association and genotype-related risk factors. Epidemiol Infect. 2010, 138 (10): 1372-1383. 10.1017/S0950268809991579.View ArticlePubMedGoogle Scholar
- Wilson DJ, Gabriel E, Leatherbarrow AJ, Cheesbrough J, Gee S, Bolton E, Fox A, Fearnhead P, Hart CA, Diggle PJ: Tracing the source of campylobacteriosis. PLoS Genet. 2008, 4 (9): e1000203-10.1371/journal.pgen.1000203.View ArticlePubMedPubMed CentralGoogle Scholar
- Ogden ID, Dallas JF, MacRae M, Rotariu O, Reay KW, Leitch M, Thomson AP, Sheppard SK, Maiden M, Forbes KJ, et al: Campylobacter excreted into the environment by animal sources: prevalence, concentration shed, and host association. Foodborne Pathog Dis. 2009, 6 (10): 1161-1170. 10.1089/fpd.2009.0327.View ArticlePubMedPubMed CentralGoogle Scholar
- Carstairs V, Morris R: Deprivation and health in Scotland. Health Bull (Edinb). 1990, 48 (4): 162-175.Google Scholar
- UKBorders Service. [http://www.edina.ac.uk/]
- Bates D, Maechler M, Bin D: lme4: Linear mixed-effects models using S4 classes. 2011Google Scholar
- R Development Core Team: R: A language and environment for statistical computing. 2008, Vienna, Austria: R Foundation for Statistical ComputingGoogle Scholar
- French NP, Midwinter A, Holland B, Collins-Emerson J, Pattison R, Colles F, Carter P: Molecular epidemiology of Campylobacter jejuni isolates from wild-bird fecal material in children's playgrounds. Appl Environ Microbiol. 2009, 75 (3): 779-783. 10.1128/AEM.01979-08.View ArticlePubMedGoogle Scholar
- Strachan NJ, Gormley FJ, Rotariu O, Ogden ID, Miller G, Dunn GM, Sheppard SK, Dallas JF, Reid TM, Howie H, et al: Attribution of campylobacter infections in northeast Scotland to specific sources by use of multilocus sequence typing. J Infect Dis. 2009, 199 (8): 1205-1208. 10.1086/597417.View ArticlePubMedPubMed CentralGoogle Scholar
- Sopwith W, Birtles A, Matthews M, Fox A, Gee S, Painter M, Regan M, Syed Q, Bolton E: Identification of potential environmentally adapted Campylobacter jejuni strain, United Kingdom. Emerg Infect Dis. 2008, 14 (11): 1769-1773. 10.3201/eid1411.071678.View ArticlePubMedPubMed CentralGoogle Scholar
- Rind E, Pearce J: The spatial distribution of campylobacteriosis in New Zealand, 1997-2005. Epidemiol Infect. 2010, 138 (10): 1359-1371. 10.1017/S095026881000018X.View ArticlePubMedGoogle Scholar
- McCarthy N, Giesecke J: Case-case comparisons to study causation of common infectious diseases. Int J Epidemiol. 1999, 28 (4): 764-768. 10.1093/ije/28.4.764.View ArticlePubMedGoogle Scholar
- Gillespie IA, O'Brien SJ, Frost JA, Adak GK, Horby P, Swan AV, Painter MJ, Neal KR: A case-case comparison of Campylobacter coli and Campylobacter jejuni infection: a tool for generating hypotheses. Emerg Infect Dis. 2002, 8 (9): 937-942.View ArticlePubMedPubMed CentralGoogle Scholar
- The pre-publication history for this paper can be accessed here:http://www.biomedcentral.com/1471-2334/12/80/prepub
This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.