Skip to main content
  • Research article
  • Open access
  • Published:

Comparative M-protein analysis of Streptococcus pyogenes from pharyngitis and skin infections in New Zealand: Implications for vaccine development



Acute rheumatic fever (ARF) and rheumatic heart disease (RHD) are responsible for a significant disease burden amongst Māori and Pacific populations in New Zealand (NZ). However, contemporary data are lacking regarding circulating group A Streptococcal (GAS) strains in NZ. Such information is important in guiding vaccine development.


GAS isolates from April to June 2015 were recovered from skin and pharyngeal samples from children living in areas of high social deprivation in Auckland, NZ, a significant proportion of which are Māori or Pacific. These children are among the highest risk group for developing ARF. Isolates were compared to concurrently collected pharyngeal isolates from Dunedin, NZ, where both the proportion of Māori and Pacific children and risk of developing ARF is low. Emm typing, emm cluster typing and theoretical coverage of the 30-valent vaccine candidate were undertaken as previously described.


A high diversity of emm types and a high proportion of emm-pattern D and cluster D4 isolates were detected amongst both skin and pharyngeal isolates in children at high risk of ARF. Pharyngeal isolates from children at low risk of ARF within the same country were significantly less diverse, less likely to be emm pattern D, and more likely to be theoretically covered by the 30-valent M protein vaccine.


The high proportion of emm pattern D GAS strains amongst skin and pharyngeal isolates from children at high risk of ARF raises further questions about the role of skin infection in ARF pathogenesis. Emm types and emm clusters differed considerably between ARF endemic and non-endemic settings, even within the same country. This difference should be taken into account for vaccine development.

Peer Review reports


Group A Streptococcus (GAS) is a major human pathogen, and infections caused by GAS are responsible for considerable global morbidity and mortality [1]. The spectrum of infections caused by GAS ranges from non-invasive infections such as impetigo and pharyngitis, through to invasive infections such as necrotizing fasciitis and toxic shock syndrome. In addition, acute rheumatic fever (ARF) and its sequela, rheumatic heart disease (RHD) continue to represent a considerable disease burden in many settings. This includes New Zealand, where the incidence of ARF in Māori and Pacific children is amongst the highest in the developed world [2].

At present, molecular typing of GAS is performed by sequence analysis of the 5’ hypervariable region of the emm gene that encodes the M-protein [3]. Recently, an emm cluster-based typing system was proposed, which classifies existing emm types into emm clusters based on genetic relatedness of emm protein sequences and functional activity of the M-protein [4]. This cluster system is predictive of emm pattern type, which, based on differential genotypic features and epidemiological associations, is a proposed marker for GAS tissue tropism [4]. In general, pattern A-C strains are associated with GAS pharyngitis, pattern D strains with skin infections (particularly impetigo), and pattern E strains with both oropharyngeal and skin infections [5].

In response to rising rates of ARF in New Zealand, the Rheumatic Fever Prevention Programme (RFPP) was set up by the New Zealand Ministry of Health. The RFPP is a multidisciplinary strategy aimed at reducing the incidence of ARF by two-thirds, to 1.4 cases per 100,000 population by June 2017 [6]. A large part of this strategy is focused on primary prevention of ARF, through the timely diagnosis and treatment of GAS pharyngitis. One of the models of care within this programme is the provision of school-based sore throat management clinics, with these services targeted at children at highest risk of ARF [6]. In New Zealand the distribution of ARF is unequal and those with the highest risk are Maori and Pacific and live in areas of low socio economic areas in the North Island [2]. The Counties Manukau region (South Auckland, North Island) has the highest incidence of rheumatic fever and the highest number of school-based sore throat clinics.

Recent epidemiological analyses have demonstrated that approximately half of ARF-associated GAS strains in New Zealand fall into pattern D–i.e. strains classically associated with skin infections [7]. Despite this association, and the high incidence of skin infections in New Zealand children [8], there have been few studies in New Zealand describing the circulating GAS emm types associated with superficial skin infections, and no contemporary studies comparing skin and pharyngeal isolates from high-risk children. In addition, it is not known whether there is any difference in GAS emm types circulating amongst children at high risk of ARF, compared to children with minimal risk.

The Coalition to Accelerate New Vaccines Against Streptococcus (CANVAS) is an Australian and New Zealand GAS vaccine development programme that aims to identify suitable GAS vaccine candidates, with the long-term aim of preventing GAS-related disease, particularly ARF/RHD and invasive disease [9]. At present, the most clinically advanced GAS vaccine candidates are those that target the N-terminal region of the M-protein, such as an experimental 30-valent M-protein vaccine [10, 11]. One of the early aims of CANVAS is to define the global epidemiology of circulating GAS strains, in order to better inform vaccine development and putative vaccine coverage.

Accordingly, the aims of this study were to: (i) describe the GAS emm types, emm clusters and patterns from children at high risk of ARF with presumptive GAS pharyngitis or skin disease in a New Zealand setting, (ii) compare the emm types of GAS strains from children at high risk of ARF with those at low risk, and (iii) determine the theoretical coverage of the 30-valent M-protein vaccine candidate against contemporary GAS strains associated with pharyngitis or skin disease.


Setting and population

Auckland is the largest city in New Zealand, with an estimated population of 1.47 million. The population is ethnically diverse, consisting approximately of the following major ethnicities in 2013: 57 % European/Other, 21 % Asian, 12 % Pacific Peoples and 10 % Māori [12]. Labtests Auckland (LTA) is the sole community diagnostic microbiology provider for the entire Auckland region. This includes testing for GAS from: (i) primary care, and (ii) school-based sore throat management programs that are part of the RFPP in Auckland. Between 1st April and 25th June 2015, GAS isolates were collected from throat swabs taken from children aged 5 to 15 years presenting with presumptive pharyngitis in school-based programs, as part of the previously described RFPP [6]. Children enrolled in the sore throat management programmes are at highest risk of developing ARF so tend to be of Māori or Pacific ethnicity and living in areas of high social deprivation. During the same time period, GAS skin isolates were collected from children aged 5 to 15 years who presented to a general practitioner with a presumptive skin infection. These samples were collected from primary care practitioners from the same geographic region (i.e. areas of high social deprivation) as the throat swabs.

In order to provide a comparator population, GAS isolates were collected between 1st April and 30th May 2015 from throat swabs taken from children aged 5 to 15 years in Dunedin, a city in the South Island of New Zealand. These GAS isolates were collected from children presenting to a general practitioner with presumptive pharyngitis. The population in Dunedin is predominantly European (84 %), followed by Māori (7 %), Asian (6 %) and Pacific Peoples (2 %) [12]. The incidence of ARF in children in Dunedin is negligible, with only one case of ARF reported in the last five years.

Basic demographic data were collected about each child who had GAS isolated, including age, sex, ethnicity and socioeconomic status, as determined by the New Zealand deprivation index (NZDep). The NZDep score is an area-based measure of socioeconomic deprivation derived from New Zealand census data, which assigns a decile ranking from 1 to 10, with 10 representing the most deprived neighbourhoods [13].

Bacterial isolates and molecular typing

Throat and skin swabs were plated onto tryptic soy sheep blood agar and incubated in 5 % CO2 overnight at 37 °C. GAS isolates were identified using a MALDI-TOF MS Biotyper (Bruker, Germany) and forwarded to the Invasive Pathogens Laboratory at the Institute of Environmental Science and Research, Wellington, New Zealand for further analysis. Polymerase chain reaction (PCR) and DNA sequencing of the emm gene was performed using previously described methods [3]. Emm clusters, along with emm patterns, were extrapolated from emm typing results [4].

Statistical analysis

Simpson’s index of diversity was used to assess variation in emm types. This index indicates the probability that two emm types randomly selected are of different types–i.e. the higher the index, the greater the diversity of emm types in a particular population [14]. 95 % confidence intervals (CI) for the Simpson’s index were calculated as previously described [15]. Statistical analysis was performed using GraphPad Prism (Version 6), and a p value of 0.05 was considered significant.

Ethical approval

As this study involved de-identified patient data, ethical approval was not deemed necessary by the Health and Disability Ethics Committees, New Zealand.


Population demographics

A total of 452 isolates (246 pharyngeal isolates from Auckland, 104 skin isolates from Auckland and 103 pharyngeal isolates from Dunedin) were collected from 452 children. In Auckland, over 80 % of children in both groups (i.e. those with either a pharyngeal or skin isolate) were either Māori or Pacific, and predominantly resided in areas of high socioeconomic deprivation placing them in the high-risk group for developing ARF (Table 1). In contrast, over 80 % of children with a pharyngeal isolate in Dunedin were European, and resided in areas of low and moderate socioeconomic deprivation placing them in the low-risk group for developing ARF (Table 1).

Table 1 Demographic characteristics of children with group A Streptococcal pharyngeal and skin isolates

emm typing and cluster analysis

Amongst the 452 isolates, a total of 83 emm types were detected, belonging to 20 emm clusters (Additional file 1: Table S1). Amongst Auckland isolates, the Simpson’s index did not differ significantly (on the basis of non-overlapping confidence intervals) between pharyngeal isolates (96.3 % [95 % CI 95.5 to 97.1 %]) and skin isolates (96.3 % [95 % CI 95.3 to 97.3 %]). However, the Simpson’s index was significantly lower in pharyngeal isolates from Dunedin (87.5 % [95 % CI 86.1 to 88.9 %]).

There were notable differences in emm cluster distribution according to specimen site and geographical location, with cluster D4 more common amongst skin isolates (36 % in Auckland skin isolates vs. 15 % in Auckland pharyngeal isolates vs. 3 % in Dunedin pharyngeal isolates), and clusters A-C4 and E4 more common amongst pharyngeal isolates, particularly from Dunedin (Table 2). Strains belonging to pattern D (skin pattern) made up 52 % of isolates recovered from skin, 32 % of pharyngeal isolates in Auckland, 4 % of pharyngeal isolates from Dunedin and 44 % of ARF associated strains reported in a previous study in New Zealand [7] (Fig. 1).

Table 2 emm clusters of group A Streptococcal strains isolated from children in New Zealand, 2015
Fig. 1
figure 1

emm patterns of group A streptococcal strains associated with pharyngitis, skin infections and rheumatic fever in New Zealand. The rheumatic fever isolates (*) were published previously [7]

Theoretical M-protein vaccine coverage

The theoretical coverage of the experimental 30-valent M-protein vaccine was lower for skin-associated isolates than for pharyngeal isolates (Table 3). Amongst Auckland isolates, theoretical coverage increased significantly in both groups when the potential effect of cross-opsonic antibodies was considered, although still only covered approximately half of skin isolates. There was a significant difference in theoretical vaccine coverage between pharyngeal isolates from Auckland and those from Dunedin, with coverage almost twice as high amongst GAS isolates from Dunedin compared to Auckland (Table 3).

Table 3 Theoretical coverage of 30-valent vaccine in relation to Group A Streptococcus-associated clinical syndromes


This study describes the contemporary molecular epidemiology of GAS strains circulating amongst high-risk children in a setting with a high burden of ARF, compared with strains amongst children residing in the same country, with minimal risk of ARF [2]. As expected, emm pattern D, classically associated with skin tropism, predominated in GAS skin isolates in high-risk children accounting for approximately half of all emm types. Somewhat surprisingly, approximately one-third of strains from high-risk children in Auckland with presumptive GAS pharyngitis also belonged to emm pattern D. This is in marked contrast to the distribution observed in pharyngeal isolates from low-risk children in Dunedin, where pattern D strains comprised only 4 % of isolates. Though emm pattern type has been described as an imperfect marker for tissue tropism [5], the high prevalence of emm pattern D strains observed in high-risk children in this study resembles observations in Aboriginal children in Australia, another Indigenous population with high ARF risk. A study in three remote Aboriginal communities found emm pattern D strains accounted for 53 % of pyoderma isolates and 24 % of pharyngeal isolates [16].

Clear differences were also observed in emm cluster types between high-risk and low risk children in this study. The emm clusters observed in pharyngeal isolates from low-risk Dunedin children closely resemble findings from other developed countries. [17]. For example, Shulman et al., recently applied the emm cluster system to pharyngitis isolates collected between 2000–2007 in the United States and found that clusters E4, A-C3 and A-C4 comprised 62.5 % of pharyngeal isolates, with isolates from cluster D4 representing <1 % of isolates [17]. In our study, clusters E4, A-C3 and A-C4 comprised 79 % of pharyngeal isolates from Dunedin, but only 45 % of pharyngeal isolates from Auckland. Isolates from cluster D4 accounted for just 3 % of pharyngeal strains in low-risk Dunedin children in our study but 15 % of pharyngeal strains in high-risk Auckland children and were the most prevalent cluster amongst GAS skin isolates comprising 36 % of strains. The emm-cluster D4 comprises 32 emm-types [4], and was also the most prevalent cluster in a recent analysis of GAS isolates collected in Fiji where it accounted for over 30 % of strains [18].

The high proportion of emm pattern D and cluster D4 amongst GAS skin and pharyngeal isolates in Auckland (an area with the highest rates of ARF in New Zealand) is in keeping with recent work describing a high proportion of emm pattern D and cluster D4 isolates amongst ARF-associated GAS strains in New Zealand [7]. Taken together these observations add further weight to the hypothesis that skin infections may play an important etiological role in the pathogenesis of ARF [7, 19]. Although the biological pathway is unclear, it is possible that colonizing GAS skin isolates may passage into the pharynx, or alternatively antecedent skin infection may ‘prime’ the immune system in an, as yet, uncharacterized manner, and contribute to the autoimmune process characteristic of ARF [20].

Previous studies have shown the theoretical coverage of the 30-valent GAS vaccine is reduced in low-income countries where the diversity of emm-types is highest [21]. In this study, vaccine coverage has been compared between different populations within the same country. The theoretical coverage was low amongst isolates from high-risk Auckland children for which the Simpson’s index for strain diversity is high, although did increase when the putative effect of cross-opsonic antibodies was considered. However approximately 40 % of skin-associated isolates and 28 % of Auckland pharyngeal isolates have not yet been tested for potential cross-opsonic activity [10, 11]. In contrast, the theoretical vaccine coverage for pharyngeal isolates from low-risk children in Dunedin is 93.2 % before taking into account the added potential effect of in vitro cross-opsonization. The high theoretical coverage in this low-risk population likely reflects the similarities in emm-clusters observed with pharyngeal strains from the US [17], as the 30-valent vaccine was designed to include emm-types commonly associated with pharyngitis in North America [10]. These findings demonstrate the importance of broad population sampling when assessing theoretical vaccine coverage.

There were a number of limitations with this study. It was assumed that GAS pharyngeal isolates were from children with pharyngitis, although information on clinical symptoms was not available for each child. Within the school-based programme, children are encouraged to present for assessment when they have a sore throat, though it remains possible that some of these isolates represent GAS colonization with concurrent viral pharyngitis, rather than true, serologically confirmed, GAS pharyngitis [22]. Similarly, clinical information about each child who had a skin swab submitted for testing was not available. However, as these swabs were submitted as part of a primary care consultation, it is likely that the majority of children had clinical symptoms suggestive of a skin infection. Moreover, our sampling period was over a relatively short timeframe; previous work, including a study from New Zealand, has demonstrated temporal variation in the proportional distribution of GAS emm types.


In summary, this study describes a high diversity of emm types and a high proportion of emm-pattern D and cluster D4 isolates amongst children at high risk of ARF in New Zealand. These findings have potential impact on the theoretical coverage of multivalent M-protein vaccines in high-risk populations in New Zealand. In contrast, pharyngeal isolates from children at low risk of ARF within the same country were less diverse, less likely to be emm pattern D, and more likely to be theoretically covered by the 30-valent M protein vaccine. The high proportion of emm pattern D GAS strains amongst skin and pharyngeal isolates from children at high risk of ARF further highlights the potential role of skin infections in the pathogenesis of ARF.


  1. Carapetis JR, Steer AC, Mulholland EK, Weber M. The global burden of group A Streptococcal diseases. Lancet Infect Dis. 2005;5(11):685–94.

    Article  PubMed  Google Scholar 

  2. Jaine R, Baker M, Venugopal K. Epidemiology of acute rheumatic fever in New Zealand 1996–2005. J Paediatr Child Health. 2008;44(10):564–71.

    Article  PubMed  Google Scholar 

  3. Beall B, Facklam R, Thompson T. Sequencing emm-specific PCR products for routine and accurate typing of group A Streptococci. J Clin Microbiol. 1996;34(4):953–8.

    CAS  PubMed  PubMed Central  Google Scholar 

  4. Sanderson-Smith M, De Oliveira DM, Guglielmini J, et al. A systematic and functional classification of Streptococcus pyogenes that serves as a new tool for molecular typing and vaccine development. J Infect Dis. 2014;210(8):1325–38.

    Article  CAS  PubMed  Google Scholar 

  5. Bessen DE, Lizano S. Tissue tropisms in group A streptococcal infections. Future Microbiol. 2010;5(4):623–38.

    Article  PubMed  PubMed Central  Google Scholar 

  6. New Zealand Ministry of Health. Rheumatic fever prevention programme (, last accessed 26th October, 2015.

  7. Williamson DA, Smeesters PR, Steer AC, et al. M-protein analysis of Streptococcus pyogenes isolates associated with acute rheumatic fever in New Zealand. J Clin Microbiol. 2015;53(11):3618–20.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  8. O'Sullivan CE, Baker MG, Zhang J. Increasing hospitalizations for serious skin infections in New Zealand children, 1990–2007. Epidemiol Infect. 2011;139(11):1794–804.

    Article  PubMed  Google Scholar 

  9. Moreland NJ, Waddington CS, Williamson DA, et al. Working towards a group A Streptococcal vaccine: Report of a collaborative trans-Tasman workshop. Vaccine. 2014;32(30):3713–20.

    Article  PubMed  Google Scholar 

  10. Dale JB, Penfound TA, Chiang EY, Walton WJ. New 30-valent M protein-based vaccine evokes cross-opsonic antibodies against non-vaccine serotypes of group A Streptococci. Vaccine. 2011;29(46):8175–8.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  11. Dale JB, Penfound TA, Tamboura B, et al. Potential coverage of a multivalent M protein-based group A Streptococcal vaccine. Vaccine. 2013;31(12):1576–81.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  12. Statistics New Zealand. 2013 census of populations and dwellings. Available at:, last accessed 30th November 2015.

  13. Salmond C, Crampton P, Sutton F. NZDep91: A New Zealand index of deprivation. Aust N Z J Public Health. 1998;22(7):835–7.

    Article  CAS  PubMed  Google Scholar 

  14. Simpson E. Measurement of diversity. Nature. 1949;163:688.

    Article  Google Scholar 

  15. Grundmann H, Hori S, Tanner G. Determining confidence intervals when measuring genetic diversity and the discriminatory abilities of typing methods for microorganisms. J Clin Microbiol. 2001;39(11):4190–2.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  16. McDonald MI, Towers RJ, Fagan P, Carapetis JR, Currie BJ. Molecular typing of Streptococcus pyogenes from remote aboriginal communities where rheumatic fever is common and pyoderma is the predominant streptococcal infection. Epidemiol Infect. 2007;135(8):1398–405.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  17. Shulman ST, Tanz RR, Dale JB, Steer AC, Smeesters PR. Added value of the emm-cluster typing system to analyze Group A Streptococcus epidemiology in high-income settings. Clin Infect Dis. 2014;59(11):1651–2.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  18. Baroux N, D’Ortenzio E, Amedeo N, et al. The emm-cluster typing system for Group A Streptococcus identifies epidemiologic similarities across the pacific region. Clin Infect Dis. 2014;59(7):e84–92.

    Article  CAS  PubMed  Google Scholar 

  19. McDonald MI, Towers RJ, Andrews RM, Benger N, Currie BJ, Carapetis JR. Low rates of streptococcal pharyngitis and high rates of pyoderma in Australian aboriginal communities where acute rheumatic fever is hyperendemic. Clin Infect Dis. 2006;43(6):683–9.

    Article  PubMed  Google Scholar 

  20. Raynes JM, Frost HR, Williamson DA, Young PG, Baker EN, Steemson JD, Loh JM, Proft T, Dunbar PR, Atatoa Carr PE, Bell A, Moreland NJ. Serological evidence of immune priming by Group A streptococci in patients with acute rheumatic fever. Front Microbiol. 2016;7:1119.

    Article  PubMed  PubMed Central  Google Scholar 

  21. Steer AC, Law I, Matatolu L, Beall BW, Carapetis JR. Global emm type distribution of group A Streptococci: Systematic review and implications for vaccine development. Lancet Infect Dis. 2009;9(10):611–6.

    Article  PubMed  Google Scholar 

  22. Institute of Environmental Science and Research. Rheumatic Fever Report, 2014. Available at:, last accessed 31th November 2015.

Download references


Not applicable.


No specific funding to declare.

Availability of data and material

Relevant data generated or analyzed during this study are included in this published article and its supplementary information files. Although all data are de-identified, information which could render participants indirectly identifiable (such as details on age, gender, etc.) are not shared in order to fully protect participants' anonymity.

Authors’ contributions

DW and NJM conceived and designed the study. AU was involved in collation of bacterial isolates and study design. JM and PC performed emm typing. PS, AS, ST, MD and JF provided intellectual input sufficient to warrant authorship. All authors read and approved the final manuscript.

Competing interests

The authors declare that they have no competing interests.

Consent for publication

Not applicable.

Ethics approval and consent to participate

As this study involved de-identified patient data, ethical approval was not deemed necessary by the Health and Disability Ethics Committees, New Zealand. De-identified and encrypted patient data were obtained from the NZ Ministry of Health.

Author information

Authors and Affiliations


Corresponding author

Correspondence to Deborah A Williamson.

Additional file

Additional file 1: Table S1.

Group A streptococcal isolates and emm types included in this study. (XLSX 12 kb)

Rights and permissions

Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (, which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver ( applies to the data made available in this article, unless otherwise stated.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Williamson, D.A., Smeesters, P.R., Steer, A.C. et al. Comparative M-protein analysis of Streptococcus pyogenes from pharyngitis and skin infections in New Zealand: Implications for vaccine development. BMC Infect Dis 16, 561 (2016).

Download citation

  • Received:

  • Accepted:

  • Published:

  • DOI: