SARS-CoV-2 lineage dynamics in England from September to November 2021: high diversity of Delta sub-lineages and increased transmissibility of AY.4.2

Eales, Oliver; Page, Andrew J.; de Oliveira Martins, Leonardo; Wang, Haowei; Bodinier, Barbara; Haw, David; Jonnerby, Jakob; Atchison, Christina; Ashby, Deborah; Barclay, Wendy; Taylor, Graham; Cooke, Graham; Ward, Helen; Darzi, Ara; Riley, Steven; Chadeau-Hyam, Marc; Donnelly, Christl A.; Elliott, Paul

doi:10.1186/s12879-022-07628-4

Research
Open access
Published: 27 July 2022

SARS-CoV-2 lineage dynamics in England from September to November 2021: high diversity of Delta sub-lineages and increased transmissibility of AY.4.2

Oliver Eales^1,2,
Andrew J. Page³,
Leonardo de Oliveira Martins³,
Haowei Wang^1,2,
Barbara Bodinier^1,4,
David Haw^1,2,
Jakob Jonnerby^1,2,
Christina Atchison¹,
The COVID-19 Genomics UK (COG-UK) Consortium,
Deborah Ashby¹,
Wendy Barclay⁵,
Graham Taylor⁵,
Graham Cooke^5,6,7,
Helen Ward^1,6,7,
Ara Darzi^6,7,8,
Steven Riley^1,2,
Marc Chadeau-Hyam^1,4,
Christl A. Donnelly^1,2,9 &
…
Paul Elliott^{1,2,6,7,10,11}

BMC Infectious Diseases volume 22, Article number: 647 (2022) Cite this article

6613 Accesses
10 Citations
2 Altmetric
Metrics details

Abstract

Background

Since the emergence of SARS-CoV-2, evolutionary pressure has driven large increases in the transmissibility of the virus. However, with increasing levels of immunity through vaccination and natural infection the evolutionary pressure will switch towards immune escape. Genomic surveillance in regions of high immunity is crucial in detecting emerging variants that can more successfully navigate the immune landscape.

Methods

We present phylogenetic relationships and lineage dynamics within England (a country with high levels of immunity), as inferred from a random community sample of individuals who provided a self-administered throat and nose swab for rt-PCR testing as part of the REal-time Assessment of Community Transmission-1 (REACT-1) study. During round 14 (9 September–27 September 2021) and 15 (19 October–5 November 2021) lineages were determined for 1322 positive individuals, with 27.1% of those which reported their symptom status reporting no symptoms in the previous month.

Results

We identified 44 unique lineages, all of which were Delta or Delta sub-lineages, and found a reduction in their mutation rate over the study period. The proportion of the Delta sub-lineage AY.4.2 was increasing, with a reproduction number 15% (95% CI 8–23%) greater than the most prevalent lineage, AY.4. Further, AY.4.2 was less associated with the most predictive COVID-19 symptoms (p = 0.029) and had a reduced mutation rate (p = 0.050). Both AY.4.2 and AY.4 were found to be geographically clustered in September but this was no longer the case by late October/early November, with only the lineage AY.6 exhibiting clustering towards the South of England.

Conclusions

As SARS-CoV-2 moves towards endemicity and new variants emerge, genomic data obtained from random community samples can augment routine surveillance data without the potential biases introduced due to higher sampling rates of symptomatic individuals.

Peer Review reports

Background

Since its first documented case in India in November 2020 [1] the Delta variant of SARS-CoV-2 has spread rapidly across the world and by 16 November 2021 was responsible for 99.7% of all SARS-CoV-2 infections [2]. Its rapid rise to dominance has been attributed to greater levels of transmissibility [3, 4] than previously circulating variants with the reproduction number estimated to be over two-fold higher [5], as well as possible reduced vaccine effectiveness against infection [6]. Since its global dissemination, continued adaptive evolution has led to a diverse set of Delta sub-lineages, with distinct combinations of mutations (especially on the spike protein) [7, 8].

Since July 2021 the lineage AY.4.2 (Pango nomenclature [9]), a descendant of the original Delta variant (henceforth B.1.617.2) has increased in proportion in routine surveillance data for England from 8.5% the week beginning 4 October [10] to 14.7% the week beginning 31 October [11]. AY.4.2 was declared a variant under investigation (VUI) by the UK Health Security Agency on 20 October 2021 [12]. Globally AY.4.2 had been detected in 43 countries by 22 November 2021 [13] but had only been estimated at a cumulative proportion greater than 1% in Poland [14]. AY.4.2 has two defining mutations in the spike protein, Y145H and A222V, but is otherwise similar to AY.4, a lineage that is far more widespread. AY.4 was the most prevalent lineage in England on 29 October 2021 [11] and had been detected in 87 countries by 22 November 2021 [15], in some of which it had already been reported as the most prevalent lineage (by 23 November 2021) [16, 17].

England has recorded high levels of SARS-CoV-2 infection over the course of the pandemic [4, 18] and vaccinated, as part of its mass vaccination campaign (Pfizer/BioNTec, Oxford/AstraZeneca and Moderna), a large proportion of its population (80.3% of over 12 year olds double vaccinated by 27 November 2021), with further booster jabs (Pfizer/BioNTec or Moderna) being rolled out in adults (30.5% of over 12 year olds having received a booster dose by 27 November 2021) [18]. This has led to high levels of antibodies against coronavirus with 92.8% of adults in England estimated to test positive for antibodies (IgG antibodies against the SARS-CoV-2 trimeric spike protein) in the week beginning 1 November 2021 [19]. With high vaccination coverage in the population it is likely that there is substantial selective pressure on SARS-CoV-2 towards immune escape and vaccine breakthrough infections. Genomic surveillance in highly immunised regions is crucial to detect emerging variants that can more successfully navigate the immune landscape that has been created by both natural infection and vaccination.

The REal-time Assessment of Community Transmission-1 (REACT-1) study is a series of cross-sectional surveys of the population of England that seeks to estimate the prevalence of SARS-CoV-2 on a monthly basis [4, 20], with genomic sequencing performed on all positive samples with a low enough cycle threshold (Ct) value (a proxy for viral load) and high enough volume. Due to its sampling procedure it does not suffer from the biases of routine surveillance that can be heavily biased towards symptomatic individuals [21]; symptom status can be highly dependent on levels of immunity [22]. Here we present the genomic analysis of the (N = 2163) positive samples for round 14 and round 15 which were collected from 9 to 27 September 2021 and 19 October to 5 November 2021 respectively.

Material and methods

Viral genome sequencing

The methods of the REACT-1 study have been described elsewhere [23]. REACT-1 is a repeat cross-sectional study whereby in each round a random subset of the English population (selected from the National Health Service general practitioners' patient list) is invited to obtain a self-administered swab test (parent/guardian administered for 5–12 year olds). These tests are then sent to a laboratory to undergo rt-PCR testing for the presence of SARS-CoV-2. A round of the study covers a ~ 2- to 3-week period and has occurred approximately monthly since May 2020 with between 100,000 and 185,000 individuals taking part in each round. Since round 8 in January 2021 all positive samples with a low enough N-gene Ct value (the threshold was 34 in rounds 14 and 15 presented here) and sufficient volume have been sent for genome sequencing. Amplification of the extracted RNA was performed using the ARTIC protocol [24] (version 4 primers), with sequence libraries prepared using CoronaHiT [25]; sequencing was performed on the Illumina NextSeq 500 platform. Raw sequences were analysed using the bioinformatic pipeline [26] and then uploaded to CLIMB [27]. Lineages were assigned using PangoLEARN [28] (database version 2021-11-04), a machine learning-based assignment algorithm, using Pango nomenclature [9]. For some sequences of low overall quality, a lineage designation was not possible and so they were not included in the analyses. Samples with less than 50% of bases covered were further excluded from the analysis. Of the 1322 lineages determined during rounds 14 and 15, 1160 individuals provided information on their symptoms in the previous month with 314 (27.1%) reporting no symptoms.

Phylogeographic model

For all sequences from REACT-1 rounds 11 (15 April–3 May 2021), 12 (20 May–7 June 2021), 13 (24 June–12 July 2021), 14 (9 September–27 September 2021) and 15 (19 October–5 November 2021), in which the lineage designated was Delta or a Delta sub-lineage, a maximum likelihood phylogenetic tree was constructed using a HKY model implemented in IQ-TREE [29]. An uncorrelated relaxed clock model implemented in TreeTime [30], assuming a normal distribution of rates with mean 0.0008 substitutions per site per year and a single coalescent rate for the time scale, was then fit to the maximum likelihood phylogenetic tree producing a time-resolved phylogenetic tree. The mutation rates at the tree’s tips were extracted from the model and a Gaussian regression model was fit to the samples obtained during round 14 and 15 for the 8 most prevalent lineages (AY.39, AY.4, AY.4.2, AY.43, AY.44, AY.5, AY.6, B.1.617.2) including lineage and round as covariates. A mugration model (implemented in TreeTime [30]) was run on the time-resolved phylogenetic tree, treating the region in which each sample was isolated as a discrete state. This allowed estimates of the migration rates between regions to be calculated (assumed to be symmetric).

Statistical analyses

The 95% confidence intervals for lineage proportions were calculated using the Wilson method [31] assuming a Binomial distribution. This method is preferred when the number of positives is low but is still valid when this is not the case [32]. Higher accuracy in confidence interval estimates for when the number of positives is low was chosen so that lower bounds on case numbers for rarer lineages were as accurate as possible.

Estimates of the true number of swab-positive infections in England during round 14 and round 15 for lineages in which only one sample was detected in a round were calculated by multiplying the estimated proportion of the lineage for each round, the weighted prevalence estimated for each round [33], and the population size of England [34]. The 95% confidence intervals were estimated by simulating the entire distribution for proportion and weighted prevalence and multiplying the two together. The distribution of weighted prevalence was estimated by randomly sampling from a normal distribution with mean value the central estimate, and standard deviation the width of the 95% confidence interval divided by 3.92 (2 times 1.96). The distribution of the lineage proportion was estimated by calculating the Wilson confidence intervals at different levels (0.00001 to 0.99999 in intervals of 0.00001).

The significance of differences in proportions of particular lineages by age group and region was calculated using Fisher’s exact test with a binary outcome variable (lineage of interest or not). Differences with a p-value less than 0.05 were considered statistically significant. Analysis was only completed for a lineage in a round if there were more than 90 samples (AY.4 round 14, AY.4 round 15, AY.4.2 round 15, B.1.617.2 round 15), so that there were, on average, more than 10 samples per parameter (9 regions in England).

Shannon diversity was calculated using all data for round 14 and round 15, and for each region for round 14 and round 15 [35]. The significance of any differences in Shannon diversity between round 14 and 15 (for all data) and between regions in each round was assessed using the Hutcheson T-test [36] and its associated p-value.

The relative growth rate of a lineage compared to all other lineages was estimated using a Bayesian logistic regression model fit to the binary outcome variable (lineage of interest or not) over time. The two model parameters (intercept and gradient) were given uninformative constant prior distributions. The probability that the growth rate was greater than zero was calculated from the model's posterior. Lineages were deemed to be different to zero if the posterior probability that the growth rate was greater than zero was greater than 0.975 or less than 0.025, similar to a p-value threshold of 0.05.

The growth rates of AY.4.2 and AY.4 infected individuals were estimated by fitting an exponential model to the daily weighted prevalence using all REACT-1 data (all negatives and all AY.4/AY.4.2 associated positives) for rounds 14 and 15 assuming a Binomial likelihood. Weightings for individual REACT-1 samples were calculated using rim weighting [37] by: sex, deciles of the IMD, LTLA counts and ethnic group. Growth rates were then converted to estimates of the reproduction number R assuming a gamma-distributed generation time with the shape parameter, n = 2.29, and rate parameter, b = 0.36 [38] through the equation \((1 + \frac{r}{b}{)}^{n}\) [39]. The multiplicative R advantage of AY.4.2 over AY.4 was estimated using the entire posterior distribution of \({R}_{AY.4.2}/{R}_{AY.4}\) with the median and 95% credible interval reported.

For each lineage with more than 1 sample in a round the presence of clustering was assessed. The pairwise distance matrix between all n samples that were designated to a specific lineage was calculated and from this a mean pairwise distance was calculated for the lineage. Next, 10,000 random combinations of n positive individuals (n positive individuals chosen each time without replacement), for which any lineage was determined, were selected and for each combination the distance matrix and mean distance was calculated. The proportion of the 10,000 estimated mean distances below the lineage-specific mean distance was then calculated. Clustering was deemed to be significant if this proportion was less than 0.05.

For the 8 most prevalent lineages across rounds 14 and 15 Gaussian regression was performed to estimate the mean N- and E-gene Ct values for each lineage and p-values used to assess the significance of any difference to the reference lineage (AY.4). Models were run on all data (rounds 14 and 15 combined) and then run on data from each individual round as a sensitivity analysis.

The proportion of individuals reporting any symptoms in the month prior to swabbing and any of the most predictive COVID-19 symptoms in the month prior to swabbing was calculated for the 8 most prevalent lineages across rounds 14 and 15. P-values were estimated for each lineage relative to AY.4 by performing logistic regression with the symptom status as a binary variable (any symptoms vs no symptoms, and separately most predictive COVID-19 symptoms vs none of the most predictive COVID-19 symptoms). The sensitivity of the results that AY.4.2 is less likely to exhibit the most predictive COVID-19 symptoms, relative to AY.4, was assessed by fitting further logistic regression models including age, round of study and N-gene Ct value as covariates (E-gene was also investigated but was no different to using N-gene and so this was not included).

Results

Lineage diversity.

In round 14 the lineage was determined for 481 of 764 positive samples. All lineages were Delta or a Delta sub-lineage with the four most prevalent lineages being AY.4 at 65.1% (60.7%, 69.2%, n = 313), AY.43 at 6.0% (4.2%, 8.5%, n = 29), B.1.617.2 (original Delta variant) at 5.2% (3.6%, 7.6%, n = 25) and AY.4.2 at 4.6% (3.0%, 6.8%, n = 22) (Fig. 1-A, Additional file 2: Table S1). In round 15 the lineage was determined for 841 of 1399 positive samples. Again all samples were Delta or a Delta sub-lineage with the most prevalent lineages again being AY.4 at 57.6% (54.2%, 60.9%, n = 484), B.1.617.2 at 12.8% (10.8%, 15.3%, n = 108), AY.4.2 at 11.8% (9.8%, 14.1%, n = 99) and AY.43 at 4.8% (3.5%, 6.4%, n = 40). The next four most prevalent lineages over both rounds combined were AY.5, AY.6, AY.39, and AY.44. However, even a single detection of a lineage corresponded nationally to an average of 971 (95% CI [171, 5463]) individuals that would test swab-positive on any given day during round 14 and 1051 (95% CI [185, 5928]) individuals that would test swab-positive on any given day during round 15. During rounds 14 and 15 there were 33 and 31 unique lineages detected, respectively with 44 unique lineages detected overall. There was no apparent difference in genetic diversity between the two rounds as estimated by the Shannon diversity (p = 0.831) (Additional file 2: Table S2).

Distribution by region and age

During round 15 the proportion of B.1.617.2 was found to be highest in London at 22.1% (14.9%, 31.4%), being greater than the proportion in South East, East of England and Yorkshire and The Humber (Fig. 1B, Additional file 2: Table S3). Conversely, in round 14 and 15 the proportion of AY.4 was lowest in London at 48.1% (35.4%, 61.1%) and 44.2% (34.6%, 54.2%) respectively and was found to be higher in North West, West Midlands and Yorkshire and The Humber during both rounds (Fig. 1C, Additional file 2: Table S3). This reduced proportion of the nationally most prevalent lineage (AY.4) in London coincided with a higher level of genetic diversity in London. The Shannon diversity was highest in London during both rounds at 1.814 in round 14 and 1.809 in round 15 (p < 0.001 and p = 0.002 respectively, reference = West Midlands, Additional file 2: Table S2). Higher levels of genetic diversity were also found during both rounds in the South East and South West, relative to the West Midlands (which showed the lowest levels of genetic diversity in round 14 and the second lowest in round 15). There were no regional differences in the proportion of AY.4.2 during round 15 (Fig. 1D, Additional file 2: Table S3). Regional differences during round 14 and regional differences for other lineages could not be investigated due to small sample sizes but numbers are provided in Additional file 2: Table S4.

Sub-regional analysis was performed in order to investigate the presence of clustering in each round for each lineage (see Methods). Despite being highly geographically dispersed (Fig. 2) clustering was detected in round 14 for AY.4 (p = 0.037) and AY.4.2 (p = 0.029) (Additional file 2: Table S5). However, during round 15 clustering was no longer evident for both AY.4 (p = 0.706) and AY.4.2 (p = 0.067). The only lineage for which clustering was detected in round 15 was AY.6 (p = 0.003) which was found mainly in London and towards the South coast of England.

During round 15 the proportion of B.1.617.2 was higher in individuals ages 25–34 years old at 24.2% (12.8%, 41.0%) relative to those aged 35–44 years old at 8.0% (4.1%, 15.0%) (p = 0.026) (Additional file 2: Table S6). The proportion of AY.4 was found to be lower in 5–12 year olds at 52.1% (44.6%, 59.5%) relative to 35–44 year olds in which the proportion of AY.4 was 65.0% (55.3%, 73.6%) (p = 0.042) in round 15, while it was not in round 14.There were no differences between age groups in the proportion of AY.4.2 during round 15. Differences between age groups during round 14 for AY.4.2 and other lineages could not be investigated due to small sample sizes but numbers are provided in Additional file 2: Table S7.

Detection of increasing sub-lineages

Logistic regression models were fitted to the proportion of each lineage detected in either round 14 or 15, allowing daily growth rates in proportion to be estimated (Fig. 3, Additional file 2: Table S8). Of the 44 unique lineages detected, 6 were estimated to have growth rates different to zero. AY.4, AY.39, AY.98.1 and AY.111 were decreasing in proportion, whereas AY.4.2 and B.1.617.2 were increasing in proportion. The decrease in proportion of AY.4 corresponded to a daily growth rate of − 0.009 (− 0.015, − 0.003). The increase in proportions of B.1.617.2 and AY.4.2 corresponded to growth rates of 0.029 (0.017, 0.041) and 0.028 (0.016, 0.041) respectively.

Comparing estimates of the reproduction number R from round 14 to round 15 for AY.4 and AY.4.2 (see Methods) we estimate a multiplicative R advantage of 1.15 (1.08, 1.23), assuming no change in the generation time distribution.

Differences in cycle threshold values

There were quantitative differences between lineages in the N- and E-gene Ct values. The mean N- and E-gene Ct values were lowest for AY.6 though not materially lower than the values obtained for AY.4 (Fig. 4, Additional file 2: Table S9). Mean N-gene Ct value was 22.14 (20.30, 23.99) for AY.6 compared to 23.98 (23.68, 24.28) for AY.4 (p = 0.054). Mean E-gene Ct value was 20.74 (18.90, 22.59) for AY.6 compared to 22.46 (22.16, 22.76) for AY.4 (p = 0.071). Mean N- and E-gene Ct values were found to be comparable to AY.4 for both AY.4.2 and AY.5. Relative to AY.4, mean N- and E-gene Ct values for AY.43, AY.44, AY.39 and B.1.617.2 were all higher.

Differences in symptomatology

The proportion of individuals exhibiting the most predictive COVID-19 symptoms (loss or change of sense of taste, loss or change of sense of smell, new persistent cough, fever) in the month prior to swabbing was lower (p = 0.029) in those infected with AY.4.2 at 42.1% (33.1%, 51.5%) relative to those infected with AY.4 at 53.4% (49.7%, 57.1%) (Fig. 5A, Additional file 2: Table S10). This difference was not explained by patterns in age, round of the study or N-gene Ct value (Fig. 5B, Additional file 2: Table S11).

In addition, 68.6% (59.8%, 76.3%) of those infected with AY.4.2 reported any symptoms in the month prior to swabbing compared to 75.4% (72.2%, 78.3%) for those infected with AY.4 (p = 0.119). There were no differences evident in symptom reporting between AY.4 infected individuals and the other 6 most prevalent lineages (B.1.617.2, AY.5, AY.6, AY.43, AY.44 and AY.39).

Phylogeographic analysis

A relaxed molecular clock model was fit to the data and used to estimate a time-resolved phylogenetic tree (Fig. 6). AY.4.2 was found to populate two closely related clades that emerged in June/July 2021. AY.43, AY.5 and AY.6 were also observed to have distinct clade groupings having emerged around June/July 2021 as well. The mutation rates inferred at the tree’s tips showed a large degree of variation in all of the 8 most prevalent lineages. The mean mutation rate for AY.4.2 was found to be 0.57 (< 0.01, 1.10)\(\times 1{0}^{-4}\) lower than the mean mutation rate of AY.4 (p = 0.050) (Fig. 6, Additional file 2: Table S2). The mean mutation rate inferred for samples collected in round 15 was found to be 1.00 (0.70, 1.40)\(\times 1{0}^{-4}\) lower than the mean mutation rate for samples collected in round 14 (p < 0.001).

A mugration model was run on the time-resolved phylogenetic tree to estimate the relative virus migration rates between regions, a measure of inter-region transmission (Additional file 2: Table S13). Overall levels of inter-region transmission were lowest for the North East during round 14 and 15. The highest overall level of inter-region transmission was observed for the North West during round 14 and 15, but looking at individual rounds there were higher levels for Yorkshire and The Humber in round 14 and for the South East in round 15. High rates of transmission during round 14 and 15 were found between the North West and Yorkshire and The Humber, the West Midlands and the South East, and also between the South East and London.

Discussion

The proportion of AY.4.2 was found to be increasing between 9 September and 5 November 2021, as also reported in the routine data surveillance for England [11]. In round 15, AY.4.2 represented 11.8% of infections in line with other estimates [11]. This increase in proportion corresponded to a 15% increase in transmission advantage although this assumes the generation time distribution has remained constant; a decrease of the generation time distribution for AY.4.2 would also explain the increased growth but we are unable to test for this with prevalence data. In the past, the A222V mutation, associated with AY.4.2, increased in frequency but this was eventually deemed to be due to a founder effect and not a transmission advantage [40, 41]. Given the high levels of geographic dispersion (though with some clustering) during rounds 14 and 15 it is highly unlikely that a founder effect can explain the current growth, though we can not rule out a similar effect due to higher proportions of AY.4.2 in school-aged children (prevalence increased to a greater extent in school-aged children than in adults from July to September 2021 [4, 42]). However, as the proportion AY.4.2 was approximately constant by age in round 15 this growth advantage would not be detected into the future if this was the case.

Observed distributions of N- and E-gene Ct values were similar in AY.4.2 and AY.4 and so it is unlikely that the transmission advantage observed can be attributed to a higher viral load (a Ct 1 unit lower corresponds to an approximate twofold increase in viral load [43]). However, a reduced proportion of AY.4.2 infected individuals reporting symptoms could explain the increased transmissibility in multiple ways. Higher levels of asymptomatic infection could lead to greater levels of asymptomatic transmission. Further current testing procedures and government isolation advice in England heavily focus on the most predictive COVID-19 symptoms, which are reported less often by AY.4.2 infected individuals compared with AY.4. Thus, symptom-based policies could introduce an advantage for AY.4.2 over AY.4. Finally, the reduced level of symptom reporting could be indicative of greater levels of re-infection if AY.4.2 were more successful at evading the immune response. However, studies have found that vaccines are no less effective against AY.4.2 than other Delta sub-lineages [11] and vaccine-induced antibody neutralisation titres for AY.4.2 are similar to those for AY.4 and B.1.617.2 [44]. However, any possible evasion of the immune response caused by natural infections has yet to be investigated and the numbers reporting previous infection is too small and the proportion vaccinated too large in this REACT-1 dataset to allow a meaningful comparison (715 of 817 [87.5%] individuals aged 18 and over reported having had two vaccine doses). We found a moderately reduced mutation rate of AY.4.2 relative to AY.4 which may also have introduced a fitness advantage due to a smaller number of deleterious mutations [45, 46].

Other lineages

Though we have focused on AY.4.2 we have detected a diverse set of Delta sub-lineages, with even a single detection corresponding to approximately 1000 swab-positive infections in the community at one time during the study period. The short time over which AY.4.2 went from being an undeclared lineage to a variant under investigation shows how crucial it is to have careful surveillance of all lineages irrespective of frequency. For 38 of the 44 detected lineages, it was unable to be determined whether the proportion was increasing or decreasing.

Between rounds 14 and 15 a reduction in the mean mutation rate of the virus was detected suggesting a reduction in the rate of evolution. However, despite this slowdown evolution is still occurring and we observed an increase in the proportion of B.1.617.2, an indicator that the number of undeclared B.1.617.2 sub-lineages was increasing, suggesting even further diversity of Delta sub-lineages that have yet to be given a unique lineage designation. Further, though we capture the dynamics within England, SARS-CoV-2 is a global problem and new variants of concern can arise anywhere in the world and then spread through international travel. Higher proportions of B.1.617.2 were detected in London as well as higher levels of diversity; this likely reflects the role London continues to play in the introduction of international variants [47]. Within England, the North West region played a major role in the dissemination of the virus, having the greatest inferred rate of inter-region transmission.

Analysis of N- and E-gene Ct values found decreased levels in AY.4 and AY.4.2, which is unsurprising given both have successfully disseminated across the country, but AY.5 and AY.6 were also found to have similarly low Ct values suggesting similar viral loads; the mean N- and E-gene Ct value appeared slightly lower for AY.6 compared to AY.4. Clustering was also detected in round 15 for AY.6; careful consideration of AY.6 should be given in the future in case the current lack of growth so far reported [11] has only been due to its geographic isolation.

Limitations

We have presented the inferred dynamics between Delta sub-lineages in England between 9 September and 5 November 2021. Our sample's main strength over those obtained from routine surveillance is the random nature of the testing program leading to a relatively unbiased set of positive samples. However, as the sample sizes we obtain are relatively small compared with routine national surveillance our estimates have lower precision. Lineages were only successfully determined for ~ 61% of positive samples, with the ability to determine a lineage heavily influenced by a sample’s Ct value; this has potentially led to biases with lineages with lower Ct values more heavily represented in the dataset. Detecting distinct sub-lineages is a high-dimensional problem, with often many common mutations being shared between distinct lineages with only a small number of distinguishing mutations. This is exacerbated when all the lineages are highly related, as in the current nature of the pandemic in England where all samples are descendants of Delta (B.1.617.2), and can lead to incorrect designations [48]. Further, only sub-lineages that have been defined are able to be assigned to a sample. During the emergence of a new sub-lineage there is a phase of ambiguity when numbers are small and it is unclear if the mutations present warrant the declaration of a new sub-lineage. This can be seen in the detection of AY.4.2 and AY.43; both lineages had been circulating for months by October 2021 [11] but were not yet declared sub-lineages by pangoLEARN [28] in early October 2021, and so did not appear in the publicly available technical briefings [49]. The analysis of mutation rates using Gaussian regression may also have included biases as individual measurements of mutation rates would not have independent and identically distributed normal errors, a key assumption of these linear models.

Conclusions

Since the beginning of the pandemic, selective pressure has led to rapid evolution in the spike protein [50] driving leaps in transmissibility [5]. However, as a greater proportion of the population acquires immunity through either infection or vaccination there will be a shift in evolutionary pressure towards immune escape. Even in England where there are high levels of vaccination and past infection, new variants such as AY.4.2 have emerged with advantages over previous strains. With the continued emergence of variants able to evade population immunity and undergoing transmission, SARS-CoV-2 is highly unlikely to ever undergo local extinction and is likely moving towards a state of endemicity. At the point of endemicity it is probable that adaptive evolution would more closely resemble the continual antigenic drift observed in influenza H3N2 [51, 52]. As the evolutionary phase of SARS-CoV-2 progresses towards endemicity, continued surveillance is paramount in not only detecting increased levels of transmissibility for specific lineages, but in also better characterising the mechanism behind such changes and informing policy around testing (including case definitions). Representative community studies such as REACT-1 can be useful in measuring the relative growth of lineages and in characterising differences in viral loads, symptomatology and geographic distribution.

Availability of data and materials

Access to REACT-1 data is restricted due to ethical and security considerations. Summary statistics and descriptive tables from the current REACT-1 study are available in the Additional file 2. Additional summary statistics and results from the REACT-1 programme are also available at https://www.imperial.ac.uk/medicine/research-and-impact/groups/react-study/real-time-assessment-of-community-transmission-findings/ and https://github.com/mrc-ide/reactidd/tree/master/inst/extdata REACT-1 Study Materials are available for each round at https://www.imperial.ac.uk/medicine/research-and-impact/groups/react-study/react-1-study-materials/. Sequence read data are available without restriction from the European Nucleotide Archive at https://www.ebi.ac.uk/ena/browser/view/PRJEB37886, and consensus genome sequences are available from the Global initiative on sharing all influenza data at https://www.gisaid.org. Accession numbers are provided in the Additional file 1.

References

Lineage B.1.617.2 Pangolin report. https://cov-lineages.org/global_report_B.1.617.2.html. Accessed 26 Nov 2021.
World Health Organisation. COVID-19 Weekly Epidemiological Update - Edition 66. 2021. https://www.who.int/publications/m/item/weekly-epidemiological-update-on-covid-19---16-november-2021. Accessed 23 Nov 2021.
PHE Genomics Cell, PHE Outbreak Surveillance Team, PHE Epidemiology Cell, PHE Contact Tracing Data Team, PHE Health, Protection Data Science Team, PHE Joint Modelling Team, NHS Test and Trace Joint Biosecurity Centre, Public Health Scotland and EAVE group, Contributions from the Variant Technical Group Members. SARS-CoV-2 variants of concern and variants under investigation in England - Technical briefing 15, 11 June 2021. https://assets.publishing.service.gov.uk/government/uploads/system/uploads/attachment_data/file/993879/Variants_of_Concern_VOC_Technical_Briefing_15.pdf
Elliott P, Haw D, Wang H, Eales O, Walters CE, Ainslie KEC, et al. Exponential growth, high prevalence of SARS-CoV-2, and vaccine effectiveness associated with the Delta variant. Science. 2021;374:eabl9551.
Article CAS Google Scholar
Obermeyer F, Schaffner SF, Jankowiak M, Barkas N, Pyle JD, Park DJ, et al. Analysis of 2.1 million SARS-CoV-2 genomes identifies mutations associated with transmissibility. https://doi.org/10.1101/2021.09.07.21263228
Bernal JL, Andrews N, Gower C, Gallagher E, Simmons R, Thelwall S, et al. Effectiveness of COVID-19 vaccines against the B.1.617.2 variant. medRxiv. 2021; 2021.05.22.21257658.
Baj A, Novazzi F, Drago Ferrante F, Genoni A, Tettamanzi E, Catanoso G, et al. Spike protein evolution in the SARS-CoV-2 Delta variant of concern: a case series from Northern Lombardy. Emerg Microbes Infect. 2021;10:2010–5.
Article CAS Google Scholar
Kistler KE, Huddleston J, Bedford T. Rapid and parallel adaptive mutations in spike S1 drive clade success in SARS-CoV-2. BioRxiv. 2021. https://doi.org/10.1101/2021.09.11.459844.
Article Google Scholar
Rambaut A, Holmes EC, O’Toole Á, Hill V, McCrone JT, Ruis C, et al. A dynamic nomenclature proposal for SARS-CoV-2 lineages to assist genomic epidemiology. Nat Microbiol. 2020;5:1403–7.
Article CAS Google Scholar
UKHSA Genomics Cell UKHSA Outbreak Surveillance Team UKHSA Epidemiology Cell UKHSA Contact Tracing Data Team UKHSA International Cell UKHSA Environmental Monitoring for Health Protection Team. SARS-CoV-2 variants of concern and variants under investigation in England - Technical briefing 27, 29 October 2021. https://assets.publishing.service.gov.uk/government/uploads/system/uploads/attachment_data/file/1029715/technical-briefing-27.pdf
UKHSA Genomics Cell UKHSA Outbreak Surveillance Team UKHSA Epidemiology Cell UKHSA Contact Tracing Data Team UKHSA International Cell UKHSA Environmental Monitoring for Health Protection Team. SARS-CoV-2 variants of concern and variants under investigation in England - Technical briefing 28, 12 November 2021. https://assets.publishing.service.gov.uk/government/uploads/system/uploads/attachment_data/file/1033101/Technical_Briefing_28_12_Nov_2021.pdf
UKHSA Genomics Cell UKHSA Outbreak Surveillance Team UKHSA Epidemiology Cell UKHSA Contact Tracing Data Team UKHSA International Cell UKHSA Environmental Monitoring for Health Protection Team. SARS-CoV-2 variants of concern and variants under investigation in England - Technical briefing 26, 22 October 2021. https://assets.publishing.service.gov.uk/government/uploads/system/uploads/attachment_data/file/1028113/Technical_Briefing_26.pdf
Lineage AY.4.2 Pangolin report. https://cov-lineages.org/lineage.html?lineage=AY.4.2. Accessed 23 Nov 2021.
Alaa Abdel Latif, Julia L. Mullen, Manar Alkuzweny, Ginger Tsueng, Marco Cano, Emily Haag, Jerry Zhou, Mark Zeller, Emory Hufbauer, Nate Matteson, Chunlei Wu, Kristian G. Andersen, Andrew I. Su, Karthik Gangavarapu, Laura D. Hughes, and the Center for Viral Systems Biology. AY.4 lineage report, outbreak.info. https://outbreak.info/situation-reports?pango=AY.4.2. Accessed 23 Nov 2021.
Lineage AY.4 Pangolin report. https://cov-lineages.org/lineage.html?lineage=AY.4. Accessed 23 Nov 2021.
Umair M, Ikram A, Rehman Z, Haider A, Badar N, Ammar M, et al. Genomic diversity of SARS-CoV-2 in Pakistan during fourth wave of pandemic. BioRxiv. 2021. https://doi.org/10.1101/2021.09.30.21264343.
Article Google Scholar
Danish Covid-19 Genome Consortium. Genomic overview of SARS-CoV-2 in Denmark, 19 November 2021. https://www.covid19genomics.dk/statistics. Accessed 23 Nov 2021.
Official UK Coronavirus Dashboard. https://coronavirus.data.gov.uk/. Accessed 17 May 2021.
Latest insights team. Coronavirus (COVID-19) latest insights - Office for National Statistics. Office for National Statistics; 23 Nov 2021. https://www.ons.gov.uk/peoplepopulationandcommunity/healthandsocialcare/conditionsanddiseases/articles/coronaviruscovid19latestinsights/antibodies. Accessed 25 Nov 2021.
Riley S, Ainslie KEC, Eales O, Walters CE, Wang H, Atchison C, et al. Resurgence of SARS-CoV-2: detection by community viral surveillance. Science. 2021. https://doi.org/10.1126/science.abf0874.
Article PubMed PubMed Central Google Scholar
Ricoca Peixoto V, Nunes C, Abrantes A. Epidemic surveillance of Covid-19: considering uncertainty and under-ascertainment. Portuguese J Public Health. 2020;38:23–9.
Article Google Scholar
Feng S, Phillips DJ, White T, Sayal H, Aley PK, Bibi S, et al. Correlates of protection against symptomatic and asymptomatic SARS-CoV-2 infection. Nat Med. 2021;27:2032–40.
Article CAS Google Scholar
Riley S, Atchison C, Ashby D, Donnelly CA, Barclay W, Cooke G, et al. REal-time Assessment of Community Transmission (REACT) of SARS-CoV-2 virus: study protocol. Wellcome Open Res. 2020;5:200.
Article Google Scholar
Quick J. nCoV-2019 sequencing protocol v3 (LoCost). 2020. https://www.protocols.io/view/ncov-2019-sequencing-protocol-v3-locost-bh42j8ye. Accessed 4 May 2021.
Baker DJ, Aydin A, Le-Viet T, Kay GL, Rudder S, de Oliveira ML, et al. CoronaHiT: high-throughput sequencing of SARS-CoV-2 genomes. Genome Med. 2021;13:21.
Article CAS Google Scholar
A Nextflow pipeline for running the ARTIC network’s field bioinformatics tools. Github. https://github.com/connor-lab/ncov2019-artic-nf
Connor TR, Loman NJ, Thompson S, Smith A, Southgate J, Poplawski R, et al. CLIMB (the Cloud Infrastructure for Microbial Bioinformatics): an online resource for the medical microbiology community. Microb Genom. 2016;2: e000086.
PubMed PubMed Central Google Scholar
Phylogenetic Assignment of Named Global Outbreak LINeages (PANGOLIN). Github; https://github.com/cov-lineages/pangolin
Nguyen L-T, Schmidt HA, von Haeseler A, Minh BQ. IQ-TREE: a fast and effective stochastic algorithm for estimating maximum-likelihood phylogenies. Mol Biol Evol. 2015;32:268–74.
Article CAS Google Scholar
Sagulenko P, Puller V, Neher RA. TreeTime: maximum-likelihood phylodynamic analysis. Virus Evol. 2018;4:vex042.
Article Google Scholar
Wilson EB. Probable inference, the law of succession, and statistical inference. J Am Stat Assoc. 1927;22:209–12.
Article Google Scholar
Brown LD, Tony Cai T, DasGupta A. Interval estimation for a binomial proportion. SSO Schweiz Monatsschr Zahnheilkd. 2001;16:101–33.
Google Scholar
Chadeau-Hyam M, Eales O, Bodinier B, Wang H, Haw D, Whitaker M, et al. REACT-1 round 15 final report: Increased breakthrough SARS-CoV-2 infections among adults who had received two doses of vaccine, but booster doses and first doses in children are providing important protection. 2021. http://spiral.imperial.ac.uk/handle/10044/1/92501. Accessed 26 Nov 2021.
Park N. Population estimates for the UK, England and Wales, Scotland and Northern Ireland - Office for National Statistics. Office for National Statistics; 24 Jun 2021. https://www.ons.gov.uk/peoplepopulationandcommunity/populationandmigration/populationestimates/bulletins/annualmidyearpopulationestimates/mid2020. Accessed 11 Aug 2021.
Spellerberg IF, Fedor PJ. A tribute to Claude Shannon (1916–2001) and a plea for more rigorous use of species richness, species diversity and the “Shannon-Wiener” Index. Glob Ecol Biogeogr. 2003;12:177–9.
Article Google Scholar
Hutcheson K. A test for comparing diversities based on the Shannon formula. J Theor Biol. 1970;29:151–4.
Article CAS Google Scholar
Sharot T. Weighting survey results. J Mark Res Soc. 1986;28:269–84.
Google Scholar
Bi Q, Wu Y, Mei S, Ye C, Zou X, Zhang Z, et al. Epidemiology and Transmission of COVID-19 in Shenzhen China: Analysis of 391 cases and 1,286 of their close contacts. MedRxiv. 2020. https://www.medrxiv.org/content/medrxiv/early/2020/03/19/2020.03.03.20028423.full.pdf
Wallinga J, Lipsitch M. How generation intervals shape the relationship between growth rates and reproductive numbers. Proc Biol Sci. 2007;274:599–604.
CAS PubMed Google Scholar
Díez-Fuertes F, Iglesias-Caballero M, García-Pérez J, Monzón S, Jiménez P, Varona S, et al. A founder effect led early SARS-CoV-2 transmission in Spain. J Virol. 2021. https://doi.org/10.1128/JVI.01583-20.
Article PubMed PubMed Central Google Scholar
Hodcroft EB, Zuber M, Nadeau S, Vaughan TG, Crawford KHD, Althaus CL, et al. Spread of a SARS-CoV-2 variant through Europe in the summer of 2020. Nature. 2021;595:707–12.
Article CAS Google Scholar
Chadeau-Hyam M, Wang H, Eales O, Haw D, Bodinier B, Whitaker M, et al. REACT-1 study round 14: High and increasing prevalence of SARS-CoV-2 infection among school-aged children during September 2021 and vaccine effectiveness against infection in England. medRxiv. 2021; 2021.10.14.21264965.
Yelin I, Aharony N, Tamar ES, Argoetti A, Messer E, Berenbaum D, et al. Evaluation of COVID-19 RT-qPCR Test in Multi sample Pools. Clin Infect Dis. 2020;71:2073–8.
Article CAS Google Scholar
Lassaunière R, Polacek C, Fonager J, Bennedbæk M, Boding L, Rasmussen M, et al. Neutralisation of the SARS-CoV-2 Delta sub-lineage AY.4.2 and B.1.617.2 + E484K by BNT162b2 mRNA vaccine-elicited sera. bioRxiv. 2021. https://doi.org/10.1101/2021.11.08.21266075.
Article Google Scholar
Peck KM, Lauring AS. Complexities of Viral Mutation Rates. J Virol. 2018. https://doi.org/10.1128/JVI.01031-17.
Article PubMed PubMed Central Google Scholar
Koelle K, Rasmussen DA. The effects of a deleterious mutation load on patterns of influenza A/H3N2’s antigenic evolution in humans. Elife. 2015;4: e07361.
Article Google Scholar
Eales O, Page AJ, Tang SN, Walters CE, Wang H, Haw D, et al. SARS-CoV-2 lineage dynamics in England from January to March 2021 inferred from representative community samples. medRxiv. 2021. https://doi.org/10.1101/2021.05.08.21256867.
Article Google Scholar
O’Toole Á, Scher E, Underwood A, Jackson B, Hill V, McCrone JT, et al. Assignment of epidemiological lineages in an emerging pandemic using the pangolin tool. Virus Evol. 2021;7:vea0b64.
Article Google Scholar
UKHSA Genomics Cell UKHSA Outbreak Surveillance Team UKHSA Epidemiology Cell UKHSA Contact Tracing Data Team UKHSA International Cell UKHSA Environmental Monitoring for Health Protection Team. SARS-CoV-2 variants of concern and variants under investigation in England - Technical briefing 25, 1 October 2021. https://assets.publishing.service.gov.uk/government/uploads/system/uploads/attachment_data/file/1025827/Technical_Briefing_25.pdf
Saputri DS, Li S, van Eerden FJ, Rozewicki J, Xu Z, Ismanto HS, et al. Flexible, functional, and familiar: characteristics of SARS-CoV-2 spike protein evolution. Front Microbiol. 2020;11:2112.
Article Google Scholar
Bedford T, Riley S, Barr IG, Broor S, Chadha M, Cox NJ, et al. Global circulation patterns of seasonal influenza viruses vary with antigenic drift. Nature. 2015;523:217–20.
Article CAS Google Scholar
Andrew Rambaut, Oliver G. Pybus, Martha I. Nelson, Cecile Viboud, Jeffery K. Taubenberger, Edward C. Holmes. The genomic and epidemiological dynamics of human influenza A virus. Nature. 2008. https://www.nature.com/articles/nature06945

Download references

Acknowledgements

MC-H acknowledges support from the H2020-EXPANSE project (Horizon 2020 Grant No 874627). MC-H and BB acknowledge support from Cancer Research UK, Population Research Committee Project grant 'Mechanomics’ (Grant No 22184 to MC-H). CAD acknowledges support from the MRC Centre for Global Infectious Disease Analysis and National Institute for Health Research (NIHR) Health Protection Research Unit (HPRU). GC is supported by an NIHR Professorship. HW acknowledges support from an NIHR Senior Investigator Award and the Wellcome Trust (205456/Z/16/Z). PE is Director of the Medical Research Council (MRC) Centre for Environment and Health (MR/L01341X/1, MR/S019669/1). PE acknowledges support from Health Data Research UK (HDR UK); the NIHR Imperial Biomedical Research Centre; NIHR Health Protection Research Units in Chemical and Radiation Threats and Hazards, and Environmental Exposures and Health; the British Heart Foundation Centre for Research Excellence at Imperial College London (RE/18/4/34215); and the UK Dementia Research Institute at Imperial College London (MC_PC_17114). AJP acknowledges the support of the Biotechnology and Biological Sciences Research Council (BB/R012504/1). We thank The Huo Family Foundation for their support of our work on COVID-19.

We thank key collaborators on this work—Ipsos MORI: Kelly Beaver, Sam Clemens, Gary Welch, Nicholas Gilby, Kelly Ward, Galini Pantelidou and Kevin Pickering; Institute of Global Health Innovation at Imperial College London: Gianluca Fontana, Justine Alford; School of Public Health, Imperial College London: Eric Johnson, Rob Elliott, Graham Blakoe; Quadram Institute, Norwich, UK: Alexander J. Trotter; North West London Pathology and Public Health England (now UKHSA) for help in calibration of the laboratory analyses; Patient Experience Research Centre at Imperial College London and the REACT Public Advisory Panel; NHS Digital for access to the NHS register; the Department of Health and Social Care for logistic support.

The COVID-19 Genomics UK (COG-UK) consortium—June 2021 V.1

Funding acquisition, Leadership and supervision, Metadata curation, Project administration, Samples and logistics, Sequencing and analysis, Software and analysis tools, and Visualisation:

Dr Samuel C Robson^13,84

Funding acquisition, Leadership and supervision, Metadata curation, Project administration, Samples and logistics, Sequencing and analysis, and Software and analysis tools:

Dr Thomas R Connor^11,74 and Prof Nicholas J Loman⁴³

Leadership and supervision, Metadata curation, Project administration, Samples and logistics, Sequencing and analysis, Software and analysis tools, and Visualisation:

Dr Tanya Golubchik⁵

Funding acquisition, Leadership and supervision, Metadata curation, Samples and logistics, Sequencing and analysis, and Visualisation:

Dr Rocio T Martinez Nunez⁴⁶

Funding acquisition, Leadership and supervision, Project administration, Samples and logistics, Sequencing and analysis, and Software and analysis tools:

Dr David Bonsall⁵

Funding acquisition, Leadership and supervision, Project administration, Sequencing and analysis, Software and analysis tools, and Visualisation:

Prof Andrew Rambaut¹⁰⁴

Funding acquisition, Metadata curation, Project administration, Samples and logistics, Sequencing and analysis, and Software and analysis tools:

Dr Luke B Snell¹²

Leadership and supervision, Metadata curation, Project administration, Samples and logistics, Software and analysis tools, and Visualisation:

Rich Livett¹¹⁶

Funding acquisition, Leadership and supervision, Metadata curation, Project administration, and Samples and logistics:

Dr Catherine Ludden^20,70

Funding acquisition, Leadership and supervision, Metadata curation, Samples and logistics, and Sequencing and analysis:

Dr Sally Corden⁷⁴ and Dr Eleni Nastouli^96,95,30

Funding acquisition, Leadership and supervision, Metadata curation, Sequencing and analysis, and Software and analysis tools:

Dr Gaia Nebbia¹²

Funding acquisition, Leadership and supervision, Project administration, Samples and logistics, and Sequencing and analysis:

Ian Johnston¹¹⁶

Leadership and supervision, Metadata curation, Project administration, Samples and logistics, and Sequencing and analysis:

Prof Katrina Lythgoe⁵, Dr M. Estee Torok^19,20 and Prof Ian G Goodfellow²⁴

Leadership and supervision, Metadata curation, Project administration, Samples and logistics, and Visualisation:

Dr Jacqui A Prieto^97,82 and Dr Kordo Saeed^97,83

Leadership and supervision, Metadata curation, Project administration, Sequencing and analysis, and Software and analysis tools:

Dr David K Jackson¹¹⁶

Leadership and supervision, Metadata curation, Samples and logistics, Sequencing and analysis, and Visualisation:

Dr Catherine Houlihan^96,94

Leadership and supervision, Metadata curation, Sequencing and analysis, Software and analysis tools, and Visualisation:

Dr Dan Frampton^94,95

Metadata curation, Project administration, Samples and logistics, Sequencing and analysis, and Software and analysis tools:

Dr William L Hamilton¹⁹ and Dr Adam A Witney⁴¹

Funding acquisition, Samples and logistics, Sequencing and analysis, and Visualisation:

Dr Giselda Bucca¹⁰¹

Funding acquisition, Leadership and supervision, Metadata curation, and Project administration:

Dr Cassie F Pope^40,41

Funding acquisition, Leadership and supervision, Metadata curation, and Samples and logistics:

Dr Catherine Moore⁷⁴

Funding acquisition, Leadership and supervision, Metadata curation, and Sequencing and analysis:

Prof Emma C Thomson⁵³

Funding acquisition, Leadership and supervision, Project administration, and Samples and logistics:

Dr Ewan M Harrison^116,102

Funding acquisition, Leadership and supervision, Sequencing and analysis, and Visualisation:

Prof Colin P Smith¹⁰¹

Leadership and supervision, Metadata curation, Project administration, and Sequencing and analysis:

Fiona Rogan⁷⁷

Leadership and supervision, Metadata curation, Project administration, and Samples and logistics:

Shaun M Beckwith ⁶, Abigail Murray ⁶, Dawn Singleton ⁶, Dr Kirstine Eastick ³⁷, Dr Liz A Sheridan ⁹⁸, Paul Randell ⁹⁹, Dr Leigh M Jackson ¹⁰⁵, Dr Cristina V Ariani ¹¹⁶ and Dr Sónia Gonçalves¹¹⁶

Leadership and supervision, Metadata curation, Samples and logistics, and Sequencing and analysis:

Dr Derek J Fairley ^3,77, Prof Matthew W Loose ¹⁸ and Joanne Watkins⁷⁴

Leadership and supervision, Metadata curation, Samples and logistics, and Visualisation:

Dr Samuel Moses^25,106

Leadership and supervision, Metadata curation, Sequencing and analysis, and Software and analysis tools:

Dr Sam Nicholls ⁴³, Dr Matthew Bull ⁷⁴ and Dr Roberto Amato¹¹⁶

Leadership and supervision, Project administration, Samples and logistics, and Sequencing and analysis:

Prof Darren L Smith^36,65,66

Leadership and supervision, Sequencing and analysis, Software and analysis tools, and Visualisation:

Prof David M Aanensen^14,116 and Dr Jeffrey C Barrett¹¹⁶

Metadata curation, Project administration, Samples and logistics, and Sequencing and analysis:

Dr Dinesh Aggarwal^20,116,70, Dr James G Shepherd ⁵³, Dr Martin D Curran ⁷¹ and Dr Surendra Parmar⁷¹

Metadata curation, Project administration, Sequencing and analysis, and Software and analysis tools:

Dr Matthew D Parker¹⁰⁹

Metadata curation, Samples and logistics, Sequencing and analysis, and Software and analysis tools:

Dr Catryn Williams⁷⁴

Metadata curation, Samples and logistics, Sequencing and analysis, and Visualisation:

Dr Sharon Glaysher⁶⁸

Metadata curation, Sequencing and analysis, Software and analysis tools, and Visualisation:

Dr Anthony P Underwood ^14,116, Dr Matthew Bashton ^36,65, Dr Nicole Pacchiarini ⁷⁴, Dr Katie F Loveson⁸⁴ and Matthew Byott^95,96

Project administration, Sequencing and analysis, Software and analysis tools, and Visualisation:

Dr Alessandro M Carabelli²⁰

Funding acquisition, Leadership and supervision, and Metadata curation:

Dr Kate E Templeton^56,104

Funding acquisition, Leadership and supervision, and Project administration:

Dr Thushan I de Silva¹⁰⁹, Dr Dennis Wang¹⁰⁹, Dr Cordelia F Langford¹¹⁶ and John Sillitoe¹¹⁶

Funding acquisition, Leadership and supervision, and Samples and logistics:

Prof Rory N Gunson⁵⁵

Funding acquisition, Leadership and supervision, and Sequencing and analysis:

Dr Simon Cottrell⁷⁴, Dr Justin O’Grady^75,103 and Prof Dominic Kwiatkowski^116,108

Leadership and supervision, Metadata curation, and Project administration:

Dr Patrick J Lillie³⁷

Leadership and supervision, Metadata curation, and Samples and logistics:

Dr Nicholas Cortes³³, Dr Nathan Moore³³, Dr Claire Thomas³³, Phillipa J Burns³⁷, Dr Tabitha W Mahungu⁸⁰ and Steven Liggett⁸⁶

Leadership and supervision, Metadata curation, and Sequencing and analysis:

Angela H Beckett^13,81 and Prof Matthew TG Holden⁷³

Leadership and supervision, Project administration, and Samples and logistics:

Dr Lisa J Levett³⁴, Dr Husam Osman^70,35 and Dr Mohammed O Hassan-Ibrahim⁹⁹

Leadership and supervision, Project administration, and Sequencing and analysis:

Dr David A Simpson⁷⁷

Leadership and supervision, Samples and logistics, and Sequencing and analysis:

Dr Meera Chand⁷², Prof Ravi K Gupta¹⁰², Prof Alistair C Darby¹⁰⁷ and Prof Steve Paterson¹⁰⁷

Leadership and supervision, Sequencing and analysis, and Software and analysis tools:

Prof Oliver G Pybus²³, Dr Erik M Volz³⁹, Prof Daniela de Angelis⁵², Prof David L Robertson⁵³, Dr Andrew J Page⁷⁵ and Dr Inigo Martincorena¹¹⁶

Leadership and supervision, Sequencing and analysis, and Visualisation:

Dr Louise Aigrain¹¹⁶ and Dr Andrew R Bassett¹¹⁶

Metadata curation, Project administration, and Samples and logistics:

Dr Nick Wong⁵⁰, Dr Yusri Taha⁸⁹, Michelle J Erkiert⁹⁹ and Dr Michael H Spencer Chapman^116,102

Metadata curation, Project administration, and Sequencing and analysis:

Dr Rebecca Dewar⁵⁶ and Martin P McHugh^56,111

Metadata curation, Project administration, and Software and analysis tools:

Siddharth Mookerjee^38,57

Metadata curation, Project administration, and Visualisation:

Stephen Aplin⁹⁷, Matthew Harvey⁹⁷, Thea Sass⁹⁷, Dr Helen Umpleby⁹⁷ and Helen Wheeler⁹⁷

Metadata curation, Samples and logistics, and Sequencing and analysis:

Dr James P McKenna³, Dr Ben Warne⁹, Joshua F Taylor²², Yasmin Chaudhry²⁴, Rhys Izuagbe²⁴, Dr Aminu S Jahun²⁴, Dr Gregory R Young ^6,65, Dr Claire McMurray⁴³, Dr Clare M McCann^65,66, Dr Andrew Nelson^65,66 and Scott Elliott⁶⁸

Metadata curation, Samples and logistics, and Visualisation:

Hannah Lowe²⁵

Metadata curation, Sequencing and analysis, and Software and analysis tools:

Dr Anna Price¹¹, Matthew R Crown⁶⁵, Dr Sara Rey⁷⁴, Dr Sunando Roy⁹⁶ and Dr Ben Temperton¹⁰⁵

Metadata curation, Sequencing and analysis, and Visualisation:

Dr Sharif Shaaban⁷³ and Dr Andrew R Hesketh¹⁰¹

Project administration, Samples and logistics, and Sequencing and analysis:

Dr Kenneth G Laing⁴¹, Dr Irene M Monahan⁴¹ and Dr Judith Heaney^95,96,34

Project administration, Samples and logistics, and Visualisation:

Dr Emanuela Pelosi⁹⁷, Siona Silviera⁹⁷ and Dr Eleri Wilson-Davies⁹⁷

Samples and logistics, Software and analysis tools, and Visualisation:

Dr Helen Fryer⁵

Sequencing and analysis, Software and analysis tools, and Visualization:

Dr Helen Adams⁴, Dr Louis du Plessis²³, Dr Rob Johnson³⁹, Dr William T Harvey^53,42, Dr Joseph Hughes⁵³, Dr Richard J Orton⁵³, Dr Lewis G Spurgin⁵⁹, Dr Yann Bourgeois⁸¹, Dr Chris Ruis¹⁰², Áine O'Toole¹⁰⁴, Marina Gourtovaia¹¹⁶ and Dr Theo Sanderson¹¹⁶

Funding acquisition, and Leadership and supervision:

Dr Christophe Fraser⁵, Dr Jonathan Edgeworth¹², Prof Judith Breuer^96,29, Dr Stephen L Michell¹⁰⁵ and Prof John A Todd¹¹⁵

Funding acquisition, and Project administration:

Michaela John¹⁰ and Dr David Buck¹¹⁵

Leadership and supervision, and Metadata curation:

Dr Kavitha Gajee³⁷ and Dr Gemma L Kay⁷⁵

Leadership and supervision, and Project administration:

Prof Sharon J Peacock^20,70 and David Heyburn⁷⁴

Leadership and supervision, and Samples and logistics:

Katie Kitchman³⁷, Prof Alan McNally^43,93, David T Pritchard⁵⁰, Dr Samir Dervisevic⁵⁸, Dr Peter Muir⁷⁰, Dr Esther Robinson^70,35, Dr Barry B Vipond⁷⁰, Newara A Ramadan⁷⁸, Dr Christopher Jeanes⁹⁰, Danni Weldon¹¹⁶, Jana Catalan¹¹⁸ and Neil Jones¹¹⁸

Leadership and supervision, and Sequencing and analysis:

Dr Ana da Silva Filipe⁵³, Dr Chris Williams⁷⁴, Marc Fuchs⁷⁷, Dr Julia Miskelly⁷⁷, Dr Aaron R Jeffries¹⁰⁵, Karen Oliver¹¹⁶ and Dr Naomi R Park¹¹⁶

Metadata curation, and Samples and logistics:

Amy Ash¹, Cherian Koshy¹, Magdalena Barrow⁷, Dr Sarah L Buchan⁷, Dr Anna Mantzouratou⁷, Dr Gemma Clark¹⁵, Dr Christopher W Holmes¹⁶, Sharon Campbell¹⁷, Thomas Davis²¹, Ngee Keong Tan²², Dr Julianne R Brown²⁹, Dr Kathryn A Harris^29,2, Stephen P Kidd³³, Dr Paul R Grant³⁴, Dr Li Xu-McCrae³⁵, Dr Alison Cox^38,63, Pinglawathee Madona^38,63, Dr Marcus Pond^38,63, Dr Paul A Randell^38,63, Karen T Withell⁴⁸, Cheryl Williams ⁵¹, Dr Clive Graham⁶⁰, Rebecca Denton-Smith⁶², Emma Swindells⁶², Robyn Turnbull⁶², Dr Tim J Sloan⁶⁷, Dr Andrew Bosworth^70,35, Stephanie Hutchings⁷⁰, Hannah M Pymont⁷⁰, Dr Anna Casey⁷⁶, Dr Liz Ratcliffe⁷⁶, Dr Christopher R Jones^79,105, Dr Bridget A Knight^79,105, Dr Tanzina Haque⁸⁰, Dr Jennifer Hart⁸⁰, Dr Dianne Irish-Tavares⁸⁰, Eric Witele⁸⁰, Craig Mower⁸⁶, Louisa K Watson⁸⁶, Jennifer Collins⁸⁹, Gary Eltringham⁸⁹, Dorian Crudgington⁹⁸, Ben Macklin⁹⁸, Prof Miren Iturriza-Gomara¹⁰⁷, Dr Anita O Lucaci¹⁰⁷ and Dr Patrick C McClure¹¹³

Metadata curation, and Sequencing and analysis:

Matthew Carlile¹⁸, Dr Nadine Holmes¹⁸, Dr Christopher Moore¹⁸, Dr Nathaniel Storey²⁹, Dr Stefan Rooke⁷³, Dr Gonzalo Yebra⁷³, Dr Noel Craine⁷⁴, Malorie Perry⁷⁴, Dr Nabil-Fareed Alikhan⁷⁵, Dr Stephen Bridgett⁷⁷, Kate F Cook ⁸⁴, Christopher Fearn⁸⁴, Dr Salman Goudarzi⁸⁴, Prof Ronan A Lyons⁸⁸, Dr Thomas Williams¹⁰⁴, Dr Sam T Haldenby¹⁰⁷, Jillian Durham¹¹⁶ and Dr Steven Leonard¹¹⁶

Metadata curation, and Software and analysis tools:

Robert M Davies¹¹⁶

Project administration, and Samples and logistics:

Dr Rahul Batra¹², Beth Blane²⁰, Dr Moira J Spyer^30,95,96, Perminder Smith^32,112, Mehmet Yavus^85,109, Dr Rachel J Williams⁹⁶, Dr Adhyana IK Mahanama⁹⁷, Dr Buddhini Samaraweera⁹⁷, Sophia T Girgis¹⁰², Samantha E Hansford¹⁰⁹, Dr Angie Green¹¹⁵, Dr Charlotte Beaver¹¹⁶, Katherine L Bellis^116,102, Matthew J Dorman¹¹⁶, Sally Kay¹¹⁶, Liam Prestwood¹¹⁶ and Dr Shavanthi Rajatileka¹¹⁶

Project administration, and Sequencing and analysis:

Dr Joshua Quick⁴³

Project administration, and Software and analysis tools:

Radoslaw Poplawski⁴³

Samples and logistics, and Sequencing and analysis:

Dr Nicola Reynolds⁸, Andrew Mack¹¹, Dr Arthur Morriss¹¹, Thomas Whalley¹¹, Bindi Patel¹², Dr Iliana Georgana²⁴, Dr Myra Hosmillo²⁴, Malte L Pinckert²⁴, Dr Joanne Stockton⁴³, Dr John H Henderson⁶⁵, Amy Hollis⁶⁵, Dr William Stanley⁶⁵, Dr Wen C Yew⁶⁵, Dr Richard Myers⁷², Dr Alicia Thornton⁷², Alexander Adams⁷⁴, Tara Annett⁷⁴, Dr Hibo Asad⁷⁴, Alec Birchley⁷⁴, Jason Coombes⁷⁴, Johnathan M Evans⁷⁴, Laia Fina⁷⁴, Bree Gatica-Wilcox⁷⁴, Lauren Gilbert⁷⁴, Lee Graham⁷⁴, Jessica Hey⁷⁴, Ember Hilvers⁷⁴, Sophie Jones⁷⁴, Hannah Jones⁷⁴, Sara Kumziene-Summerhayes⁷⁴, Dr Caoimhe McKerr⁷⁴, Jessica Powell⁷⁴, Georgia Pugh⁷⁴, Sarah Taylor⁷⁴, Alexander J Trotter⁷⁵, Charlotte A Williams⁹⁶, Leanne M Kermack¹⁰², Benjamin H Foulkes¹⁰⁹, Marta Gallis¹⁰⁹, Hailey R Hornsby¹⁰⁹, Stavroula F Louka ¹⁰⁹, Dr Manoj Pohare¹⁰⁹, Paige Wolverson¹⁰⁹, Peijun Zhang¹⁰⁹, George MacIntyre-Cockett¹¹⁵, Amy Trebes¹¹⁵, Dr Robin J Moll¹¹⁶, Lynne Ferguson¹¹⁷, Dr Emily J Goldstein¹¹⁷, Dr Alasdair Maclean¹¹⁷ and Dr Rachael Tomb¹¹⁷

Samples and logistics, and Software and analysis tools:

Dr Igor Starinskij⁵³

Sequencing and analysis, and Software and analysis tools:

Laura Thomson⁵, Joel Southgate^11,74, Dr Moritz UG Kraemer²³, Dr Jayna Raghwani²³, Dr Alex E Zarebski²³, Olivia Boyd³⁹, Lily Geidelberg³⁹, Dr Chris J Illingworth⁵², Dr Chris Jackson⁵², Dr David Pascall⁵², Dr Sreenu Vattipally⁵³, Timothy M Freeman¹⁰⁹, Dr Sharon N Hsu¹⁰⁹, Dr Benjamin B Lindsey¹⁰⁹, Dr Keith James¹¹⁶, Kevin Lewis¹¹⁶, Gerry Tonkin-Hill¹¹⁶ and Dr Jaime M Tovar-Corona¹¹⁶

Sequencing and analysis, and Visualisation:

MacGregor Cox²⁰

Software and analysis tools, and Visualisation:

Dr Khalil Abudahab^14,116, Mirko Menegazzo¹⁴, Ben EW Taylor MEng^14,116, Dr Corin A Yeats¹⁴, Afrida Mukaddas⁵³, Derek W Wright⁵³, Dr Leonardo de Oliveira Martins⁷⁵, Dr Rachel Colquhoun¹⁰⁴, Verity Hill¹⁰⁴, Dr Ben Jackson¹⁰⁴, Dr JT McCrone¹⁰⁴, Dr Nathan Medd¹⁰⁴, Dr Emily Scher¹⁰⁴ and Jon-Paul Keatley¹¹⁶

Leadership and supervision:

Dr Tanya Curran³, Dr Sian Morgan¹⁰, Prof Patrick Maxwell²⁰, Prof Ken Smith²⁰, Dr Sahar Eldirdiri²¹, Anita Kenyon²¹, Prof Alison H Holmes^38,57, Dr James R Price^38,57, Dr Tim Wyatt⁶⁹, Dr Alison E Mather⁷⁵, Dr Timofey Skvortsov⁷⁷ and Prof John A Hartley⁹⁶

Metadata curation:

Prof Martyn Guest¹¹, Dr Christine Kitchen¹¹, Dr Ian Merrick¹¹, Robert Munn¹¹, Dr Beatrice Bertolusso³³, Dr Jessica Lynch³³, Dr Gabrielle Vernet³³, Stuart Kirk³⁴, Dr Elizabeth Wastnedge⁵⁶, Dr Rachael Stanley⁵⁸, Giles Idle⁶⁴, Dr Declan T Bradley^69,77, Dr Jennifer Poyner⁷⁹ and Matilde Mori¹¹⁰

Project administration:

Owen Jones¹¹, Victoria Wright¹⁸, Ellena Brooks²⁰, Carol M Churcher²⁰, Mireille Fragakis²⁰, Dr Katerina Galai^20,70, Dr Andrew Jermy²⁰, Sarah Judges²⁰, Georgina M McManus²⁰, Kim S Smith²⁰, Dr Elaine Westwick²⁰, Dr Stephen W Attwood²³, Dr Frances Bolt^38,57, Dr Alisha Davies⁷⁴, Elen De Lacy⁷⁴, Fatima Downing⁷⁴, Sue Edwards⁷⁴, Lizzie Meadows⁷⁵, Sarah Jeremiah⁹⁷, Dr Nikki Smith¹⁰⁹ and Luke Foulser¹¹⁶

Samples and logistics:

Dr Themoula Charalampous^12,46, Amita Patel¹², Dr Louise Berry¹⁵, Dr Tim Boswell¹⁵, Dr Vicki M Fleming¹⁵, Dr Hannah C Howson-Wells¹⁵, Dr Amelia Joseph¹⁵, Manjinder Khakh¹⁵, Dr Michelle M Lister¹⁵, Paul W Bird¹⁶, Karlie Fallon¹⁶, Thomas Helmer¹⁶, Dr Claire L McMurray¹⁶, Mina Odedra¹⁶, Jessica Shaw¹⁶, Dr Julian W Tang¹⁶, Nicholas J Willford¹⁶, Victoria Blakey¹⁷, Dr Veena Raviprakash¹⁷, Nicola Sheriff¹⁷, Lesley-Anne Williams¹⁷, Theresa Feltwell²⁰, Dr Luke Bedford²⁶, Dr James S Cargill²⁷, Warwick Hughes²⁷, Dr Jonathan Moore²⁸, Susanne Stonehouse²⁸, Laura Atkinson²⁹, Jack CD Lee²⁹, Dr Divya Shah²⁹, Adela Alcolea-Medina^32,112, Natasha Ohemeng-Kumi^32,112, John Ramble^32,112, Jasveen Sehmi^32,112, Dr Rebecca Williams³³, Wendy Chatterton³⁴, Monika Pusok³⁴, William Everson³⁷, Anibolina Castigador⁴⁴, Emily Macnaughton⁴⁴, Dr Kate El Bouzidi⁴⁵, Dr Temi Lampejo⁴⁵, Dr Malur Sudhanva⁴⁵, Cassie Breen⁴⁷, Dr Graciela Sluga⁴⁸, Dr Shazaad SY Ahmad^49,70, Dr Ryan P George⁴⁹, Dr Nicholas W Machin^49,70, Debbie Binns⁵⁰, Victoria James⁵⁰, Dr Rachel Blacow⁵⁵, Dr Lindsay Coupland⁵⁸, Dr Louise Smith⁵⁹, Dr Edward Barton⁶⁰, Debra Padgett⁶⁰, Garren Scott⁶⁰, Dr Aidan Cross⁶¹, Dr Mariyam Mirfenderesky⁶¹, Jane Greenaway⁶², Kevin Cole⁶⁴, Phillip Clarke⁶⁷, Nichola Duckworth⁶⁷, Sarah Walsh⁶⁷, Kelly Bicknell⁶⁸, Robert Impey⁶⁸, Dr Sarah Wyllie⁶⁸, Richard Hopes⁷⁰, Dr Chloe Bishop⁷², Dr Vicki Chalker⁷², Dr Ian Harrison⁷², Laura Gifford⁷⁴, Dr Zoltan Molnar⁷⁷, Dr Cressida Auckland⁷⁹, Dr Cariad Evans^85,109, Dr Kate Johnson^85,109, Dr David G Partridge^85,109, Dr Mohammad Raza^85,109, Paul Baker⁸⁶, Prof Stephen Bonner⁸⁶, Sarah Essex⁸⁶, Leanne J Murray⁸⁶, Andrew I Lawton⁸⁷, Dr Shirelle Burton-Fanning⁸⁹, Dr Brendan AI Payne⁸⁹, Dr Sheila Waugh⁸⁹, Andrea N Gomes⁹¹, Maimuna Kimuli⁹¹, Darren R Murray⁹¹, Paula Ashfield⁹², Dr Donald Dobie⁹², Dr Fiona Ashford⁹³, Dr Angus Best⁹³, Dr Liam Crawford⁹³, Dr Nicola Cumley⁹³, Dr Megan Mayhew⁹³, Dr Oliver Megram⁹³, Dr Jeremy Mirza⁹³, Dr Emma Moles-Garcia⁹³, Dr Benita Percival⁹³, Megan Driscoll⁹⁶, Leah Ensell⁹⁶, Dr Helen L Lowe⁹⁶, Laurentiu Maftei⁹⁶, Matteo Mondani⁹⁶, Nicola J Chaloner⁹⁹, Benjamin J Cogger⁹⁹, Lisa J Easton⁹⁹, Hannah Huckson⁹⁹, Jonathan Lewis⁹⁹, Sarah Lowdon⁹⁹, Cassandra S Malone⁹⁹, Florence Munemo⁹⁹, Manasa Mutingwende⁹⁹, Roberto Nicodemi⁹⁹, Olga Podplomyk⁹⁹, Thomas Somassa⁹⁹, Dr Andrew Beggs¹⁰⁰, Dr Alex Richter¹⁰⁰, Claire Cormie¹⁰², Joana Dias¹⁰², Sally Forrest¹⁰², Dr Ellen E Higginson¹⁰², Mailis Maes¹⁰², Jamie Young¹⁰², Dr Rose K Davidson¹⁰³, Kathryn A Jackson¹⁰⁷, Dr Lance Turtle¹⁰⁷, Dr Alexander J Keeley¹⁰⁹, Prof Jonathan Ball¹¹³, Timothy Byaruhanga¹¹³, Dr Joseph G Chappell¹¹³, Jayasree Dey¹¹³, Jack D Hill¹¹³, Emily J Park¹¹³, Arezou Fanaie¹¹⁴, Rachel A Hilson¹¹⁴, Geraldine Yaze¹¹⁴ and Stephanie Lo¹¹⁶

Sequencing and analysis:

Safiah Afifi¹⁰, Robert Beer¹⁰, Joshua Maksimovic¹⁰, Kathryn McCluggage¹⁰, Karla Spellman¹⁰, Catherine Bresner¹¹, William Fuller¹¹, Dr Angela Marchbank¹¹, Trudy Workman¹¹, Dr Ekaterina Shelest^13,81, Dr Johnny Debebe¹⁸, Dr Fei Sang¹⁸, Dr Marina Escalera Zamudio²³, Dr Sarah Francois²³, Bernardo Gutierrez²³, Dr Tetyana I Vasylyeva²³, Dr Flavia Flaviani³¹, Dr Manon Ragonnet-Cronin³⁹, Dr Katherine L Smollett⁴², Alice Broos⁵³, Daniel Mair⁵³, Jenna Nichols⁵³, Dr Kyriaki Nomikou⁵³, Dr Lily Tong⁵³, Ioulia Tsatsani⁵³, Prof Sarah O'Brien⁵⁴, Prof Steven Rushton⁵⁴, Dr Roy Sanderson⁵⁴, Dr Jon Perkins⁵⁵, Seb Cotton⁵⁶, Abbie Gallagher⁵⁶, Dr Elias Allara^70,102, Clare Pearson^70,102, Dr David Bibby⁷², Dr Gavin Dabrera⁷², Dr Nicholas Ellaby⁷², Dr Eileen Gallagher⁷², Dr Jonathan Hubb⁷², Dr Angie Lackenby⁷², Dr David Lee⁷², Nikos Manesis⁷², Dr Tamyo Mbisa⁷², Dr Steven Platt⁷², Katherine A Twohig⁷², Dr Mari Morgan⁷⁴, Alp Aydin⁷⁵, David J Baker⁷⁵, Dr Ebenezer Foster-Nyarko⁷⁵, Dr Sophie J Prosolek⁷⁵, Steven Rudder⁷⁵, Chris Baxter⁷⁷, Sílvia F Carvalho⁷⁷, Dr Deborah Lavin⁷⁷, Dr Arun Mariappan⁷⁷, Dr Clara Radulescu⁷⁷, Dr Aditi Singh⁷⁷, Miao Tang⁷⁷, Helen Morcrette⁷⁹, Nadua Bayzid⁹⁶, Marius Cotic⁹⁶, Dr Carlos E Balcazar¹⁰⁴, Dr Michael D Gallagher¹⁰⁴, Dr Daniel Maloney¹⁰⁴, Thomas D Stanton¹⁰⁴, Dr Kathleen A Williamson¹⁰⁴, Dr Robin Manley¹⁰⁵, Michelle L Michelsen¹⁰⁵, Dr Christine M Sambles¹⁰⁵, Dr David J Studholme¹⁰⁵, Joanna Warwick-Dugdale¹⁰⁵, Richard Eccles¹⁰⁷, Matthew Gemmell¹⁰⁷, Dr Richard Gregory¹⁰⁷, Dr Margaret Hughes¹⁰⁷, Charlotte Nelson¹⁰⁷, Dr Lucille Rainbow¹⁰⁷, Dr Edith E Vamos¹⁰⁷, Hermione J Webster¹⁰⁷, Dr Mark Whitehead¹⁰⁷, Claudia Wierzbicki¹⁰⁷, Dr Adrienn Angyal¹⁰⁹, Dr Luke R Green¹⁰⁹, Dr Max Whiteley¹⁰⁹, Emma Betteridge¹¹⁶, Dr Iraad F Bronner¹¹⁶, Ben W Farr¹¹⁶, Scott Goodwin¹¹⁶, Dr Stefanie V Lensing¹¹⁶, Shane A McCarthy^116,102, Dr Michael A Quail¹¹⁶, Diana Rajan¹¹⁶, Dr Nicholas M Redshaw¹¹⁶, Carol Scott¹¹⁶, Lesley Shirley¹¹⁶ and Scott AJ Thurston¹¹⁶

Software and analysis tools:

Dr Will Rowe⁴³, Amy Gaskin⁷⁴, Dr Thanh Le-Viet⁷⁵, James Bonfield¹¹⁶, Jennifier Liddle¹¹⁶ and Andrew Whitwham¹¹⁶

¹Barking, Havering and Redbridge University Hospitals NHS Trust, ²Barts Health NHS Trust, ³Belfast Health & Social Care Trust, ⁴Betsi Cadwaladr University Health Board, ⁵Big Data Institute, Nuffield Department of Medicine, University of Oxford, ⁶Blackpool Teaching Hospitals NHS Foundation Trust, ⁷Bournemouth University, ⁸Cambridge Stem Cell Institute, University of Cambridge, ⁹Cambridge University Hospitals NHS Foundation Trust, ¹⁰Cardiff and Vale University Health Board, ¹¹Cardiff University, ¹²Centre for Clinical Infection and Diagnostics Research, Department of Infectious Diseases, Guy's and St Thomas' NHS Foundation Trust, ¹³Centre for Enzyme Innovation, University of Portsmouth, ¹⁴Centre for Genomic Pathogen Surveillance, University of Oxford, ¹⁵Clinical Microbiology Department, Queens Medical Centre, Nottingham University Hospitals NHS Trust, ¹⁶Clinical Microbiology, University Hospitals of Leicester NHS Trust, ¹⁷County Durham and Darlington NHS Foundation Trust, ¹⁸Deep Seq, School of Life Sciences, Queens Medical Centre, University of Nottingham, ¹⁹Department of Infectious Diseases and Microbiology, Cambridge University Hospitals NHS Foundation Trust, ²⁰Department of Medicine, University of Cambridge, ²¹Department of Microbiology, Kettering General Hospital, ²²Department of Microbiology, South West London Pathology, ²³Department of Zoology, University of Oxford, ²⁴Division of Virology, Department of Pathology, University of Cambridge, ²⁵East Kent Hospitals University NHS Foundation Trust, ²⁶East Suffolk and North Essex NHS Foundation Trust, ²⁷East Sussex Healthcare NHS Trust^{, 28}Gateshead Health NHS Foundation Trust, ²⁹Great Ormond Street Hospital for Children NHS Foundation Trust, ³⁰Great Ormond Street Institute of Child Health (GOS ICH), University College London (UCL), ³¹Guy's and St. Thomas’ Biomedical Research Centre, ³²Guy's and St. Thomas’ NHS Foundation Trust, ³³Hampshire Hospitals NHS Foundation Trust, ³⁴Health Services Laboratories, ³⁵Heartlands Hospital, Birmingham, ³⁶Hub for Biotechnology in the Built Environment, Northumbria University, ³⁷Hull University Teaching Hospitals NHS Trust, ³⁸Imperial College Healthcare NHS Trust, ³⁹Imperial College London, ⁴⁰Infection Care Group, St George’s University Hospitals NHS Foundation Trust, ⁴¹Institute for Infection and Immunity, St George’s University of London, ⁴²Institute of Biodiversity, Animal Health & Comparative Medicine, ⁴³Institute of Microbiology and Infection, University of Birmingham, ⁴⁴Isle of Wight NHS Trust, ⁴⁵King's College Hospital NHS Foundation Trust, ⁴⁶King's College London, ⁴⁷Liverpool Clinical Laboratories, ⁴⁸Maidstone and Tunbridge Wells NHS Trust, ⁴⁹Manchester University NHS Foundation Trust, ⁵⁰Microbiology Department, Buckinghamshire Healthcare NHS Trust, ⁵¹Microbiology, Royal Oldham Hospital, ⁵²MRC Biostatistics Unit, University of Cambridge, ⁵³MRC-University of Glasgow Centre for Virus Research, ⁵⁴Newcastle University, ⁵⁵NHS Greater Glasgow and Clyde, ⁵⁶NHS Lothian, ⁵⁷NIHR Health Protection Research Unit in HCAI and AMR, Imperial College London, ⁵⁸Norfolk and Norwich University Hospitals NHS Foundation Trust, ⁵⁹Norfolk County Council, ⁶⁰North Cumbria Integrated Care NHS Foundation Trust, ⁶¹North Middlesex University Hospital NHS Trust, ⁶²North Tees and Hartlepool NHS Foundation Trust, ⁶³North West London Pathology, ⁶⁴Northumbria Healthcare NHS Foundation Trust, ⁶⁵Northumbria University, ⁶⁶NU-OMICS, Northumbria University, ⁶⁷Path Links, Northern Lincolnshire and Goole NHS Foundation Trust, ⁶⁸Portsmouth Hospitals University NHS Trust, ⁶⁹Public Health Agency, Northern Ireland, ⁷⁰Public Health England, ⁷¹Public Health England, Cambridge, ⁷²Public Health England, Colindale, ⁷³Public Health Scotland, ⁷⁴Public Health Wales, ⁷⁵Quadram Institute Bioscience, ⁷⁶Queen Elizabeth Hospital, Birmingham, ⁷⁷Queen's University Belfast, ⁷⁸Royal Brompton and Harefield Hospitals, ⁷⁹Royal Devon and Exeter NHS Foundation Trust, ⁸⁰Royal Free London NHS Foundation Trust, ⁸¹School of Biological Sciences, University of Portsmouth, ⁸²School of Health Sciences, University of Southampton, ⁸³School of Medicine, University of Southampton, ⁸⁴School of Pharmacy & Biomedical Sciences, University of Portsmouth, ⁸⁵Sheffield Teaching Hospitals NHS Foundation Trust, ⁸⁶South Tees Hospitals NHS Foundation Trust, ⁸⁷Southwest Pathology Services, ⁸⁸Swansea University, ⁸⁹The Newcastle upon Tyne Hospitals NHS Foundation Trust, ⁹⁰The Queen Elizabeth Hospital King's Lynn NHS Foundation Trust, ⁹¹The Royal Marsden NHS Foundation Trust, ⁹²The Royal Wolverhampton NHS Trust, ⁹³Turnkey Laboratory, University of Birmingham, ⁹⁴ University College London Division of Infection and Immunity^{, 95}University College London Hospital Advanced Pathogen Diagnostics Unit^{, 96}University College London Hospitals NHS Foundation Trust, ⁹⁷University Hospital Southampton NHS Foundation Trust, ⁹⁸University Hospitals Dorset NHS Foundation Trust, ⁹⁹University Hospitals Sussex NHS Foundation Trust, ¹⁰⁰University of Birmingham, ¹⁰¹University of Brighton, ¹⁰²University of Cambridge, ¹⁰³University of East Anglia, ¹⁰⁴University of Edinburgh, ¹⁰⁵University of Exeter, ¹⁰⁶University of Kent, ¹⁰⁷University of Liverpool, ¹⁰⁸University of Oxford, ¹⁰⁹University of Sheffield, ¹¹⁰University of Southampton, ¹¹¹University of St Andrews, ¹¹²Viapath, Guy's and St Thomas' NHS Foundation Trust, and King's College Hospital NHS Foundation Trust, ¹¹³Virology, School of Life Sciences, Queens Medical Centre, University of Nottingham, ¹¹⁴Watford General Hospital, ¹¹⁵Wellcome Centre for Human Genetics, Nuffield Department of Medicine, University of Oxford, ¹¹⁶Wellcome Sanger Institute, ¹¹⁷West of Scotland Specialist Virology Centre, NHS Greater Glasgow and Clyde, ¹¹⁸Whittington Health NHS Trust.

Funding

The study was funded by the Department of Health and Social Care in England. Sequencing was provided through the COVID-19 Genomics UK Consortium (COG-UK) which is supported by funding from the Medical Research Council (MRC) part of UK Research & Innovation (UKRI), the National Institute of Health Research (NIHR) [grant code: MC_PC_19027], and Genome Research Limited, operating as the Wellcome Sanger Institute.

Author information

Authors and Affiliations

School of Public Health, Imperial College London, Norfolk Place, London, W2 1PG, UK
Oliver Eales, Haowei Wang, Barbara Bodinier, David Haw, Jakob Jonnerby, Christina Atchison, Deborah Ashby, Helen Ward, Steven Riley, Marc Chadeau-Hyam, Christl A. Donnelly & Paul Elliott
MRC Centre for Global Infectious Disease Analysis and Jameel Institute, Imperial College London, London, UK
Oliver Eales, Haowei Wang, David Haw, Jakob Jonnerby, Steven Riley, Christl A. Donnelly & Paul Elliott
Quadram Institute, Norwich, UK
Andrew J. Page & Leonardo de Oliveira Martins
MRC Centre for Environment and Health, School of Public Health, Imperial College London, London, UK
Barbara Bodinier & Marc Chadeau-Hyam
Department of Infectious Disease, Imperial College London, London, UK
Wendy Barclay, Graham Taylor & Graham Cooke
Imperial College Healthcare NHS Trust, London, UK
Graham Cooke, Helen Ward, Ara Darzi & Paul Elliott
National Institute for Health Research Imperial Biomedical Research Centre, London, UK
Graham Cooke, Helen Ward, Ara Darzi & Paul Elliott
Institute of Global Health Innovation, Imperial College London, London, UK
Ara Darzi
Department of Statistics, University of Oxford, Oxford, UK
Christl A. Donnelly
Health Data Research (HDR) UK, Imperial College London, London, UK
Paul Elliott
UK Dementia Research Institute Centre at Imperial, Imperial College London, London, UK
Paul Elliott

Authors

Oliver Eales
View author publications
You can also search for this author in PubMed Google Scholar
Andrew J. Page
View author publications
You can also search for this author in PubMed Google Scholar
Leonardo de Oliveira Martins
View author publications
You can also search for this author in PubMed Google Scholar
Haowei Wang
View author publications
You can also search for this author in PubMed Google Scholar
Barbara Bodinier
View author publications
You can also search for this author in PubMed Google Scholar
David Haw
View author publications
You can also search for this author in PubMed Google Scholar
Jakob Jonnerby
View author publications
You can also search for this author in PubMed Google Scholar
Christina Atchison
View author publications
You can also search for this author in PubMed Google Scholar
Deborah Ashby
View author publications
You can also search for this author in PubMed Google Scholar
Wendy Barclay
View author publications
You can also search for this author in PubMed Google Scholar
Graham Taylor
View author publications
You can also search for this author in PubMed Google Scholar
Graham Cooke
View author publications
You can also search for this author in PubMed Google Scholar
Helen Ward
View author publications
You can also search for this author in PubMed Google Scholar
Ara Darzi
View author publications
You can also search for this author in PubMed Google Scholar
Steven Riley
View author publications
You can also search for this author in PubMed Google Scholar
Marc Chadeau-Hyam
View author publications
You can also search for this author in PubMed Google Scholar
Christl A. Donnelly
View author publications
You can also search for this author in PubMed Google Scholar
Paul Elliott
View author publications
You can also search for this author in PubMed Google Scholar

Consortia

The COVID-19 Genomics UK (COG-UK) Consortium

Samuel C. Robson
, Thomas R. Connor
, Nicholas J. Loman
, Tanya Golubchik
, Rocio T. Martinez Nunez
, David Bonsall
, Andrew Rambaut
, Luke B. Snell
, Rich Livett
, Catherine Ludden
, Sally Corden
, Eleni Nastouli
, Gaia Nebbia
, Ian Johnston
, Katrina Lythgoe
, M. Estee Torok
, Ian G. Goodfellow
, Jacqui A. Prieto
, Kordo Saeed
, David K. Jackson
, Catherine Houlihan
, Dan Frampton
, William L. Hamilton
, Adam A. Witney
, Giselda Bucca
, Cassie F. Pope
, Catherine Moore
, Emma C. Thomson
, Ewan M. Harrison
, Colin P. Smith
, Fiona Rogan
, Shaun M. Beckwith
, Abigail Murray
, Dawn Singleton
, Kirstine Eastick
, Liz A. Sheridan
, Paul Randell
, Leigh M. Jackson
, Cristina V. Ariani
, Sónia Gonçalves
, Derek J. Fairley
, Matthew W. Loose
, Joanne Watkins
, Samuel Moses
, Sam Nicholls
, Matthew Bull
, Roberto Amato
, Darren L. Smith
, David M. Aanensen
, Jeffrey C. Barrett
, Dinesh Aggarwal
, James G. Shepherd
, Martin D. Curran
, Surendra Parmar
, Matthew D. Parker
, Catryn Williams
, Sharon Glaysher
, Anthony P. Underwood
, Matthew Bashton
, Nicole Pacchiarini
, Katie F. Loveson
, Matthew Byott
, Alessandro M. Carabelli
, Kate E. Templeton
, Thushan I. de Silva
, Dennis Wang
, Cordelia F. Langford
, John Sillitoe
, Rory N. Gunson
, Simon Cottrell
, Justin O’Grady
, Dominic Kwiatkowski
, Patrick J. Lillie
, Nicholas Cortes
, Nathan Moore
, Claire Thomas
, Phillipa J. Burns
, Tabitha W. Mahungu
, Steven Liggett
, Angela H. Beckett
, Matthew T. G. Holden
, Lisa J. Levett
, Husam Osman
, Mohammed O. Hassan-Ibrahim
, David A. Simpson
, Meera Chand
, Ravi K. Gupta
, Alistair C. Darby
, Steve Paterson
, Oliver G. Pybus
, Erik M. Volz
, Daniela de Angelis
, David L. Robertson
, Inigo Martincorena
, Louise Aigrain
, Andrew R. Bassett
, Nick Wong
, Yusri Taha
, Michelle J. Erkiert
, Michael H. Spencer Chapman
, Rebecca Dewar
, Martin P. McHugh
, Siddharth Mookerjee
, Stephen Aplin
, Matthew Harvey
, Thea Sass
, Helen Umpleby
, Helen Wheeler
, James P. McKenna
, Ben Warne
, Joshua F. Taylor
, Yasmin Chaudhry
, Rhys Izuagbe
, Aminu S. Jahun
, Gregory R. Young
, Claire McMurray
, Clare M. McCann
, Andrew Nelson
, Scott Elliott
, Hannah Lowe
, Anna Price
, Matthew R. Crown
, Sara Rey
, Sunando Roy
, Ben Temperton
, Sharif Shaaban
, Andrew R. Hesketh
, Kenneth G. Laing
, Irene M. Monahan
, Judith Heaney
, Emanuela Pelosi
, Siona Silviera
, Eleri Wilson-Davies
, Helen Fryer
, Helen Adams
, Louis du Plessis
, Rob Johnson
, William T. Harvey
, Joseph Hughes
, Richard J. Orton
, Lewis G. Spurgin
, Yann Bourgeois
, Chris Ruis
, Áine O’Toole
, Marina Gourtovaia
, Theo Sanderson
, Christophe Fraser
, Jonathan Edgeworth
, Judith Breuer
, Stephen L. Michell
, John A. Todd
, Michaela John
, David Buck
, Kavitha Gajee
, Gemma L. Kay
, Sharon J. Peacock
, David Heyburn
, Katie Kitchman
, Alan McNally
, David T. Pritchard
, Samir Dervisevic
, Peter Muir
, Esther Robinson
, Barry B. Vipond
, Newara A. Ramadan
, Christopher Jeanes
, Danni Weldon
, Jana Catalan
, Neil Jones
, Ana da Silva Filipe
, Chris Williams
, Marc Fuchs
, Julia Miskelly
, Aaron R. Jeffries
, Karen Oliver
, Naomi R. Park
, Amy Ash
, Cherian Koshy
, Magdalena Barrow
, Sarah L. Buchan
, Anna Mantzouratou
, Gemma Clark
, Christopher W. Holmes
, Sharon Campbell
, Thomas Davis
, Ngee Keong Tan
, Julianne R. Brown
, Kathryn A. Harris
, Stephen P. Kidd
, Paul R. Grant
, Li Xu-McCrae
, Alison Cox
, Pinglawathee Madona
, Marcus Pond
, Paul A. Randell
, Karen T. Withell
, Cheryl Williams
, Clive Graham
, Rebecca Denton-Smith
, Emma Swindells
, Robyn Turnbull
, Tim J. Sloan
, Andrew Bosworth
, Stephanie Hutchings
, Hannah M. Pymont
, Anna Casey
, Liz Ratcliffe
, Christopher R. Jones
, Bridget A. Knight
, Tanzina Haque
, Jennifer Hart
, Dianne Irish-Tavares
, Eric Witele
, Craig Mower
, Louisa K. Watson
, Jennifer Collins
, Gary Eltringham
, Dorian Crudgington
, Ben Macklin
, Miren Iturriza-Gomara
, Anita O. Lucaci
, Patrick C. McClure
, Matthew Carlile
, Nadine Holmes
, Christopher Moore
, Nathaniel Storey
, Stefan Rooke
, Gonzalo Yebra
, Noel Craine
, Malorie Perry
, Nabil-Fareed Alikhan
, Stephen Bridgett
, Kate F. Cook
, Christopher Fearn
, Salman Goudarzi
, Ronan A. Lyons
, Thomas Williams
, Sam T. Haldenby
, Jillian Durham
, Steven Leonard
, Robert M. Davies
, Rahul Batra
, Beth Blane
, Moira J. Spyer
, Perminder Smith
, Mehmet Yavus
, Rachel J. Williams
, Adhyana I. K. Mahanama
, Buddhini Samaraweera
, Sophia T. Girgis
, Samantha E. Hansford
, Angie Green
, Charlotte Beaver
, Katherine L. Bellis
, Matthew J. Dorman
, Sally Kay
, Liam Prestwood
, Shavanthi Rajatileka
, Joshua Quick
, Radoslaw Poplawski
, Nicola Reynolds
, Andrew Mack
, Arthur Morriss
, Thomas Whalley
, Bindi Patel
, Iliana Georgana
, Myra Hosmillo
, Malte L. Pinckert
, Joanne Stockton
, John H. Henderson
, Amy Hollis
, William Stanley
, Wen C. Yew
, Richard Myers
, Alicia Thornton
, Alexander Adams
, Tara Annett
, Hibo Asad
, Alec Birchley
, Jason Coombes
, Johnathan M. Evans
, Laia Fina
, Bree Gatica-Wilcox
, Lauren Gilbert
, Lee Graham
, Jessica Hey
, Ember Hilvers
, Sophie Jones
, Hannah Jones
, Sara Kumziene-Summerhayes
, Caoimhe McKerr
, Jessica Powell
, Georgia Pugh
, Sarah Taylor
, Alexander J. Trotter
, Charlotte A. Williams
, Leanne M. Kermack
, Benjamin H. Foulkes
, Marta Gallis
, Hailey R. Hornsby
, Stavroula F. Louka
, Manoj Pohare
, Paige Wolverson
, Peijun Zhang
, George MacIntyre-Cockett
, Amy Trebes
, Robin J. Moll
, Lynne Ferguson
, Emily J. Goldstein
, Alasdair Maclean
, Rachael Tomb
, Igor Starinskij
, Laura Thomson
, Joel Southgate
, Moritz U. G. Kraemer
, Jayna Raghwani
, Alex E. Zarebski
, Olivia Boyd
, Lily Geidelberg
, Chris J. Illingworth
, Chris Jackson
, David Pascall
, Sreenu Vattipally
, Timothy M. Freeman
, Sharon N. Hsu
, Benjamin B. Lindsey
, Keith James
, Kevin Lewis
, Gerry Tonkin-Hill
, Jaime M. Tovar-Corona
, MacGregor Cox
, Khalil Abudahab
, Mirko Menegazzo
, Ben E. W. Taylor MEng
, Corin A. Yeats
, Afrida Mukaddas
, Derek W. Wright
, Rachel Colquhoun
, Verity Hill
, Ben Jackson
, J. T. McCrone
, Nathan Medd
, Emily Scher
, Jon-Paul Keatley
, Tanya Curran
, Sian Morgan
, Patrick Maxwell
, Ken Smith
, Sahar Eldirdiri
, Anita Kenyon
, Alison H. Holmes
, James R. Price
, Tim Wyatt
, Alison E. Mather
, Timofey Skvortsov
, John A. Hartley
, Martyn Guest
, Christine Kitchen
, Ian Merrick
, Robert Munn
, Beatrice Bertolusso
, Jessica Lynch
, Gabrielle Vernet
, Stuart Kirk
, Elizabeth Wastnedge
, Rachael Stanley
, Giles Idle
, Declan T. Bradley
, Jennifer Poyner
, Matilde Mori
, Owen Jones
, Victoria Wright
, Ellena Brooks
, Carol M. Churcher
, Mireille Fragakis
, Katerina Galai
, Andrew Jermy
, Sarah Judges
, Georgina M. McManus
, Kim S. Smith
, Elaine Westwick
, Stephen W. Attwood
, Frances Bolt
, Alisha Davies
, Elen De Lacy
, Fatima Downing
, Sue Edwards
, Lizzie Meadows
, Sarah Jeremiah
, Nikki Smith
, Luke Foulser
, Themoula Charalampous
, Amita Patel
, Louise Berry
, Tim Boswell
, Vicki M. Fleming
, Hannah C. Howson-Wells
, Amelia Joseph
, Manjinder Khakh
, Michelle M. Lister
, Paul W. Bird
, Karlie Fallon
, Thomas Helmer
, Claire L. McMurray
, Mina Odedra
, Jessica Shaw
, Julian W. Tang
, Nicholas J. Willford
, Victoria Blakey
, Veena Raviprakash
, Nicola Sheriff
, Lesley-Anne Williams
, Theresa Feltwell
, Luke Bedford
, James S. Cargill
, Warwick Hughes
, Jonathan Moore
, Susanne Stonehouse
, Laura Atkinson
, Jack C. D. Lee
, Divya Shah
, Adela Alcolea-Medina
, Natasha Ohemeng-Kumi
, John Ramble
, Jasveen Sehmi
, Rebecca Williams
, Wendy Chatterton
, Monika Pusok
, William Everson
, Anibolina Castigador
, Emily Macnaughton
, Kate El Bouzidi
, Temi Lampejo
, Malur Sudhanva
, Cassie Breen
, Graciela Sluga
, Shazaad S. Y. Ahmad
, Ryan P. George
, Nicholas W. Machin
, Debbie Binns
, Victoria James
, Rachel Blacow
, Lindsay Coupland
, Louise Smith
, Edward Barton
, Debra Padgett
, Garren Scott
, Aidan Cross
, Mariyam Mirfenderesky
, Jane Greenaway
, Kevin Cole
, Phillip Clarke
, Nichola Duckworth
, Sarah Walsh
, Kelly Bicknell
, Robert Impey
, Sarah Wyllie
, Richard Hopes
, Chloe Bishop
, Vicki Chalker
, Ian Harrison
, Laura Gifford
, Zoltan Molnar
, Cressida Auckland
, Cariad Evans
, Kate Johnson
, David G. Partridge
, Mohammad Raza
, Paul Baker
, Stephen Bonner
, Sarah Essex
, Leanne J. Murray
, Andrew I. Lawton
, Shirelle Burton-Fanning
, Brendan A. I. Payne
, Sheila Waugh
, Andrea N. Gomes
, Maimuna Kimuli
, Darren R. Murray
, Paula Ashfield
, Donald Dobie
, Fiona Ashford
, Angus Best
, Liam Crawford
, Nicola Cumley
, Megan Mayhew
, Oliver Megram
, Jeremy Mirza
, Emma Moles-Garcia
, Benita Percival
, Megan Driscoll
, Leah Ensell
, Helen L. Lowe
, Laurentiu Maftei
, Matteo Mondani
, Nicola J. Chaloner
, Benjamin J. Cogger
, Lisa J. Easton
, Hannah Huckson
, Jonathan Lewis
, Sarah Lowdon
, Cassandra S. Malone
, Florence Munemo
, Manasa Mutingwende
, Roberto Nicodemi
, Olga Podplomyk
, Thomas Somassa
, Andrew Beggs
, Alex Richter
, Claire Cormie
, Joana Dias
, Sally Forrest
, Ellen E. Higginson
, Mailis Maes
, Jamie Young
, Rose K. Davidson
, Kathryn A. Jackson
, Lance Turtle
, Alexander J. Keeley
, Jonathan Ball
, Timothy Byaruhanga
, Joseph G. Chappell
, Jayasree Dey
, Jack D. Hill
, Emily J. Park
, Arezou Fanaie
, Rachel A. Hilson
, Geraldine Yaze
, Stephanie Lo
, Safiah Afifi
, Robert Beer
, Joshua Maksimovic
, Kathryn McCluggage
, Karla Spellman
, Catherine Bresner
, William Fuller
, Angela Marchbank
, Trudy Workman
, Ekaterina Shelest
, Johnny Debebe
, Fei Sang
, Marina Escalera Zamudio
, Sarah Francois
, Bernardo Gutierrez
, Tetyana I. Vasylyeva
, Flavia Flaviani
, Manon Ragonnet-Cronin
, Katherine L. Smollett
, Alice Broos
, Daniel Mair
, Jenna Nichols
, Kyriaki Nomikou
, Lily Tong
, Ioulia Tsatsani
, Prof Sarah O’Brien
, Steven Rushton
, Roy Sanderson
, Jon Perkins
, Seb Cotton
, Abbie Gallagher
, Elias Allara
, Clare Pearson
, David Bibby
, Gavin Dabrera
, Nicholas Ellaby
, Eileen Gallagher
, Jonathan Hubb
, Angie Lackenby
, David Lee
, Nikos Manesis
, Tamyo Mbisa
, Steven Platt
, Katherine A. Twohig
, Mari Morgan
, Alp Aydin
, David J. Baker
, Ebenezer Foster-Nyarko
, Sophie J. Prosolek
, Steven Rudder
, Chris Baxter
, Sílvia F. Carvalho
, Deborah Lavin
, Arun Mariappan
, Clara Radulescu
, Aditi Singh
, Miao Tang
, Helen Morcrette
, Nadua Bayzid
, Marius Cotic
, Carlos E. Balcazar
, Michael D. Gallagher
, Daniel Maloney
, Thomas D. Stanton
, Kathleen A. Williamson
, Robin Manley
, Michelle L. Michelsen
, Christine M. Sambles
, David J. Studholme
, Joanna Warwick-Dugdale
, Richard Eccles
, Matthew Gemmell
, Richard Gregory
, Margaret Hughes
, Charlotte Nelson
, Lucille Rainbow
, Edith E. Vamos
, Hermione J. Webster
, Mark Whitehead
, Claudia Wierzbicki
, Adrienn Angyal
, Luke R. Green
, Max Whiteley
, Emma Betteridge
, Iraad F. Bronner
, Ben W. Farr
, Scott Goodwin
, Stefanie V. Lensing
, Shane A. McCarthy
, Michael A. Quail
, Diana Rajan
, Nicholas M. Redshaw
, Carol Scott
, Lesley Shirley
, Scott A. J. Thurston
, Will Rowe
, Amy Gaskin
, Thanh Le-Viet
, James Bonfield
, Jennifier Liddle
& Andrew Whitwham

Contributions

PE and CAD are joint corresponding authors. OE, SR, MC-H, CAD and PE conceived the study and the analytical plan. OE and LdOM performed the statistical analyses. OE, HWang, DH, BB and JJ curated the data. CA, DA, WB, GT, GC, HW, AD provided insights into the study design and results interpretation. AJP generated the sequencing data. AD and PE obtained funding. All authors revised the manuscript for important intellectual content and approved the submission of the manuscript. PE had full access to the data and takes responsibility for the integrity of the data and the accuracy of the data analysis and for the decision to submit for publication. All authors read and approved the final manuscript.

Corresponding authors

Correspondence to Christl A. Donnelly or Paul Elliott.

Ethics declarations

Ethics approval and consent to participate

We obtained research ethics approval from the South Central-Berkshire B Research Ethics Committee (IRAS ID: 283787). All methods were carried out in accordance with relevant guidelines and regulations. Informed consent was obtained from all participants or their parent/guardian for minors. During initial registration for the study participants are asked “Are you willing to take part in this study?/Are you willing for your child to take part in this study?” with possible answers being “1. Yes, I want to take part in this study” or “2. No, I do not want to take part.”. Those who answered “2. No, I do not want to take part.” were not sent testing kits and did not participate further in the study. Full registration forms for all rounds of REACT-1 are available here: https://www.imperial.ac.uk/medicine/research-and-impact/groups/react-study/for-researchers/react-1-study-materials/.

Consent for publication

Not applicable.

Competing interests

We declare no competing interests.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Additional file 1.

REACT-1 sequence accession numbers for GISAID and the European Nucleotide Archive.

Additional file 2: Table S1.

Lineages detected in rounds 14 and 15 of REACT-1. Table S2. Estimates of Shannon diversity for England, and by region for rounds 14 and 15 of REACT-1. Table S3. Regional distribution of AY.4 (round 14 and round 15), AY.4.2 (round 15) and B.1.617.2 (round 15). Table S4. Raw numbers of all lineages by region for round 14 and 15 of REACT-1. Table S6. Estimated P-value for the presence of clustering for all lineages with more than a single sample in an individual round, for round 14 and 15 of REACT-1. Table S6. Distribution of AY.4 (round 14 and round 15), AY.4.2 (round 15) and B.1.617.2 (round 15) by age group. Table S7. Raw numbers of all lineages by age group for round 14 and 15 of REACT-1. Table S8. Estimated growth rate in the log odds of every lineage detected relative to all other lineages from round 14 to 15 of REACT-1. Table S9. Mean N- and E-gene Ct value for the eight most prevalent lineages as inferred from Gaussian regression. Table S10. Symptom status by lineage for the eight most prevalent lineages in rounds 14 and 15 of REACT-1. Table S11. Multivariable logistic regression models to determine the effect of the lineage AY.4.2 on the odds of an individual reporting any of the most predictive COVID-19 symptoms relative to AY.4. Table S12. Multivariable gaussian regression model to determine the effect of lineage and round of the study on mean mutation rate. Table S13. Average inter-region migration rate, inferred from a mugration model run on a time-resolved phylogenetic tree, for the periods of rounds 14 and 15, round 14 and round 15

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data.

Reprints and permissions

About this article

Cite this article

Eales, O., Page, A.J., de Oliveira Martins, L. et al. SARS-CoV-2 lineage dynamics in England from September to November 2021: high diversity of Delta sub-lineages and increased transmissibility of AY.4.2. BMC Infect Dis 22, 647 (2022). https://doi.org/10.1186/s12879-022-07628-4

Download citation

Received: 12 January 2022
Accepted: 04 July 2022
Published: 27 July 2022
DOI: https://doi.org/10.1186/s12879-022-07628-4

SARS-CoV-2 lineage dynamics in England from September to November 2021: high diversity of Delta sub-lineages and increased transmissibility of AY.4.2

Abstract

Background

Methods

Results

Conclusions

Background

Material and methods

Viral genome sequencing

Phylogeographic model

Statistical analyses

Results

Distribution by region and age

Detection of increasing sub-lineages

Differences in cycle threshold values

Differences in symptomatology

Phylogeographic analysis

Discussion

Other lineages

Limitations

Conclusions

Availability of data and materials

References

Acknowledgements

Funding

Author information

Authors and Affiliations

Consortia

The COVID-19 Genomics UK (COG-UK) Consortium

Contributions

Corresponding authors

Ethics declarations

Ethics approval and consent to participate

Consent for publication

Competing interests

Additional information

Publisher's Note

Supplementary Information

Additional file 1.

Additional file 2: Table S1.

Rights and permissions

About this article

Cite this article

Share this article

Keywords

BMC Infectious Diseases

Contact us