Created and maintained by Dr Emma B Hodcroft, University of Bern. Please credit and link back to this site if you use this resource!
This page is under construction & more on the missing variants will be added soon.
These collection of Github pages is intended to provide an overview (not necessarily complete) of SARS-CoV-2 mutations that are of interest. It should be noted that these mutations are primarily of interest due to spread in Europe: this is simply a reflection that the primary maintainer/author (Emma Hodcroft) works mostly with European data.
The code used to generate these tables, graphs, and the sequences related to that mutation can be found in this repository.
The SARS-CoV-2 pandemic & research surrounding it is ongoing.
I will make every effort to try to keep this repository up-to-date, but readers should take care to double-check that the information is the latest available.
I welcome PR requests to this repository providing new links and information!
The more detail you can include in a pull request (PR) the faster I'll be able to review it.
If possible, provide a PR that adds/edits the appropriate links/etc, and I can merge it faster - if you can't do that, making an issue is fine, but I might be slower incorporating it.
Where is the 69/70 deletion section?
This deletion has arisen at least 3 times independently: in the S:Y453F
, S:N439K
, and S:N501Y
clusters.
There's also a recent preprint which suggests it may alter the recognition by antibodies, possibly impacting some antibody-therapy treatments, or immunity (Kemp et al. medRxiv).
So, there's a lot of interest in this variant.
For boring, technical, code reasons I do not yet have a dedicated 69/70 deletion build, but I'm working on it!
Overview of all mutation tables & graphs
Overview of all mutation country plots
Clusters/mutations are listed below by the location of a mutation in the spike protein (S:
) - the letter after :
indicates the original amino-acid, the number the position in the spike protein, and the last letter, the 'new' amino-acid.
As S:N501
has multiple amino-acid mutations, there is no second letter.
20A.EU1 and 20A.EU2, because of their prominence, have been given 'subclade' names.
The mutation is listed in parentheses after the name.
Figure made via GISAID
Dedicated 20A.EU1 Nextstrain build
Table and charts of mutation distribution
- Defining mutations:
- Nonsynonymous:
S:A222V
;ORF10:V30L
;N:A220V
orORF14:L67F
(overlapping reading frame withN
) - Synonymous:
T445C
,C6286T
,C26801G
- Nonsynonymous:
S:A222V
- Mutation in the non-terminal domain (NTD), which is not known to play a direct role in receptor binding or membrane fusion
- Associated with a cluster that initially expanded in Spain and spread across Europe via holiday travel (see Hodcroft et al preprint)
Figure made via GISAID
Dedicated 20A.EU2 Nextstrain build
Table and charts of mutation distribution
- Note this cluster is only the European appearance of S:477N
- Defining mutations:
- Nonsynonymous:
S:S477N
;N:M234I
,A376T
;ORF1b:A176S
,V767L
,K1141R
,E1184D
- Synonymous:
C4543T
,G5629T
,C11497T
,T26876C
- Nonsynonymous:
S:S477N
- Mutation is in the receptor binding domain (RDB), important to ACE2 binding and antibody recognition
- Has arisen independently in Australia and was responsible for much of the summer 2020 outbreak (Link to Nextstrain build)
- May slightly increase ACE2 binding: Chen et al. JMB; see also Bloom Lab ACE2 binding website
- May confer resistance to antibodies: Gaebler et al. bioRxiv
Figure made via GISAID
Dedicated S:N439K Nextstrain build
Table and charts of mutation distribution
- Defining mutations:
- Nonsynonymous:
S:N439K
;ORF1a:I2501T
- Synonymous:
C8047T
- Nonsynonymous:
S:N439K
- Mutation is in the receptor binding domain (RDB), important to ACE2 binding and antibody recognition
- About 2/3 of the sequences in the cluster have deletions at Spike amino-acid positions 69/70 (Nextstrain build with deletions in cluster highlighted)
- Has emerged twice independently in Europe, but was exclusive to Scotland in the first wave and went extinct: Thompson et al. bioRxiv
- May increase ACE2 binding: Thompson et al. bioRxiv; see also [Bloom Lab ACE2 binding website](https://jbloomlab.github.io/ SARS-CoV-2-RBD_DMS/)
- May confer resistance to antibodies: C135 (Weisblum et al. eLife, Barnes et al. Nature); a panel of antibodies (Thompson et al. bioRxiv)
Figure made via GISAID
Dedicated S:Y453F Nextstrain build
Table and charts of mutation distribution
- Defining mutations:
- Has appeared multiple times independently: each can be associated with different accompanying mutations
S:Y453F
- Mutation is in the receptor binding domain (RDB), important to ACE2 binding and antibody recognition
- Associated with the 'cluster 5' 'mink' variant that led to some alarm in Denmark in autumn 2020
- This variant has the following additional spike mutations: 60/70 deletion,
I692V
andM1229I
- This variant has the following additional spike mutations: 60/70 deletion,
- It is has also been seen previously in mink in the Netherlands (example Nextstrain build)
- May be a mink-specific adaptation, increasing binding to mink ACE2: (Rodrigues et al. PloS Comp Bio); and appearing multiple times (van Dorp et al. bioRxiv)
- May also increase ACE2 binding in humans: Bloom Lab ACE2 binding website
- May confer resistance to an antibody in the Regeneron cocktail: REGN10933 (Baum et al. Science)
Figure made via GISAID
Dedicated S:S98F Nextstrain build
Table and charts of mutation distribution
- Defining mutations:
- Nonsynonymous:
S:S98F
;N:P199L
orORF14:Q46*
(overlapping reading frames);ORF3a:Q38R
,G172R
,V202L
- Synonymous: C28651T
- Nonsynonymous:
S:S98F
- Mostly found in Belgium and the Netherlands - predominantly Belgium
More coming soon
Figure made via GISAID
Dedicated S:D80Y Nextstrain build
Table and charts of mutation distribution
More coming soon
Figure made via GISAID
Dedicated S:N501 Nextstrain build
Table and charts of mutation distribution
- Defining mutations:
- Has appeared multiple times independently: each can be associated with different accompanying mutations
- Amino-acid changes are
N501Y
(nucleotide mutationA23063T
),N501T
(nucleotide mutationA23064C
), andN501S
(nucleotide mutationA23064G
)
S:N501
- Mutation is in the receptor binding domain (RDB), important to ACE2 binding and antibody recognition
- Associated with a recently reported 'new variant' announced in the South East of England on 14 Dec 2020 (COG-UK Report)
- This particular variant is associated with multiple mutations in Spike, including:
N501Y
, a deletion at 69/70 (as seen inS:N439K
&S:Y453F
), andP681H
- This particular variant is associated with multiple mutations in Spike, including:
- Clusters also seen in USA, South Africa, & Australia
- May be associated with adaptation to rodents and mustelids:
N501T
in ferrets (Richard et al. Nature Comm.);N501Y
in mice (Gu et al. Science)- Some have speculated of risk of a persistent reservoir in wild rodents/mustelids
- May increase ACE2 binding: Bloom Lab ACE2 binding website
N501Y
was found in longitudinally-collected samples from an immunocompromised patient (Choi et al. NEJM)
Figure made via GISAID
Dedicated S:A626S Nextstrain build
Table and charts of mutation distribution
More coming soon
Figure made via GISAID
Dedicated S:V1122L Nextstrain build
Table and charts of mutation distribution
More coming soon
Figure made via GISAID
Note this figure shows both the 69 & 70 deletion.
Dedicated S:H69- Nextstrain build
Table and charts of mutation distribution
Important: Currently this build detects only the deletion at position 69 in spike, as due to alignment/calling differences, detecting the deletion at position 70 is less reliable. However, they seem to be highly associated.
More coming soon