-
Notifications
You must be signed in to change notification settings - Fork 97
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
BA.2.10 sublineage with S:W64R, 141-144del, 243-244del, G446S, F486P, R493Q, S494P, P1143L (15 seq as of 2022-08-07, India) #898
Comments
4 new sequences from Telangana, India: |
Sequence just uploaded from from Odisha as well: EPI_ISL_14228411 This looks like another BA.2.75-like lineage (long branch length, mostly mutation in Spike, probably qualifies as a second generation variant). Stuck a monitoring label on it for now but think this should be prioritised for designation if we get many more sequences. |
Hey @thomasppeacock |
@shay671 it seems to me that it might have some QC issue? |
@shay671 @c19850727 yeah, it's a clear QC issue with deletions, seems to be very common in sequences from Odisha. "Deletions replaced with NNNs" is reasonably common and Usher knows how to deal with that, but something weirder is happening here: small sections of the genome adjacent to the deletion get duplicated, e.g. where the deletion at ORF1a:3675-3677 should be, there is instead a repeat of ORF1a:3672-3674, creating a little cluster of "mutations". And instead of T---------CA in S:24-27, Odisha samples usually have TCANNNNNNTCA, which is where the extra "mutations" at S:24 and S:27 are coming from. |
Sequence from Assam: EPI_ISL_14290857 thats 5 different Indian states now - going to recommend as this is clearly widespread and I think this is going to keep growing, and it has some pretty interesting properties. |
i can confirm,after monitoring it from the start via covspectrum, that the early growth rate of this one is apparently comparable to BA.2.75 and BA.2.38.3.X , hard to say more or less but i think a fast designation of this one is required. Also BA.2.75 was designated with just 12-18 sequences thx to the great early spot by @silcn |
3 more from Maharashtra: EPI_ISL_14291303, EPI_ISL_14291304, EPI_ISL_14291417 |
Fantastic work (as always) @silcn. Each state in India should be considered as a different country; it's like we found samples in 5 European countries with clear saltation and super converging mutations. By the way, @silcn - would love to work together on the analysis (I'm part of an Israeli team running an algorithm for saltation tracking that might integrate with your methods) contact through tweeter. |
This is a very interesting lineage. The second only among Omicrons which picked up ORF1b:G662S. The first was - guess what - BA.2.75. The only pre-Omicron VOC with fixed ORF1b:G662S was - guess what - Delta. |
@shay671 I don't have a twitter account and don't intend to get one. My methods are rather more manual than yours - you keep doing your thing. If a 2nd gen variant arises in Europe or the USA you're much more likely to spot it than I am. I see there is already a nickname floating around for this lineage. This is likely a futile request, but please could people hold off on this sort of thing until we have a clearer idea of this lineage's significance? I agree that e.g. BA.5 should have had its own name, but giving out nicknames too easily risks a "boy who cried wolf" effect. The same goes for estimates of the growth advantage based on a very small number of sequences from a country known to have large and inconsistent geographical biases in its coverage. I remember the good old days when BA.2.75 had a 900% advantage over BA.5. |
You are right, now is still an early time to get a nickname for this variant. |
Yes, I agree that BA.2.75 is deserving of a name. I have more reservations about the way it ended up getting its current nickname than about the nickname itself. I'm also not sure BA.2.75 would have got nearly as much attention in the media in early July if it hadn't got a "cool-sounding" nickname. In that sense, maybe it's a good thing it got nicknamed as soon as it did, because it helped draw more attention to it. On the other hand, I'm worried that if this becomes a trend then people might start giving names to every small cluster of sequences with a bunch of spike mutations, many of which won't end up spreading much - and if the media pick up on the names and start reporting on these too, maybe with a way-too-early growth advantage chart for good measure, then it'll become harder for readers to tell when something is actually worrying. |
@cvejris |
Does someone have a good GISAID query for this? I can only catch 12 out of 15 with the following: |
@corneliusroemer unfortunately every defining mutation is covered by NNNs in at least one sequence, so I don't think you can hit all 15. "Spike_G446S, Spike_F486P, NSP13_R392C" gets 14 of them; NSP13_R392C is only there to exclude BA.1 sequences. But I don't expect that'll work too well in the long term as it'll miss sequences with RBD dropouts. |
So do you have a combination of 2 or 3 that are 100% specific (at least within last 2 months) and also as sensitive as possible (in aggregate)? I worry that the Spike RBD is often dropout in Indian sequences. We probably miss at least 10% like this, the Spike_P1143L seems more robust? I guess one can use |
I don't feel like spending too much time coming up with the best 100% specific query when the answer will probably change within a week :) Agree that Spike_P1143L is likely to be most robust - the one sequence that missed it has a 3-nucleotide NNN at that codon only. Not very convenient that there's hardly anything to work with outside the spike. |
Another sequence popped up: EPI_ISL_14355872 from Mumbai Edit it seems to have a rare S:C432R |
@FedeGueli good spot. That S:C432R almost certainly isn't real - it's right next to a block of NNNs and Indian sequences frequently have erroneous mutations close to NNNs. Also EPI_ISL_14357352 from Odisha. |
Sequence from Singapore just uploaded as well: EPI_ISL_14358912 |
Alright, I'll make a PR for this considering it keeps popping up in new states and countries. |
Added new lineage BA.2.10.4 from #898 with 7 new sequence designations
Might I confirm this BA.2.10.4 lineage is already accepted and designated? |
Yes indeed - it's accepted and designated but cannot be assigned by pangolin or Nextclade yet (needs data update) |
@corneliusroemer Thank you so much. |
This keeps popping up across India, though the only non-Indian sequences are the sequence from a traveller in Singapore that's already been mentioned, and a sequence from Hong Kong that might also be from a traveller. At least 28 sequences total. |
I did an analysis with @shay671 to try and understand how many sequences of BA.2.10.4 there are worldwide and the geographic spread of this variant. all sequences that were found in my query were recognized in UShER as BA.2.10.4 except for this one: EPI_ISL_14478719 even though it has 7/9 defining mutations (not including deletions). I think it is BA.2.10.4 but because of bad quality was placed in a different part of the tree in UShER. all sequences were recognized as BA.2.10.4 in nextclade but none in pango here are the full results: |
@talyashe 27 including EPI_ISL_14478719, I take it? The 28th I had in mind was EPI_ISL_14291304. |
@silcn about EPI_ISL_14291304 in my opinion this sequence is very bad quality so you can't tell for sure. |
5 new sequences: France (1) Usa (Virginia 2) India (1 Telangana 1 Karnataka) all collected in August : EPI_ISL_14708679 |
New tree: https://nextstrain.org/fetch/genome.ucsc.edu/trash/ct/subtreeAuspice1_genome_41771_fd0270.json?branchLabel=aa%20mutations&c=pango_lineage_usher&label=nuc%20mutations:G4255A,T23018C,T23019C Main branch seems to be the Orf3a:G100C probably if this will continue to grow will be a good candidate for a sublineage proposal |
one more sequence from California. and it is part of the Orf3a:100C clade. |
One more sequence from Karnataka :EPI_ISL_14810256 |
4 seqs from MH- India collected between July and early August |
One more from Pune, Mh, India |
another new one from MH. 38 samples for gisaid Spike_F486P, NSP13_R392C |
5 from India (plus 4 from Uk spotted yesterday by @thomasppeacock ) |
4 new sequence from India |
1 new sequence from Pennsylvania (fourth samples from United States, from three different states) |
1 more sequence from England |
1 seqs from England total is 54 |
59 as today ( 5 new from India) |
Proposal for a sublineage of BA.2.10
Earliest sequence: 2022-06-24 (India-Maharashtra)
Countries detected: India (5 from Maharashtra, 1 from Karnataka)
Mutations on top of BA.2.10:
S:W64R, 141-144del, 243-244del(*), G446S, F486P, R493Q (reversion), S494P, P1143L
ORF1b:G662S
ORF7a:A105V
nuc:G4255A
(*) this is called as 242-243del by Nextclade, but it's clear from the nucleotides that it's really 243-244del. Both have the same effect (LAL -> L)
Sequences from this lineage were spotted independently by myself, @ryhisner and @zach-hensel. I'm proposing it due to its large number of Spike mutations, now that it's been detected in more than one Indian state. Lots of convergent evolution going on here, e.g. the ubiquitous R493Q reversion, ORF1b:G662S and S:G446S shared with BA.2.75, the 2-nucleotide mutation S:F486P shared with Constellation 1 from #844, S:243-244del and S:S494P shared with the Delta/BA.2 recombinant spotted by @c19850727 (#895)...
https://nextstrain.org/fetch/github.com/silcn/subtreeAuspice1/raw/main/auspice/subtreeAuspice1_genome_363bb_7e5310.json?branchLabel=Spike%20mutations&c=gt-S_64,446&label=nuc%20mutations:G15451A
Sequences:
EPI_ISL_13929780
EPI_ISL_14056750
EPI_ISL_14056762
EPI_ISL_14149647
EPI_ISL_14154126
EPI_ISL_14162192
Cov-spectrum query: https://cov-spectrum.org/explore/World/AllSamples/Past6M/variants?variantQuery=%5B6-of%3A+4255A%2C+ORF1b%3A662S%2C+S%3A64R%2C+S%3A446S%2C+S%3A486P%2C+S%3A494P%2C+S%3A1143L%2C+ORF7a%3A105V%5D&
The text was updated successfully, but these errors were encountered: