Linear genome/mutation visualizer #284

Closed
atc3 opened this issue Mar 16, 2021 · 0 comments · Fixed by #337
Labels: enhancement (New feature or request), viz (Data Visualization)

atc3 commented Mar 16, 2021

Intro

When we implement the 3D PDB viewer with a mutation heatmap, it would be good to have some sort of 2D visualization of mutations – mutations that are either lineage defining, or if possible, a continuous scale of mutation frequency mapped along some sort of 2D representation of the genome.

For reference here is what our 3D viewer will eventually look like:

Examples

Here are some attempts by others to map mutations onto a 2D representation of the genome:

New York Times Variant Tracker

The biggest problem with this visualization is that it doesn't fit entirely on my screen – I have to zoom out to capture all of it.

outbreak.info

Solution

Here are some ways I think we can make an improved 2D genome/mutation viewer. For now, let's ignore the case where we need to show mutation frequencies and focus on the use case where we just need to show the characteristic mutations for a lineage like B.1.1.7.

  • The viewer should have a switch between showing either genes or proteins. Showing both at the same time isn't that helpful and just makes things more confusing. We should probably default to proteins, since that level is more relevant when looking at AA mutations.
  • The genome organization itself is pretty limiting. To elaborate:
    • We shouldn't force ourselves to make the visual gene/protein sizes proportional to their actual size in bases. For example, in the outbreak.info example, some of the genes are so small that they can't even be labelled. And there are so few mutations in ORF1ab that the space dedicated to it is wasted.
    • We don't even have to be faithful to the order of the genes. Genome order helps people find their gene of interest, but only if they have memorized that order (I forget the order after Spike). It might even be more useful to have the genes/proteins in alphabetical order instead.
    • If we start to ignore the order of the genes, then we don't have to display them in a linear/circular connected layout anymore. We could move to a list-like layout, for example.
  • Mutation information should show both the reference and alternate bases/residues. We should also have a place to show the canonical "news article" name, e.g., E484K, so that mutations are easily recognizable.
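
To make the list-like layout idea a bit more concrete, here is a minimal sketch of the data shaping it implies: group a lineage's mutations by protein, order the proteins alphabetically instead of by genome position, and build the short "news article" name from the reference residue, position, and alternate residue. The `AAMutation` class and `list_layout` function are hypothetical names made up for this illustration, not part of any existing code, and the example mutations are only a small illustrative subset for B.1.1.7, not a complete list.

```python
from collections import defaultdict
from dataclasses import dataclass


@dataclass(frozen=True)
class AAMutation:
    """Hypothetical record for one amino acid substitution."""
    protein: str  # protein label, e.g. "S" for Spike
    ref: str      # reference residue
    pos: int      # 1-based residue position within the protein
    alt: str      # alternate residue

    @property
    def name(self) -> str:
        # Short name built from ref + position + alt, e.g. "E484K"
        return f"{self.ref}{self.pos}{self.alt}"


def list_layout(mutations):
    """Return one (protein, [mutation names]) row per protein.

    Proteins are ordered alphabetically (case-insensitive) rather than
    by genome order; mutations within a protein are ordered by position.
    """
    grouped = defaultdict(list)
    for mutation in mutations:
        grouped[mutation.protein].append(mutation)

    rows = []
    for protein in sorted(grouped, key=str.lower):
        ordered = sorted(grouped[protein], key=lambda m: m.pos)
        rows.append((protein, [m.name for m in ordered]))
    return rows


# Illustrative subset only, assumed here for demonstration purposes.
EXAMPLE = [
    AAMutation("S", "N", 501, "Y"),
    AAMutation("ORF8", "R", 52, "I"),
    AAMutation("S", "P", 681, "H"),
    AAMutation("N", "D", 3, "L"),
]

for protein, names in list_layout(EXAMPLE):
    print(f"{protein:<6} {', '.join(names)}")
```

Because `ref` and `alt` stay as separate fields, a renderer could show them individually next to the short name. Each row takes the same amount of space regardless of protein length, so small proteins stay labelable and a long region like ORF1ab doesn't take up space it doesn't need.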
@atc3 atc3 added enhancement New feature or request viz Data Visualization labels Mar 16, 2021
@atc3 atc3 closed this as completed in #337 Jun 24, 2021