Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Document where the reference sequences came from #441

Closed
9 tasks done
donkirkby opened this issue Jun 5, 2018 · 1 comment
Closed
9 tasks done

Document where the reference sequences came from #441

donkirkby opened this issue Jun 5, 2018 · 1 comment
Labels
config change Requires a configuration change during release. enhancement

Comments

@donkirkby
Copy link
Member

donkirkby commented Jun 5, 2018

It's useful to be able to answer questions about the reference sequences, so we should at least have a list of the accession numbers. It might also be helpful to have a script that can fetch them all.

As a start, here's a list of the HCV references from the FDA guidance (PDF):

HCV Genotype Reference Strain GenBank Accession ID
Genotype 1a H77 NC_004102
Genotype 1b Con1 AJ238799
Genotype 2 JFH-1 AB047639
Genotype 3 S52 GU814263
Genotype 4 ED43 GU814265
Genotype 5 SA13 AF064490
Genotype 6 EUHK2 Y12083
  • Compare HIV seeds to HIV compendium.
  • Check if we used all references from the HIV compendium.
  • Compare HIV coordinate references to HXB2.
  • Explain changes from HXB2.
  • Compare HCV references.
  • Compare HLA references.
  • Compare V3LOOP reference used in G2P .
  • Remove ERCC references.
  • Use HIV Consensus B for resistance wild types.

Configuration changes:

  • Upload projects to QAI's pipeline configuration. There are changes in HIV1B-vpr and several HCV seed and coordinate references, plus we removed the ERCC project.
@donkirkby
Copy link
Member Author

I think the HIV references are HXB2 (Accession ID K03455).

@donkirkby donkirkby added the config change Requires a configuration change during release. label Apr 8, 2019
donkirkby added a commit that referenced this issue Apr 11, 2019
Remove ERCC references, because we don't need to support them anymore.
donkirkby added a commit that referenced this issue Apr 16, 2019
There were gaps between some of the gene regions, and a few of the seeds had small differences. The HCV-6n seed had a more significant change that will need to be validated.
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
config change Requires a configuration change during release. enhancement
Projects
None yet
Development

No branches or pull requests

1 participant