Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Add human reference genome to prepared databases #2717

Open
dportik opened this issue Aug 16, 2023 · 5 comments
Open

Add human reference genome to prepared databases #2717

dportik opened this issue Aug 16, 2023 · 5 comments

Comments

@dportik
Copy link

dportik commented Aug 16, 2023

Hi Titus et al,
Given the recent fiasco related to mapping reads to microbial databases without human references (links at bottom), it might be a good time to create a small human genome database for use with sourmash. A standalone database on the database page would be ideal, so that researchers can include with the other databases of interest.

Thanks for considering!

social media discussion: https://twitter.com/StevenSalzberg1/status/1686350449069244416
pre-print: https://doi.org/10.1101/2023.07.28.550993

@luizirber
Copy link
Member

On the "raw" side 1 there are both GRCh38.p14 and T2T-CHM13v2.0 signatures in wort, would that work?

Footnotes

  1. just downloaded the data and calculated a signature, no other pre-processing like repeat masking

@dportik
Copy link
Author

dportik commented Aug 22, 2023

Yep! Those should be plenty.

@ctb ctb changed the title Add human reference to prepared databases Add human reference genome to prepared databases Sep 28, 2023
@ctb
Copy link
Contributor

ctb commented May 11, 2024

Repo to sketch hg38, including all unmapped chromosomes: https://github.com/ctb/2024-human-sketch

@ctb
Copy link
Contributor

ctb commented May 11, 2024

note: decontaminating human WGS samples, #3151

@ctb
Copy link
Contributor

ctb commented May 11, 2024

download at: https://farm.cse.ucdavis.edu/~ctbrown/sourmash-db/hg38/hg38-entire.sig.zip

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants