Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

divide protein blast databases #3

Open
3 tasks
jvollme opened this issue Nov 30, 2021 · 0 comments
Open
3 tasks

divide protein blast databases #3

jvollme opened this issue Nov 30, 2021 · 0 comments
Assignees
Labels
enhancement New feature or request

Comments

@jvollme
Copy link
Collaborator

jvollme commented Nov 30, 2021

Divide protein blast databases into smaller subsets (similar to nucleotide dbs).
possibilities:

  1. seperate by component sub-db (gtdb or refseq_eukaryote/virus)
  2. seperate into roughly equal numbers of proteins
  • Test if this speeds up blasts when blasting one-by-one, always devoting all threads to each single db
  • Test if this speeds up blasts when blasting all simultaneously, dividing threads over all dbs (when using option 1: prioitize larer sub-DBs)
  • select and permanentlyy implement the faster option
@jvollme jvollme added the enhancement New feature or request label Nov 30, 2021
@jvollme jvollme added this to the Improvements A milestone Nov 30, 2021
@jvollme jvollme self-assigned this Nov 30, 2021
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
enhancement New feature or request
Projects
None yet
Development

No branches or pull requests

1 participant