-
Notifications
You must be signed in to change notification settings - Fork 9
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Ground truth computation is too slow for realistic data set sizes #30
Labels
Comments
dkoslicki
added a commit
that referenced
this issue
Apr 6, 2020
dkoslicki
added a commit
that referenced
this issue
Apr 6, 2020
dkoslicki
added a commit
that referenced
this issue
Apr 6, 2020
dkoslicki
added a commit
that referenced
this issue
Apr 6, 2020
dkoslicki
added a commit
that referenced
this issue
Apr 6, 2020
dkoslicki
added a commit
that referenced
this issue
Apr 6, 2020
…allelization since it doesn't give its temp files unique names. Got around this by using RAM only mode. Added ability to compute all training kmers via KMC. #30
dkoslicki
added a commit
that referenced
this issue
Apr 6, 2020
dkoslicki
added a commit
that referenced
this issue
Apr 7, 2020
…ces observed between it and the pure python version. Investigating now #30
dkoslicki
added a commit
that referenced
this issue
Apr 7, 2020
…sults agree now. Refactoring now #30
dkoslicki
added a commit
that referenced
this issue
Apr 7, 2020
dkoslicki
added a commit
that referenced
this issue
Apr 7, 2020
dkoslicki
added a commit
that referenced
this issue
Apr 7, 2020
dkoslicki
added a commit
that referenced
this issue
Apr 7, 2020
merged into master and completed. Closing. @ShaopengLiu1 should be ready for you to use |
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
In the current
CMash/GroundTruth.py
, since it's using a naive python set calculation, this is much too slow to get results in a reasonable amount of time on larger data sets. Will need to switch to using KMC many times over to calculate the actual ground truth.Work being done on
groundtruth
branch.kmc_tools intersect
and divide by total number of distinct training database k-mersThe text was updated successfully, but these errors were encountered: