Index with seeds #127

Closed
wants to merge 368 commits into from

ctb
Member

@ctb ctb commented Aug 27, 2013

No description provided.

jasonpell and others added 30 commits August 20, 2012 13:35
(Thanks to Eric Norige for noticing this. Somehow I overlooked it....)
Use multiple threads in the 'HashTable' test driver.
(TODO: Add O_DIRECT support for raw streams from regular files.)
(Could've sworn that I had already started that task previously....)
Clean up code: tighten down a conditional compilation, remove a stray OMP directive.
Improve performance: precalculate MAX_COUNT and MAX_BIGCOUNT and store with HashTable object.
Improve efficiency of DNA string validator.
…eep-reads2.

'sweep-reads2' loads input reads into a hashbits table and then "sweeps" reads
out of another file if they have a median k-mer count of 1, i.e. if half or more
of their k-mers are present in the hashbits table.  Since the median k-mer
count function would have been identical in both the Hashbits and CountingHash
subclasses of Hashtable, I promoted it to a base class function.

One use of this is to use partitions from diginormed data to recover
the full undiginormed members of the partitions from the original data.
It can also be used as an approximate way to pull out reads from one
data set that overlap with reads in another data set, although cumulative
diginorm + partitioning is a better way to do that if you have the memory.
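
A minimal Python sketch of the sweep described in this commit message, for illustration only. It is not the code from this pull request: a plain `set` stands in for the hashbits table (which is a probabilistic presence table), all function names are hypothetical, reverse complements are ignored, and the median is assumed to be the middle element of the sorted 0/1 presence values.

```python
def kmers(seq, k):
    """Return every k-mer of seq, in order (empty if seq is shorter than k)."""
    return [seq[i:i + k] for i in range(len(seq) - k + 1)]


def build_presence_table(reads, k):
    """Collect the k-mers of the input reads.

    Assumption: an exact set replaces the hashbits table, so there are
    no false positives here.
    """
    table = set()
    for read in reads:
        table.update(kmers(read, k))
    return table


def median_presence(read, table, k):
    """Median of the 0/1 presence values of the read's k-mers.

    Assumption: the median is the element at index len // 2 of the
    sorted values, so it is 1 when half or more of the k-mers are present.
    """
    values = sorted(1 if km in table else 0 for km in kmers(read, k))
    if not values:
        return 0  # read shorter than k: nothing to look up
    return values[len(values) // 2]


def sweep_reads(candidates, table, k):
    """Keep the candidate reads whose median presence value is 1."""
    return [r for r in candidates if median_presence(r, table, k) == 1]
```

In this sketch, a read sharing most of its k-mers with the loaded reads is swept out, while an unrelated read or one shorter than k is left behind.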
(Bug: Threads hang if the code is not built with internal tracing enabled. Need to fix.)
ctb and others added 28 commits May 31, 2013 12:01
Change 'maxErrorRegion' in Jason's aligner to be an unsigned int.
@ctb
Member Author

ctb commented Aug 27, 2013

Argh, got bitten by a github bug. Grr. Redoing.

@ctb ctb closed this Aug 27, 2013

6 participants