Sparse profile #15

Merged
merged 2 commits into from
Jan 8, 2021


MatthewRalston
Owner

Adds a new feature, the sparse profile, and includes neighbor metadata on each row of the k-mer database. The metadata is expected to increase the fixed storage cost per line by a factor of about 10. That constant multiplies the exponential term in the storage needed for the final graph. This also fixes a bug in kmerdb.fileutil.KDBReader.slurp() where the full profile was not loading properly. Some of the added code may need refactoring.
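To make the interaction between the per-line constant and the exponential concrete, here is a back-of-the-envelope sketch. It assumes a full profile stores one row for each of the 4^k possible k-mers. The byte counts are hypothetical placeholders chosen to show a 10x increase, not measurements of kmerdb files, and `full_profile_bytes` is an illustrative helper, not part of kmerdb.

```python
def full_profile_bytes(k: int, bytes_per_row: int) -> int:
    """Rough size of a full profile: one row per possible k-mer (4**k rows)."""
    return (4 ** k) * bytes_per_row


# Hypothetical per-row costs: 10 bytes without neighbor metadata,
# 100 bytes with it (the ~10x increase mentioned above).
for k in (8, 12):
    before = full_profile_bytes(k, 10)
    after = full_profile_bytes(k, 100)
    print(f"k={k}: {before} bytes -> {after} bytes")
```

The ratio between the two sizes stays at 10 for every k, but the absolute difference grows with 4^k, which is why a larger per-row constant matters more as k increases.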

Anyway, the idea behind the sparse profile is that we don't need to store connectivity for the part of the full k-mer space that never appears in the dataset. This should keep file sizes down when the dataset covers only a small fraction of the k-mer sequence space.
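The difference between a full and a sparse profile can be sketched in a few lines of plain Python. This is a minimal illustration of the idea only: the function names and the (k-mer, count) row layout are assumptions made for the example, and it does not reflect the kmerdb file format or the neighbor metadata.

```python
from collections import Counter
from itertools import product


def count_kmers(seq: str, k: int) -> Counter:
    """Count every overlapping k-mer in seq."""
    return Counter(seq[i:i + k] for i in range(len(seq) - k + 1))


def full_profile(counts: Counter, k: int) -> list:
    """One (kmer, count) row for each of the 4**k possible k-mers, zeros included."""
    rows = []
    for letters in product("ACGT", repeat=k):
        kmer = "".join(letters)
        rows.append((kmer, counts.get(kmer, 0)))
    return rows


def sparse_profile(counts: Counter, k: int) -> list:
    """Only the rows for k-mers that actually occur in the data."""
    return [(kmer, n) for kmer, n in full_profile(counts, k) if n > 0]


counts = count_kmers("ACGTACGT", 3)
print(len(full_profile(counts, 3)), "rows in the full profile")
print(len(sparse_profile(counts, 3)), "rows in the sparse profile")
```

For this toy sequence only 4 of the 64 possible 3-mers are observed, so the sparse profile drops the other 60 rows.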

…ed to matrix processing. Implements sparse profiles and sparse slurping. Adds new dependency on rdflib, since eventually we may want to port the database to a graph database, or be able to export it.
@MatthewRalston MatthewRalston added bug Something isn't working enhancement New feature or request labels Jan 8, 2021
@MatthewRalston MatthewRalston self-assigned this Jan 8, 2021
@MatthewRalston MatthewRalston merged commit 8b5841a into master Jan 8, 2021
@MatthewRalston MatthewRalston deleted the sparse_profile branch January 8, 2021 01:45
MatthewRalston added a commit that referenced this pull request Jan 14, 2021
MatthewRalston added a commit that referenced this pull request May 1, 2022
Labels
bug Something isn't working enhancement New feature or request
Projects
Bugs
  
Awaiting triage
Features v1.0
  
Awaiting triage
Development


1 participant