Sparse profile #15

MatthewRalston · 2021-01-08T01:35:06Z

Adds a new feature, referred to as the sparse profile. Also includes neighbor metadata on each row of the k-mer database. This is expected to increase the fixed rate per line by a factor of about 10. That constant plays with the exponential in the final graph storage space needed. This also fixed a bug in kmerdb.fileutil.KDBReader.slurp() where the full profile was not loading properly. We added some code that could need refactoring.

Anyways, the idea behind the sparse profile is that we don't need to store connectivity for a subgraph of the full k-mer space that does not exist in the dataset in any capacity. This will help restrict the size of files for very sparse settings in the k - sequence space.

…ed to matrix processing. Implements sparse profiles and sparse slurping. Adds new dependency on rdflib, since eventually we may want to port the database to a graph database, or be able to export it.

Sparse profile

MatthewRalston added 2 commits January 7, 2021 14:31

Preparing for new feature branch called sparse_profile.

acf2fa3

Introduced and fixed a bug in kmerdb/fileutil.KDBReader.slurp() relat…

0ea65f9

…ed to matrix processing. Implements sparse profiles and sparse slurping. Adds new dependency on rdflib, since eventually we may want to port the database to a graph database, or be able to export it.

MatthewRalston added bug Something isn't working enhancement New feature or request labels Jan 8, 2021

MatthewRalston self-assigned this Jan 8, 2021

MatthewRalston merged commit 8b5841a into master Jan 8, 2021

MatthewRalston deleted the sparse_profile branch January 8, 2021 01:45

MatthewRalston added a commit that referenced this pull request Jan 14, 2021

Merge pull request #15 from MatthewRalston/sparse_profile

d033f88

Sparse profile

MatthewRalston added a commit that referenced this pull request May 1, 2022

Merge pull request #15 from MatthewRalston/sparse_profile

c2038d2

Sparse profile

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Sparse profile #15

Sparse profile #15

MatthewRalston commented Jan 8, 2021

Sparse profile #15

Sparse profile #15

Conversation

MatthewRalston commented Jan 8, 2021