-
Notifications
You must be signed in to change notification settings - Fork 80
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
support/provide/require manifests for collections of signatures? #1352
Comments
would be nice to support manifests that are part of the collection, e.g.
|
information that could be part of manifests --
we'd want to figure out the signature vs sketch abstractions properly per #616. So, probably, two tables at least -
and the sketch would then contain a pointer to the actual hashes. |
an example of "optional and useful manifests that are not SBTs" - find signatures in SBTs by name #933 and more generally a |
the most useful manifest would be one(s) that supported all of the selector options so that no signatures would need to be loaded to select them |
I'm not quite sure how to name things, but wrt #1365, an alternate kind of manifest would be one that is a list of signatures to load from elsewhere. This is the "shopping cart" version of manifests, vs the "catalog" kind of manifest. This is something that future prefetch/search/etc could then output a la #1365. p.s. I kind of like the name 'picklist'. |
I still feel like the index description can serve as the manifest for a collection of signature. In this sense, a The benefit on this is that a LinearIndex turns into a base index that can be upgraded to an SBT or LCA/RevIndex/Greyhound as needed. In the LCA case the |
I think we are converging! I'm not 100% sure what the code would look like (I usually only know after writing it and then refactoring it a few times :) and am also curious about how #1374 fits in, since this is/was needed to provided location info instead of modifying (As an alternative, I was thinking about adding a |
Quick comment on supporting multiple indexes per storage: during #648 I added the |
in re
my experiences with It also seems reasonable for manifests to be an "expert level, use with caution" feature that we primarily use on large databases to support better performance. This is as opposed to picklists, which will be directly user-facing and should be robust in terms of reporting misuse or mistakes. |
Yep, I like this! In #1590, I add (also, trivial :) support for signature manifests to Zipfile collections. While the implementation is not very clean or generic, I think we can evolve
In the short term, supporting manifests on SBTs would be pretty easy as long as we don't try to allow multiple indices within an SBT. I would probably start by supporting separate manifest creation, storage, and loading as per the In the longer-term, some questions will arise -
|
hmm, maybe we want to start thinking about using Then the Could also add a Then manifests belong to |
discussed in #1599 and connected links. |
closing in favor of #1901, which has the leftover bits from here. |
In the ZipFileLinearIndex PR #1349 (comment), I'm thinking about signature manifests that provide some kind of metadata about the signature - ksize, moltype, tags, etc.
This could be used by selectors #1072 to avoid loading signatures that don't match the selector #1097 (comment).
The upsides to manifests are -
The downsides are -
My current take is that it would be good to support but not require this. We could prototype it for internal use for some of our larger databases, in particular, and start to build out the commands to create, store, and update manifests.
I guess this could then lead to a gradiation of collection/index storages:
.sig
files.The text was updated successfully, but these errors were encountered: