102 - persist bundle metadata in separate table #185
Merged
Description
Persist bundle metadata in separate table.
Related Issue(s)
Closes #102, possibly #37 as well.
Changes Made
List the main changes made in this PR. Be as specific as possible.
Checklist
Before submitting this PR, please make sure that:
Additional Information
- Added a new table (`bundle`). This is linked to `dataset` via `dataset_id`.
- [...] `dataset.bundle_size` have been updated to use the size from the new `bundle` table.
- `bundle_alias` is used to obfuscate the actual location of the bundle on Slate-scratch. I am not reusing the `stage_alias` for this because the `stage_alias` directory symlinks to the top-level directory inside the dataset, instead of the dataset's directory. Downloading the staged dataset with that approach would therefore end up with the bundle inside the dataset. `bundle_alias` is the alias provided to users who attempt to download bundles.

I have also added a new script (`populate_bundles.py`) that [...] `hsi` utility) from SDA. [...] `sync_archived_bundles` workflow on these datasets, which runs the archive (which populates the bundle metadata in the `bundle` table), stage, validate, and setup_download steps on each of them, thus preparing them for download.
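
To illustrate the table relationship described above, here is a minimal sketch using Python's built-in `sqlite3` module. It is not the schema from this PR: only the `bundle` table name, `dataset_id`, `bundle_alias`, and the idea of reading the size from `bundle` come from the description. The column names `name` and `size`, the one-bundle-per-dataset constraint, the UUID-based alias, and the helpers `add_bundle` and `bundle_size` are assumptions made for the example.

```python
import sqlite3
import uuid

# In-memory database purely for illustration; the real project may use a
# different database and ORM.
conn = sqlite3.connect(":memory:")
conn.execute("PRAGMA foreign_keys = ON")
conn.executescript(
    """
    CREATE TABLE dataset (
        id   INTEGER PRIMARY KEY,
        name TEXT NOT NULL
    );
    -- Hypothetical layout: bundle metadata lives in its own table and points
    -- back to its dataset via dataset_id (assumed one bundle per dataset).
    CREATE TABLE bundle (
        id           INTEGER PRIMARY KEY,
        dataset_id   INTEGER NOT NULL UNIQUE REFERENCES dataset(id),
        name         TEXT NOT NULL,
        size         INTEGER NOT NULL,
        bundle_alias TEXT NOT NULL UNIQUE
    );
    """
)


def add_bundle(conn, dataset_id, name, size):
    """Insert bundle metadata and return an opaque alias (assumed: random UUID)."""
    alias = uuid.uuid4().hex  # reveals nothing about the on-disk location
    conn.execute(
        "INSERT INTO bundle (dataset_id, name, size, bundle_alias) "
        "VALUES (?, ?, ?, ?)",
        (dataset_id, name, size, alias),
    )
    return alias


def bundle_size(conn, dataset_id):
    """Read the bundle size from the bundle table rather than from dataset."""
    row = conn.execute(
        "SELECT size FROM bundle WHERE dataset_id = ?", (dataset_id,)
    ).fetchone()
    return row[0] if row else None


# Example usage with made-up values.
conn.execute("INSERT INTO dataset (id, name) VALUES (1, 'example_dataset')")
example_alias = add_bundle(conn, 1, "example_dataset.tar", 1024)
print(example_alias, bundle_size(conn, 1))
```

Keeping the alias separate from the bundle's real path means a download link can carry only the alias, which would be resolved to the actual location on the server side.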