No warning with inventoried files missing #100

Closed
tbarnes4 opened this issue Jan 26, 2021 · 6 comments
Labels
enhancement New feature or request wontfix This will not be worked on

Comments

@tbarnes4

Is your feature request related to a problem? Please describe.
When the deep archive tool wraps up a bundle, if a collection primary member is missing, it proceeds without any error, warning, or notice. This can be a problem when you are trying to submit a complete collection to the deep archive, but a single file is missing for whatever reason (or, as in the case of #69, a file has the wrong LID). It becomes a serious problem when, down the road, someone tries to pull the bundle from the deep archive and finds that a product is missing because it was never archived.

Describe the solution you'd like
Check to see if every product (perhaps only check for primary products?) in the inventory.csv file is included in the AIP/SIP files. If not, report to the user that product X is missing. I would recommend not stopping generation of the AIP/SIP files though.
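
As a rough illustration of the check described above, here is a minimal Python sketch. It assumes each inventory.csv row has the form `<member status>,<LIDVID>` with `P` marking primary members, and that the manifest is tab-separated with the LIDVID in a known column (the first column by default). The function name `missing_primary_members`, the `lidvid_column` parameter, and the sample LIDVIDs are hypothetical and not part of pds-deep-archive.

```python
import csv
import io


def missing_primary_members(inventory_text, manifest_text, lidvid_column=0):
    """Return primary-member LIDVIDs that are inventoried but absent from the manifest."""
    # Assumption: inventory rows look like "P,<LIDVID>" or "S,<LIDVID>".
    primaries = []
    for row in csv.reader(io.StringIO(inventory_text)):
        if len(row) >= 2 and row[0].strip().upper() == "P":
            primaries.append(row[1].strip())

    # Assumption: manifest rows are tab-separated and carry the LIDVID
    # in column `lidvid_column`.
    archived = set()
    for row in csv.reader(io.StringIO(manifest_text), delimiter="\t"):
        if len(row) > lidvid_column:
            archived.add(row[lidvid_column].strip())

    # Keep inventory order so the report is easy to compare by eye.
    return [lidvid for lidvid in primaries if lidvid not in archived]


# Made-up sample data: product "b" is inventoried but never archived.
inventory = (
    "P,urn:nasa:pds:example:document:a::1.0\n"
    "P,urn:nasa:pds:example:document:b::1.0\n"
    "S,urn:nasa:pds:other:data:c::1.0\n"
)
manifest = "urn:nasa:pds:example:document:a::1.0\tdocument/a.xml\n"

for lidvid in missing_primary_members(inventory, manifest):
    print(f"WARNING: inventoried product {lidvid} is missing from the manifest")
```

The sketch only reports the gaps and never stops package generation, in line with the recommendation above. Secondary members are skipped because they may legitimately live in another bundle.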

Describe alternatives you've considered
Add a flag that will scan the bundle and report back any missing products. Either this flag (1) will not generate the AIP/SIP files when invoked, (2) will not generate the AIP/SIP files only if there are missing products, or (3) will generate the AIP/SIP files and print to the screen or report file a notice that XYZ file(s) were missing.
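
The three flag behaviours listed above can be compared side by side in a small Python sketch. The mode names (`report-only`, `fail-on-missing`, `warn`) and the function `plan_run` are hypothetical labels invented for illustration; pds-deep-archive has no such option.

```python
def plan_run(mode, missing):
    """Return (generate_packages, messages) for a hypothetical missing-product flag.

    `missing` is a list of product identifiers that were inventoried but not found.
    """
    messages = [f"NOTICE: product {product} is missing" for product in missing]

    if mode == "report-only":
        # Option 1: scan and report, never generate the AIP/SIP files.
        return False, messages
    if mode == "fail-on-missing":
        # Option 2: generate the AIP/SIP files only when nothing is missing.
        return not missing, messages
    if mode == "warn":
        # Option 3: always generate the AIP/SIP files and report what is missing.
        return True, messages
    raise ValueError(f"unknown mode: {mode!r}")


generate, notes = plan_run("warn", ["urn:nasa:pds:example:document:b::1.0"])
print(generate)
for note in notes:
    print(note)
```

Keeping the decision in one pure function makes each option easy to test without touching the file system.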

Additional context
I discovered this problem when I ran the deep archive tool on the EPOXI bundle mentioned in #69. I did a sanity check and noticed the hartley2_photometry/document/epoxi_photometry_v5.[xml|pdf] files were missing from the manifest tables. It happens that the product is mentioned in the inventory.csv file, but the LID in the xml file is wrong and needs to be corrected.

I will also note, the validate tool (1.23.1 2020-05-16) currently does not check for inclusion of all primary members. I am in the process of checking if the current tool version does this or not.

@tbarnes4 tbarnes4 added enhancement New feature or request requirement-needed labels Jan 26, 2021
@tbarnes4
Author

tbarnes4 commented Jan 26, 2021

I said:

I will also note, the validate tool (1.23.1 2020-05-16) currently does not check for inclusion of all primary members. I am in the process of checking if the current tool version does this or not.

I can now confirm that the validate tool version 1.24.0 (2020-09-08) likewise does not check for inclusion of all collections (found in the bundle.xml file), nor for primary members of the data collections (as found in the inventory.csv file).

@jordanpadams
Member

@tbarnes4 how are you running the validate tool? Have you run it with the -R pds4.bundle rule? It should catch these things.

@jordanpadams
Member

@tbarnes4 please see NASA-PDS/validate#287 (comment) . pds-deep-archive assumes the bundle is valid. in order to avoid duplication of effort, we would prefer validate be executed to find these issues.

@jordanpadams jordanpadams added the wontfix This will not be worked on label Jan 26, 2021
@tbarnes4
Author

tbarnes4 commented Jan 26, 2021 via email

@tbarnes4
Author

I agree it is sufficient to skip this suggestion for the pds-deep-archive tool and rely on the validate tool.

@jordanpadams
Member

Thanks @tbarnes4
