-
Notifications
You must be signed in to change notification settings - Fork 22
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
S3-only download can't get metadata #82
Comments
Related question - assuming this is intentional, how can you download just the metadata? I know I can run
and then Control-c out of it once it's downloaded |
yes, the issue mentioned in the first comment is related to #67. We are planning to have this fixed in the next release. If you want to download just the package-file-metadata file, you can run : The tool will say it didn't find any matching files because the metadata file doesn't contain a record for itself, but the tool always downloads this file before downloading any other file in the package, so it will be downloaded (if it doens't already exist locally). We can add a low priority ticket to include a record for the metadata file itself to the metadata file so that the output of the program is accurate in this particular case. |
I don't think you were asking about the data-structure files, but in case you were, you can get those with the following regex: |
I wasn't, but that made me realize the solution to "download just the metadata" is:
which will happily download the This still doesn't solve the underlying bug that if you're using S3, the metadata should live there too, though. |
In all new packages the metadata file includes itself, which means you can run
and there should be 1 file that meets the filter criteria. |
Any reason to do that vs. match-nothing? |
And what about what I said about S3? Or is the idea that you don't want the metadata remote? (In my case, it is anyway, because the "local" directory is S3-mounted.) |
End result is the same. I guess the intent of the user is more clear if you actually specify the file you want to download
The metadata file is used extensively by the program so for now we decided to always have that local. Regarding your use of s3fs - the way the program works now (when downloading locally) is to append a .partial extension to files as they are being downloaded, and then rename the file when the download is complete. The rename operation is not implemented by tools like s3fs, so we were under the impression that this needs to change before the downloadcmd can work with s3 mounted directories. Have you not run into this situation? |
(a) s3fs does support rename (even deep directory rename), and has for many years now (almost a decade!) (b) But regardless, we are in fact using the AHA - but, in re-testing this just now, it appears you made an important change in 01a7b08 that changes all of this - you're downloading So this is moot anyway! |
I don't know if this is a dupe of #67 but it seems that if you want to download to S3, you have to download
package_file_metadata.txt.gz
locally first.Is this intentional or a bug?
E.g.:
The text was updated successfully, but these errors were encountered: