-
Notifications
You must be signed in to change notification settings - Fork 5
Proposed addition to the GCS bucket, gs://cmip6/ #12
Comments
👍 from me! |
I would also not be opposed to moving all the older data to the new prefix as well. |
Yes, that would be very easy, since we can copy first, change the csv catalog and THEN remove the old copy |
@charlesbluca , this would require a revision to your rclone github actions from GCS to AWS. As you have currently set it up, this new prefix would get added to the 'other' category. That would be fine for now. We can rethink the division into the 20 jobs when it starts bogging down - okay? |
Sounds good to me! For now I'll continue monitoring the workflow and see how the job handling Let me know if you decide on copying the older data to the new prefix, as this would probably allow us to continue using the current workflow (as long as we change the affected directory names). |
Thanks @charlesbluca, I will certainly let you know if I start moving lots of data! |
I am now in the process of making these changes. Since all of our zarr datasets must be copied and then deleted, it will a few months to complete. In the meantime, the URLs in the new csv catalog files are a mixture of the old and new naming conventions - which should not cause trouble to anyone who uses the catalog to find the data. However, if you find your datasets by assuming a URL naming structure like:
then these URLs are all being replaced by:
I have been doing some preliminary testing, including an The new versions are temporarily called Apologies for all of the confusion, but this will clean up many discrepancies and simplify our 'CMIP' objects in the precious gs://cmip6 bucket to the following:
|
I propose to add any new versions of datasets to our exisiting Google Cloud zarr bucket by adding a new prefix to our CMIP6 collection -
gs://cmip6/CMIP6
which solves multiple problems.This follows the naming of the other CMIP collections,
gs://cmip6/CMIP5
andgs://cmip6/CMIP3
This allows us to start using the
version
in the object names, for example:would now be stored in:
The text was updated successfully, but these errors were encountered: