-
Notifications
You must be signed in to change notification settings - Fork 43
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Expose Intake-ESM test data for download #437
Comments
I like this idea... A pull request is definitely welcome :) |
Absolutely. If I find time, I will. We can keep this here as a record of the idea. |
@jukent, here are some of the intake examples packaged as conda packages: https://github.com/intake/intake-examples |
@andersy005: That @jukent: If you can follow the logic in that repo, that might be a good approach to writing an |
@kmpaul @andersy005 It looks to me like that repo just houses the data -- and then in each tutorial notebook they point to the data catalog directly with the url. Am I missing something?
|
@jukent: I think @andersy005 recommended looking at this repository because it demonstrates (or appears to) how to make downloadable data and corresponding catalogs work from an installable package. If you look at the @andersy005: Is that why you were suggesting looking at this? |
The data-us-states package is a much better example. It's serving data hosted in this repository. So, when user installs the In my opinion, the existing netCDF files aren't really very useful for demonstration purposes, and because accessing netCDF over HTTP is complicated, I would recommend using a subset of the realistic cat = intake_esm.tutorial.load_catalog('aws_cesm1_lens_sample')
cat = intake_esm.tutorial.load_catalog('aws_cesm2_lens_sample')
cat = intake_esm.tutorial.load_catalog('aws_cmip6_sample') |
Is your feature request related to a problem? Please describe.
Understanding and learning intake-esm could be made much easier if the user had a sample dataset and catalog to reference. It doesn't have to be "real" in any sense. It could just be an example dataset, but I think there are pared down datasets in the
tests
directory of this repository that might fit the bill. But I think it would be helpful if this dataset and corresponding catalog could be downloaded with intake-esm in the same way thatxarray.tutorial
provides an easy mechanism for downloading example datasets.Describe the solution you'd like
In the
tests
directory of this repository, the following catalogs and datasets appear to be already available as toy examples:sample-collections/cesm1-lens-netcdf.csv
/cesm1-lens-netcdf.json
+sample-data/cesm-le/*.nc
sample-collections/cmip5-netcdf.csv
/cmip5-netcdf.json
+sample-data/cmip/cmip5/*
sample-collections/cmip6-netcdf-test.csv
/cmip6-netcdf.json
+sample-data/cmip/CMIP6/*
sample-collections/multi-variable-catalog.csv
/multi-variable-catalog.json
+sample-data/cesm-multi-variables/*.nc
I think an excellent solution might be to create an
intake_esm.tutorial
module that contains something like aload_collection(name)
function (orload_catalog(name)
?) that loads the above sample data and corresponding catalogs from GitHub, in the way thatxarray.tutorial.load_dataset(name)
does. I would imagine that this function might return the path to the newly downloaded catalog (or the path to the existing catalog, if already downloaded?).Describe alternatives you've considered
The obvious solution is to just tell people to
git clone
the repo and possibly reorganize the files so they are more easily "discoverable". Perhaps that should be a first (simple) attempt.The text was updated successfully, but these errors were encountered: