DCAT support for all datasets #1138
Comments
You can schedule your own RDF dump using the rdf-export command-line tool (I suggest daily). The code for it is at https://github.com/okfn/ckan/blob/master/ckan/lib/cli.py#L488 and the required command-line arguments are documented at that link.
@rossjones but this is a tool for CKAN administrators on the server itself? I cannot use this tool to get a dump from data.gov, right?
You're correct, you'd have to get the admins there to do it, but it would be a much more efficient way of getting the data than tens of thousands of API calls. At least until there is VoID support.
Is this something that might get implemented quickly or is this a wontfix?
We have an open ticket for it on data.gov.uk, and after that it depends on whether it gets accepted into core CKAN. It is unlikely to be done within days, more likely weeks, and it won't be realtime (we have 9.5k datasets, the US has a lot more). And again, the administrator would need to set up the background task. I think the easiest approach at present is to ask the data.gov team to enable the RDF dumps.
Can you link us to that issue?
Afraid it isn't on GitHub or otherwise openly accessible.
https://github.com/okfn/ckanext-dcat This is being addressed here. |
feature request
Allow people and machines to download a data dump that describes all datasets in a CKAN instance using the DCAT vocabulary. There should be an option to license this data dump in accordance with the Open Knowledge Definition.
why
I know that we can already request a single dataset's metadata as RDF using, for instance, http://datahub.io/dataset/thesesfr.rdf. To get the metadata for all datasets, we could query the API for the list of datasets and then perform one request per dataset to fetch its RDF. When trying this on data.gov, the fun ends quickly, as http://catalog.data.gov/api/3/action/package_list times out (see #1137).
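To make the workaround concrete, here is a rough Python sketch of the approach described above: ask `package_list` for every dataset name, then build one `.rdf` URL per dataset. The helper names `list_datasets` and `rdf_urls` and the injectable `fetch` parameter are my own for illustration and are not part of CKAN. It assumes the instance serves RDF at `/dataset/<name>.rdf` as datahub.io does.

```python
import json
from urllib.parse import quote
from urllib.request import urlopen


def list_datasets(base_url, fetch=None):
    """Return all dataset names reported by the package_list action.

    `fetch` is a hypothetical hook (URL -> bytes) so the HTTP call can be
    swapped out; by default it performs a plain GET with a 30 s timeout.
    """
    if fetch is None:
        fetch = lambda url: urlopen(url, timeout=30).read()
    url = base_url.rstrip("/") + "/api/3/action/package_list"
    payload = json.loads(fetch(url))
    if not payload.get("success"):
        raise RuntimeError("package_list call failed")
    return payload["result"]


def rdf_urls(base_url, names):
    """Build the per-dataset RDF URL for each dataset name."""
    base = base_url.rstrip("/")
    return [f"{base}/dataset/{quote(name)}.rdf" for name in names]
```

Calling `rdf_urls("http://datahub.io", list_datasets("http://datahub.io"))` would give the list of URLs to download, which still means one HTTP request per dataset. That is exactly the cost a single DCAT dump would avoid.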