Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Missing Owner Org when harvesting RDF source #26

Closed
Hoedic opened this issue Mar 26, 2015 · 5 comments
Closed

Missing Owner Org when harvesting RDF source #26

Hoedic opened this issue Mar 26, 2015 · 5 comments

Comments

@Hoedic
Copy link

Hoedic commented Mar 26, 2015

While running an import from a DCAT RDF source, the harvester tells the owner org is missing:

2015-03-26 01:54:24,757 INFO [ckanext.harvest.queue] Received harvest object id: 68d9d538-313c-40f8-83e4-711bcc91afd7 2015-03-26 01:54:24,793 DEBUG [ckanext.dcat.harvesters.rdf] In DCATRDFHarvester import_stage 2015-03-26 01:54:24,836 DEBUG [ckanext.harvest.harvesters.base] Create validation Error: {'Owner org': 'Missing value'}

I am trying to import the RDF example : https://github.com/ckan/ckanext-dcat/blob/master/examples/catalog_datasets_list.rdf

When configuring the harvester, I used the field "organization" in the for to specify an organization in CKAN. At some point I also tried to put owner_org: my-organization in the configuration textarea.

now my configuration fields look like that (I don't know if it's of any use...):
{ "default_groups":["a-group"], "user":"admin", "read_only": false }

After looking at the code, I don't see any place where the fetch process tries to read the configuration.

amercader added a commit that referenced this issue Mar 27, 2015
@amercader
Copy link
Member

You are correct @Hoedic, the harvester does not handle owner organizations and that will fail if your CKAN instance requires datasets to have one. Can you try this branch and see if it solves the issue:

#27

It should set the harvested datasets to the same organization that the harvest source belongs to (the one you set when creating the harvest source)

@lubertorubior
Copy link

Hi, could this also be done for a "DCAT JSON Harvester" source?
The configuration of the source doesn't seem to be read.

@amercader
Copy link
Member

@lubertorubior I've pushed changes to support the JSON harvester as well. Can you test the branch on #27 to see if it works as expected?

@lubertorubior
Copy link

It works great, thank you @amercader !

@amercader
Copy link
Member

Fixed in #27

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants