Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

How to match license when uploading EML? #1778

Closed
peterdesmet opened this issue May 4, 2022 · 14 comments
Closed

How to match license when uploading EML? #1778

peterdesmet opened this issue May 4, 2022 · 14 comments
Assignees
Milestone

Comments

@peterdesmet
Copy link
Member

The dataset license is exported in the EML as:

<intellectualRights>
    <para>To the extent possible under law, the publisher has waived all rights to these data and has dedicated them to the <ulink url="http://creativecommons.org/publicdomain/zero/1.0/legalcode"><citetitle>Public Domain (CC0 1.0)</citetitle></ulink>. Users may copy, modify, distribute and use the work, including for commercial purposes, without restriction.</para>
</intellectualRights>

I'm generating an EML file, that I then upload to the IPT. Should I match the above verbatim if I want the IPT to recognize the license I'm providing? Or is it possible to provide something shorter that the IPT will still recognize as that license? E.g. the license name:

license.name.cczero=Public Domain (CC0 1.0)

@mike-podolskiy90
Copy link
Contributor

Thanks @peterdesmet for your question
Yes, you can provide the license name, IPT should recognize that.

@mike-podolskiy90
Copy link
Contributor

mike-podolskiy90 commented May 4, 2022

But there will be an issue with GBIF infrastracture. Other systems (namely registry) will fail to digest it:

No machine readable license was detected

So to be processed they have to contain this part:

<ulink url="..."><citetitle>...</citetitle></ulink>

@mike-podolskiy90 mike-podolskiy90 self-assigned this May 4, 2022
@peterdesmet
Copy link
Member Author

Aha, thanks! I notice the name needs to be wrapped in <para> for the IPT to recognize it:

<intellectualRights><para>Public Domain (CC0 1.0)</para></intellectualRights>

Maybe it would be worth using license identifiers provided by https://opendefinition.org/licenses/api/ to make the match. Those identifiers are typically returned by APIs, such as Zenodo. The identifiers would be:

CC0-1.0
CC-BY-4.0
CC-BY-NC-4.0

@mike-podolskiy90
Copy link
Contributor

Sounds reasonable to me, but unfortunately there is nothing we can do in the IPT with this

@peterdesmet
Copy link
Member Author

So I should try to match the (long) values verbatim?

@mike-podolskiy90
Copy link
Contributor

@peterdesmet Yes

@timrobertson100
Copy link
Member

Sounds reasonable to me, but unfortunately there is nothing we can do in the IPT with this

It does seem like a reasonable thing for the EML parser to support and that a user can set a license in an EML file that the IPT detects. Could we just handle this somewhere around here perhaps?

@mike-podolskiy90
Copy link
Contributor

Thanks Tim, I'll try to test that in the gbif-metadata-profile project

mike-podolskiy90 added a commit to gbif/gbif-metadata-profile that referenced this issue May 23, 2022
@mike-podolskiy90 mike-podolskiy90 added this to the 3.0 milestone May 23, 2022
@peterdesmet
Copy link
Member Author

Thanks for implementing, where can I test this? https://ipt.gbif.org?

@mike-podolskiy90
Copy link
Contributor

https://ipt.gbif-uat.org/
Do you have an account there?

@peterdesmet
Copy link
Member Author

No, but I do have one at https://ipt.gbif.org. Testing is not urgent for me.

@mike-podolskiy90
Copy link
Contributor

It's a snapshot version, and we usually don't deploy snapshots to ipt.gbif.org. But I can create an account on UAT for you

@peterdesmet
Copy link
Member Author

Tested: works well, but would it be possible to make matching case insensitive? I noticed that the official key CC0-1.0 is expressed as cc0-1.0 by e.g. DataCite.

@mike-podolskiy90
Copy link
Contributor

Thank you, good point. I'll make it case insensitive

mike-podolskiy90 added a commit to gbif/gbif-metadata-profile that referenced this issue May 25, 2022
@mike-podolskiy90 mike-podolskiy90 modified the milestones: 3.0, 2.6 Jul 6, 2022
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants