Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Explore data access and license details. #1

Open
bbarker opened this issue May 17, 2019 · 0 comments
Open

Explore data access and license details. #1

bbarker opened this issue May 17, 2019 · 0 comments
Labels
question Further information is requested

Comments

@bbarker
Copy link
Contributor

bbarker commented May 17, 2019

I will update this post as necessary to reflect the plan and my understanding as it evolves.

Let's specify an algorithm for detecting data access categories (which are currently not part of metajelo itself).

  1. If a license is specified (in a future version of metajelo; attribute about open/non-open for licenses, policies, etc. metajelo#8), then assign some categories from {C_i} based on a hardcoded mapping we have devised from a list of licenses to categories we support. Examples for categories could be:
    1. Open Science Badges (or really, their underlying definition).
  2. If an unknown license is specified or specified in text:
    1. Each categoryC_i could have a set of positive phrases {P_i} that indicate a match. We could use something like https://github.com/spencermountain/compromise for normalization, and for creating a custom lexicon and matching utilities.
@bbarker bbarker added the question Further information is requested label May 17, 2019
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
question Further information is requested
Projects
None yet
Development

No branches or pull requests

1 participant