affiliated CKAN Service Provider jobs - "DataGroomers" that are meant to periodically groom datastore data #13

Open
6 tasks
Tracked by #5
jqnatividad opened this issue Apr 27, 2022 · 0 comments
Labels
enhancement New feature or request

jqnatividad commented Apr 27, 2022

"Datagroomers", as the name implies, continuously "groom" the data in the background based on certain rules/recipes.

At the moment, I envision them as CKAN service provider jobs.
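
To make the rules/recipes idea concrete, here is a minimal sketch of what a groomer's core loop might look like. Everything in it is an assumption for illustration: the `Rule` class, the `groom` function, and the `trim_whitespace` rule are hypothetical names, and the wrapping into a CKAN service provider job and the periodic scheduling are deliberately left out.

```python
from dataclasses import dataclass
from typing import Callable, Dict, Iterable, List

Row = Dict[str, str]


@dataclass
class Rule:
    """Hypothetical grooming rule: a predicate plus a row transform."""
    name: str
    applies: Callable[[Row], bool]
    transform: Callable[[Row], Row]


def groom(rows: Iterable[Row], rules: List[Rule]) -> List[Row]:
    """Apply every matching rule to each row, in order, and return new rows."""
    groomed = []
    for row in rows:
        current = dict(row)  # copy so the input rows are never mutated
        for rule in rules:
            if rule.applies(current):
                current = rule.transform(current)
        groomed.append(current)
    return groomed


# Example rule (hypothetical): strip stray whitespace from every value.
trim_whitespace = Rule(
    name="trim-whitespace",
    applies=lambda row: any(v != v.strip() for v in row.values()),
    transform=lambda row: {k: v.strip() for k, v in row.items()},
)
```

A background job would presumably read rows from the datastore, pass them through `groom` with the configured rules, and write back only the rows that changed.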

Several "datagroomers" come to mind:

  • libpostal datagroomer - for normalizing addresses
  • geocoding datagroomer
    • using qsv's built-in, low-resolution geonames geocoder
    • using the user's preferred geocoding service, leveraging qsv fetch
  • auto-tagging datagroomer - for adding tags based on certain domains (e.g. clean-energy tagger, internet of water tagger, etc.)
  • related resources datagroomer - for linking related resources based on their data dictionaries
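
As one illustration of the list above, the auto-tagging groomer could start as simple keyword matching against the field names in a resource's data dictionary. This is only a sketch under stated assumptions: the `DOMAIN_KEYWORDS` vocabularies, the tag names, and the `suggest_tags` function are all hypothetical, and a real tagger would likely need curated domain vocabularies.

```python
from typing import Dict, Iterable, Set

# Hypothetical domain vocabularies; real ones would be curated per domain.
DOMAIN_KEYWORDS: Dict[str, Set[str]] = {
    "clean-energy": {"solar", "wind", "kwh", "turbine"},
    "internet-of-water": {"streamflow", "aquifer", "watershed", "gauge"},
}


def suggest_tags(field_names: Iterable[str], min_hits: int = 1) -> Set[str]:
    """Suggest domain tags whose keywords appear in the given field names."""
    tokens: Set[str] = set()
    for name in field_names:
        # Split names such as "solar_output_kwh" or "Gauge-Height" into words.
        tokens.update(name.lower().replace("-", "_").split("_"))
    return {
        tag
        for tag, keywords in DOMAIN_KEYWORDS.items()
        if len(tokens & keywords) >= min_hits
    }
```

Raising `min_hits` trades recall for precision: a resource would then need several domain keywords among its field names before a tag is suggested.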
@jqnatividad jqnatividad added the enhancement New feature or request label Dec 19, 2022