Skip to content
This repository has been archived by the owner on Feb 13, 2023. It is now read-only.

Harvester for old website data #5

Open
8 of 9 tasks
sirex opened this issue May 5, 2017 · 2 comments
Open
8 of 9 tasks

Harvester for old website data #5

sirex opened this issue May 5, 2017 · 2 comments

Comments

@sirex
Copy link
Contributor

sirex commented May 5, 2017

There is a similar tool:

https://github.com/sirex/ckan-ivpk-import

But in this case we will be synchronizing data directly from MySQL database.

Code for this script is hosted here (based on the mentioned similar tool):

https://github.com/ivpk/opendata.gov.lt-mysql-import

Todo

  • Data packages (datasets).
  • Add cron job to do the update continuously.
  • Package tags.
  • Contact information.
  • Organizations.
  • Synchronize groups.
  • Investigate, maybe it is possible to import CSV resources into CKAN.

Extra tasks:

  • Possibility to update groups if for example name has changed.
  • Delete groups if groups where deleted in IVPK database.
@sirex sirex created this issue from a note in Replace old opendata.gov.lt to CKAN (In progress) May 5, 2017
@sirex sirex self-assigned this May 5, 2017
@sirex
Copy link
Contributor Author

sirex commented Aug 8, 2017

It was decided to refactor whole synchronization thing to a harvester and only synchronizing datasets.

@sirex sirex changed the title Synchronization script Harvester for old website data Aug 8, 2017
@sirex sirex removed their assignment Aug 31, 2017
@sirex
Copy link
Contributor Author

sirex commented Nov 24, 2017

@JustNindze I'm attaching list of links used in opendata.gov.lt packages, use these links to extract all package resources for each link.

rinkmenos.csv.gz

Sign up for free to subscribe to this conversation on GitHub. Already have an account? Sign in.
Labels
None yet
Development

No branches or pull requests

2 participants