This repository has been archived by the owner on Mar 6, 2024. It is now read-only.

Copy interesting datasets from old datahub [part 2] #214

Closed
11 tasks done
anuveyatsu opened this issue Jul 16, 2018 · 1 comment
Member

anuveyatsu commented Jul 16, 2018

Take a look at old.datahub.io/dataset and copy the interesting datasets from it, along with their tags. Use dataflows for the scrape. The current status of the analysis is in our DataHub Analysis document, under the heading "Copy over popular datasets from old datahub".
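
A rough sketch of what such a scrape could look like, not the script actually used for this issue. It assumes old.datahub.io exposes the standard CKAN `package_search` action API (the URL below is that assumption, and the site may no longer respond). The helper names `package_to_row`, `fetch_rows` and `run_flow`, and the output path, are made up for illustration. Tags are joined into one comma-separated string so each dataset stays a flat row.

```python
import json
from urllib.request import urlopen

# Assumption: old.datahub.io is a CKAN instance serving the standard action API.
CKAN_SEARCH_URL = "https://old.datahub.io/api/3/action/package_search"


def package_to_row(package):
    """Flatten one CKAN package dict into a flat row, keeping its tags."""
    tag_names = sorted(tag["name"] for tag in package.get("tags", []))
    return {
        "name": package.get("name", ""),
        "title": package.get("title", ""),
        "notes": package.get("notes") or "",
        "tags": ", ".join(tag_names),
    }


def fetch_rows(rows_per_page=100, max_pages=1):
    """Yield rows from the CKAN search endpoint (needs network access)."""
    for page in range(max_pages):
        start = page * rows_per_page
        url = f"{CKAN_SEARCH_URL}?rows={rows_per_page}&start={start}"
        with urlopen(url) as response:
            payload = json.load(response)
        results = payload["result"]["results"]
        if not results:
            break
        for package in results:
            yield package_to_row(package)


def run_flow(rows, out_path="old-datahub-datasets"):
    """Pass the rows through a dataflows pipeline and write a data package."""
    # Imported lazily so the helpers above work without dataflows installed.
    from dataflows import Flow, dump_to_path

    Flow(rows, dump_to_path(out_path)).process()
```

Calling `run_flow(fetch_rows(max_pages=5))` would write the first 500 dataset records to a local data package, which can then be reviewed by hand before anything is pushed to datahub.io.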

Acceptance criteria

  • the following datasets are published:
    • ATP dataset
    • DBLP
    • opencorporates
    • yago

Tasks

  • prepare and publish ATP data
  • prepare and publish DBLP data
  • prepare and publish opencorporates data (need to speak to their rep)
  • prepare and publish yago data
  • Scrape interesting datasets [12 hrs]
  • Push them to datahub.io with appropriate descriptions [6 hrs]
anuveyatsu added this to the Sprint - 30 Jul 2018 milestone on Jul 16, 2018
anuveyatsu added a commit to datasets/core-datasets that referenced this issue Jul 17, 2018