Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Store metadata in storage in push/pull? #69

Closed
roll opened this issue Mar 28, 2016 · 5 comments
Closed

Store metadata in storage in push/pull? #69

roll opened this issue Mar 28, 2016 · 5 comments
Labels

Comments

@roll
Copy link
Member

roll commented Mar 28, 2016

For now we do not store any metadata in the storage (bigquery, sql etc). It means pull_datapackage works only using reflection of the database. So additional metadata can't be restored (description, user defined fields etc).

As a solution we can store additional table with stringified datapackage.json.


pwalsh

@roll excellent observation. Seeing as we can't rely on a backend supporting, for example, JSON storage, and neither can we expect any particular fields on any given data package, I agree that we can just have a table with a stringified datapackage.json, but it might have other columns too, that for example point to the tables used for the DP in question (Imagine a storage backend that holds data from many datapackages - we might expect a common meta table that points out to the various tables for each package).

@roll roll added this to the Backlog milestone Mar 28, 2016
@roll roll changed the title Store metadata in storage? Store metadata in storage in push/pull? Mar 31, 2016
@rufuspollock
Copy link
Contributor

I'm not sure we'd want to push full metadata into the backend storage would we? Normally we just want to push the data in order to do some analysis, not as permanent storage for the data packages.

@roll
Copy link
Member Author

roll commented May 12, 2016

Here on JTS level - frictionlessdata/tableschema-py#70 - I'm also leaning to the idea to not store metadata in backend to do not complicate things.

@roll roll modified the milestone: datapackage-v1 Aug 7, 2016
@roll roll added the backlog label Aug 8, 2016
@roll roll removed this from the tools-v1 milestone Aug 8, 2016
@roll
Copy link
Member Author

roll commented Aug 8, 2016

CLOSED FOR NOW (will be solved with other approach at jts level)

@roll roll closed this as completed Aug 8, 2016
@roll roll removed the backlog label Aug 8, 2016
@rufuspollock
Copy link
Contributor

@roll what's the referencing issue for closing this - if it is going to be closed in JTS somewhere could you reference the issue there?

@roll
Copy link
Member Author

roll commented Aug 9, 2016

@rgrp
Sorry here it is - frictionlessdata/tableschema-py#70 - it will fix problem with types on JTS level. On DP level based on your words we don't need to store metadata like licence etc.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Projects
None yet
Development

No branches or pull requests

2 participants