Store metadata in storage in push/pull? #69

roll · 2016-03-28T10:16:51Z

For now we do not store any metadata in the storage (bigquery, sql etc). It means pull_datapackage works only using reflection of the database. So additional metadata can't be restored (description, user defined fields etc).

As a solution we can store additional table with stringified datapackage.json.

pwalsh

@roll excellent observation. Seeing as we can't rely on a backend supporting, for example, JSON storage, and neither can we expect any particular fields on any given data package, I agree that we can just have a table with a stringified datapackage.json, but it might have other columns too, that for example point to the tables used for the DP in question (Imagine a storage backend that holds data from many datapackages - we might expect a common meta table that points out to the various tables for each package).

The text was updated successfully, but these errors were encountered:

rufuspollock · 2016-04-29T11:05:22Z

I'm not sure we'd want to push full metadata into the backend storage would we? Normally we just want to push the data in order to do some analysis, not as permanent storage for the data packages.

roll · 2016-05-12T08:26:15Z

Here on JTS level - frictionlessdata/tableschema-py#70 - I'm also leaning to the idea to not store metadata in backend to do not complicate things.

roll · 2016-08-08T17:28:37Z

CLOSED FOR NOW (will be solved with other approach at jts level)

rufuspollock · 2016-08-09T09:15:31Z

@roll what's the referencing issue for closing this - if it is going to be closed in JTS somewhere could you reference the issue there?

roll · 2016-08-09T09:18:26Z

@rgrp
Sorry here it is - frictionlessdata/tableschema-py#70 - it will fix problem with types on JTS level. On DP level based on your words we don't need to store metadata like licence etc.

roll added this to the Backlog milestone Mar 28, 2016

This was referenced Mar 28, 2016

Store metadata in storage? openknowledge-archive/datapackage-storage-py#3

Closed

Store schemas in storage frictionlessdata/tableschema-sql-py#27

Closed

roll changed the title ~~Store metadata in storage?~~ Store metadata in storage in push/pull? Mar 31, 2016

roll added the backlog label May 5, 2016

roll removed this from the Backlog milestone May 5, 2016

roll added feature and removed backlog labels May 5, 2016

This was referenced May 11, 2016

Storage: add schema argument to describe? frictionlessdata/tableschema-py#70

Closed

Add optional schema argument to Storage.read/write? frictionlessdata/tableschema-py#69

Closed

roll modified the milestone: datapackage-v1 Aug 7, 2016

roll added the backlog label Aug 8, 2016

roll removed this from the tools-v1 milestone Aug 8, 2016

roll closed this as completed Aug 8, 2016

roll removed the backlog label Aug 8, 2016

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Store metadata in storage in push/pull? #69

Store metadata in storage in push/pull? #69

roll commented Mar 28, 2016 •

edited

rufuspollock commented Apr 29, 2016

roll commented May 12, 2016

roll commented Aug 8, 2016

rufuspollock commented Aug 9, 2016

roll commented Aug 9, 2016 •

edited

Store metadata in storage in push/pull? #69

Store metadata in storage in push/pull? #69

Comments

roll commented Mar 28, 2016 • edited

rufuspollock commented Apr 29, 2016

roll commented May 12, 2016

roll commented Aug 8, 2016

rufuspollock commented Aug 9, 2016

roll commented Aug 9, 2016 • edited

roll commented Mar 28, 2016 •

edited

roll commented Aug 9, 2016 •

edited