Write mode for topen #36

roll · 2016-01-08T14:04:38Z

Overview

It's a long shot but eventually I suppose it could be implemented.

Having a task to write some tabular data to the filesystem is casual. With all the boilerplate code to support py2/3, csv verbose interface etc - it's a little bit annoying.

Analysis

Interface could be:

with topen('table.csv', mode='w') as table:
    table.write(data)

Implementation could be:

topen returns ReadTable or WriteTable regarding to the mode='r/w'
for writing there will be new modules like Formatter (anti-parser) and Writer (anti-loader) with the same modular arhictecture for different targets and formats.

So we will be able to have memory lean things like:

with (topen('http://site.com/source.xls') as source, 
      topen('target.csv', mode='w') as target):
    target.write(source)

Or even with something like tcopy helper:

tcopy(data, 'target.csv')
tcopy('http://site.com/source.xls', 'target.csv')

Tasks

TBD
We should support recoding - Encoding and recoding #50

The text was updated successfully, but these errors were encountered:

pwalsh · 2016-01-10T06:05:22Z

@roll excellent idea. Presumably this could DRY code for exporting to Data Package from other data stores, like SQL, BigQuery, etc.

roll · 2016-01-11T07:30:00Z

Yes, I've written too many boilerplate code lately 😃

pwalsh · 2016-08-07T18:07:10Z

Is this really the same as #50? I guess internally, both could be supported by the same processor? I'm just a bit worried that by closing #50 we've lost the particularity of that request.

roll · 2016-08-07T18:10:26Z

@pwalsh
I've added it to the tasks list. I think it's exactly what we need - on a write stage we will be able to set encoding so any recoding could be possible.

pwalsh · 2016-08-07T18:12:40Z

@roll I see, but it still seems to me that a high-level write interface is different from a write processor, used in the read interface, to create a new, recoded file. No?

roll · 2016-08-07T18:20:43Z

@pwalsh
I've re-opened those issue because if something like this is not enough:

with topen('source.xls') as source:
  with topen('target.csv', mode='w', encoding='utf-8') as target:
    target.write(source)

than it's really different. For now it's just not clear from high-level requirements why you need it as a side effect using processor. This processor will be much less powerful than general writing system and will require some duplication.

pwalsh · 2016-08-07T18:25:31Z

@roll

Maybe it is, not sure. Imagine piping data through a chain of processors like so:

source -> structure | schema | recoder | writer

the recoder just recodes the stream, and is followed by a final processor writing the stream to some new destination.

roll · 2016-08-07T18:27:44Z

@pwalsh
After loader and parser there is no encodings - it's python objects. So I suppose this WriteTable functionality will be your recoder from pipeline.

So let see closer to real design proposals 😃
I've reopened the recoding issue to be sure.

pwalsh · 2016-08-07T18:31:56Z

@roll ok, if that is the internal design. In your examples above, would the writing require the file contents to be loaded to memory? The API description looks like yes, but that would be a mistake IMHO.

roll · 2016-08-07T18:32:53Z

@pwalsh
Just lazy example) We use streams here) So it should be memory lean.

After we will finish it my idea to create example like loading 1GB xls from the web and saving it to csv file with memory profiler showing we don't use memory)

roll added the feature label Jan 8, 2016

pwalsh added this to the Backlog milestone Jan 10, 2016

roll mentioned this issue Mar 6, 2016

Feature/plugins and storage frictionlessdata/tableschema-py#51

Merged

roll added the backlog label May 5, 2016

roll removed this from the Backlog milestone May 5, 2016

roll removed the backlog label May 11, 2016

roll mentioned this issue Aug 6, 2016

Encapsulate all tabular read/write operations into tabulator frictionlessdata/frictionlessdata.io#266

Closed

4 tasks

roll modified the milestone: tabulator-v1 Aug 7, 2016

roll mentioned this issue Aug 7, 2016

Encoding and recoding #50

Closed

roll added the priority label Aug 8, 2016

roll removed this from the tools-v1 milestone Aug 8, 2016

roll added breaking feature and removed feature breaking labels Aug 9, 2016

roll modified the milestone: tabulator-v1 Aug 9, 2016

roll added feature v0.7 and removed breaking labels Aug 11, 2016

roll mentioned this issue Aug 31, 2016

Rebase Table.write on tabulator write mode when it will be available frictionlessdata/tableschema-py#93

Closed

roll added v0.6 and removed v0.7 labels Sep 11, 2016

roll self-assigned this Sep 11, 2016

roll added the deprecating label Sep 11, 2016

roll mentioned this issue Sep 12, 2016

Feature/updated api #80

Merged

roll added review and removed priority labels Sep 12, 2016

roll closed this as completed in #80 Sep 13, 2016

roll removed the review label Sep 13, 2016

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Write mode for topen #36

Write mode for topen #36

roll commented Jan 8, 2016 •

edited

pwalsh commented Jan 10, 2016

roll commented Jan 11, 2016

pwalsh commented Aug 7, 2016

roll commented Aug 7, 2016 •

edited

pwalsh commented Aug 7, 2016

roll commented Aug 7, 2016 •

edited

pwalsh commented Aug 7, 2016 •

edited

roll commented Aug 7, 2016 •

edited

pwalsh commented Aug 7, 2016

roll commented Aug 7, 2016 •

edited

Write mode for topen #36

Write mode for topen #36

Comments

roll commented Jan 8, 2016 • edited

Overview

Analysis

Tasks

pwalsh commented Jan 10, 2016

roll commented Jan 11, 2016

pwalsh commented Aug 7, 2016

roll commented Aug 7, 2016 • edited

pwalsh commented Aug 7, 2016

roll commented Aug 7, 2016 • edited

pwalsh commented Aug 7, 2016 • edited

roll commented Aug 7, 2016 • edited

pwalsh commented Aug 7, 2016

roll commented Aug 7, 2016 • edited

roll commented Jan 8, 2016 •

edited

roll commented Aug 7, 2016 •

edited

roll commented Aug 7, 2016 •

edited

pwalsh commented Aug 7, 2016 •

edited

roll commented Aug 7, 2016 •

edited

roll commented Aug 7, 2016 •

edited