ARROW-1194: [Python] Expose MockOutputStream in pyarrow. #830

robertnishihara · 2017-07-11T14:23:10Z

This lets you get the serialized size of a record batch and its schema through pyarrow by writing them to a mock output stream, which counts the bytes written without storing them. You can then use the resulting size to allocate an appropriately sized buffer for the actual write.

Example usage:

import pyarrow as pa
import pandas as pd

val = pd.DataFrame({'a': [1, 2, 3]})
record_batch = pa.RecordBatch.from_pandas(val)

# Get the size of the record batch and schema
sink = pa.MockOutputStream()
stream_writer = pa.RecordBatchStreamWriter(sink, record_batch.schema)
stream_writer.write_batch(record_batch)
size = sink.size()

robertnishihara · 2017-07-11T15:04:26Z

I think the Travis test failures are unrelated to this PR.

wesm · 2017-07-11T15:50:07Z

They're unrelated; I'm working on fixing parquet-cpp after the API changes.

wesm · 2017-07-12T06:27:09Z

The builds should be OK now. Reviewing this

wesm

LGTM. Could you rebase so we hopefully get a passing build?

robertnishihara · 2017-07-13T03:40:05Z

Looks like the tests are passing now.

wesm

+1, thanks!

robertnishihara changed the title ~~ARROW-1194: [Python] Expose MockOutputStream to in pyarrow.~~ ARROW-1194: [Python] Expose MockOutputStream in pyarrow. Jul 11, 2017

wesm approved these changes Jul 12, 2017

Expose MockOutputStream to Python.

4e15cd9

wesm approved these changes Jul 13, 2017

asfgit closed this in 28e06d8 Jul 13, 2017

robertnishihara deleted the mockoutputstream branch July 13, 2017 15:01
