[Python] document differences w.r.t. fastparquet #16375

Closed
asfimport opened this issue Apr 3, 2017 · 5 comments

Comments

@asfimport

Document the differences in options and/or the actual written file formats relative to fastparquet (https://fastparquet.readthedocs.io/en/latest/):

  • null handling
  • handling of unsupported types
  • options that can be passed via top-level functions

Reporter: Jeff Reback / @jreback

Note: This issue was originally created as ARROW-760. Please see the migration documentation for further details.

@asfimport

Wes McKinney / @wesm:
@jreback I just moved this JIRA to the Arrow side. I think the differences mostly have to do with the Python API, but we can make a list of C++-only requirements that we need from parquet-cpp (e.g. an API in parquet_arrow for reading a single row group vs. the entire file).

@asfimport

Jeff Reback / @jreback:
Yep, I was just writing the docs and wanted to list any differences or, probably better, point to a document that covers them.

@asfimport

Wes McKinney / @wesm:
I think it makes most sense to document this in the pandas docs where users may encounter differences between the libraries.

@asfimport

Todd Farmer / @toddfarmer:
Transitioning issue from Resolved to Closed based on resolution field value.
