[Python] Random schema and data generator for Arrow conversion and Parquet testing #19021

Closed
asfimport opened this issue May 22, 2018 · 2 comments

Comments

@asfimport

asfimport commented May 22, 2018

See discussion in #2067

Being able to generate random complex schemas and corresponding example datasets would help exercise edge cases in many different parts of the codebase. One practical example: reading and writing nested data in the Parquet format.

Reporter: Wes McKinney / @wesm
Assignee: Krisztian Szucs / @kszucs

Note: This issue was originally created as ARROW-2624. Please see the migration documentation for further details.

@asfimport

Antoine Pitrou / @pitrou:
The ASV benchmark suite has some support for generating random data for some types, but not random schemas (see the BuiltinsGenerator in python/benchmarks/common.py).

@asfimport

Krisztian Szucs / @kszucs:
Issue resolved by pull request #3046.
