Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Add R csv benchmarks #37

Closed
jonkeane opened this issue May 17, 2021 · 0 comments · Fixed by #101
Closed

Add R csv benchmarks #37

jonkeane opened this issue May 17, 2021 · 0 comments · Fixed by #101

Comments

@jonkeane
Copy link
Contributor

Arrow bench has a read_csv benchmark that would be nice to have.

These are the arguments (the defaults from are {arrowbench} are ~what we want to run, though I'm happy to adjust them if we decide that only a subset of sources should be default)

  • It supports all of the arrowbench sources (though we don't need to run all of them, the first three are probably most important) as the source argument:
    • fanniemae_2016Q4
    • nyctaxi_2010-01
    • chi_traffic_2020_Q1
    • type_strings
    • type_dict
    • type_integers
    • type_floats
    • type_nested
    • type_simple_features
  • It supports uncompressed and gzip compressed files as the compression argument
  • It supports output formats arrow_table and data_frame as the output argument
  • The reader argument should be arrow (the other readers it knows how to test are not important for and should not be run on conbench)
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
1 participant