Currently we have (some) microbenchmarks, but measuring performance of our various readers (CSV, JSON, IPC, Parquet, ORC) over "real world" files would also be interesting and hopefully more illustrative of the use cases we actually care about. Such benchmarks may be expensive, though.
Ideally, we would do this in a variety of scenarios: in-memory (to focus on CPU optimization), on-disk (though such measurements would likely be quite noisy), and over the network (perhaps with something like Minio + Toxiproxy to get a consistent, reproducible setup) so that we can also judge the I/O characteristics of the readers.
Reporter: David Li / @lidavidm
Note: This issue was originally created as ARROW-16944. Please see the migration documentation for further details.