It would be great to have a suite of relevant benchmarks for the Python serialization code in ARROW-759. These could be used to guide profiling and performance improvements.
Relevant use cases include:
- dictionaries of large NumPy arrays that represent the weights of a neural network
- long lists of primitive types such as ints, floats, or strings
- lists of user-defined Python objects
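
As a rough starting point, the use cases above could be timed with a small harness like the one below. This is only a sketch: `pickle.dumps` / `pickle.loads` are stand-ins for whichever serialization entry points end up being benchmarked, and `Point`, `make_payloads`, `bench`, and `main` are hypothetical names chosen for illustration. The NumPy payload is skipped if NumPy is not installed.

```python
import pickle
import time

try:
    import numpy as np
except ImportError:  # NumPy is optional for this sketch
    np = None


class Point:
    """Hypothetical user-defined object used as a benchmark payload."""

    def __init__(self, x, y):
        self.x = x
        self.y = y


def make_payloads(n=10_000):
    """Build one payload per use case listed in the issue."""
    payloads = {
        "ints": list(range(n)),
        "floats": [i * 0.5 for i in range(n)],
        "strings": [str(i) for i in range(n)],
        "objects": [Point(i, i + 1) for i in range(n)],
    }
    if np is not None:
        # Stand-in for neural network weights: a dict of large arrays.
        payloads["weights"] = {
            f"layer{i}": np.zeros((256, 256)) for i in range(4)
        }
    return payloads


def bench(payload, serialize=pickle.dumps, deserialize=pickle.loads, repeat=3):
    """Return (best serialize seconds, best deserialize seconds, size in bytes).

    The serialize/deserialize defaults are placeholders (assumption); pass
    the functions under test instead.
    """
    best_ser = best_de = float("inf")
    blob = b""
    for _ in range(repeat):
        start = time.perf_counter()
        blob = serialize(payload)
        mid = time.perf_counter()
        deserialize(blob)
        end = time.perf_counter()
        best_ser = min(best_ser, mid - start)
        best_de = min(best_de, end - mid)
    return best_ser, best_de, len(blob)


def main():
    """Print one timing line per payload."""
    for name, payload in make_payloads().items():
        ser, de, size = bench(payload)
        print(f"{name:8s} ser {ser * 1e3:8.3f} ms  de {de * 1e3:8.3f} ms  {size} bytes")
```

Calling `main()` prints one line per payload. Taking the best of several repeats reduces noise from other processes, but a real suite would probably want a proper benchmarking framework and larger, more varied inputs.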
Reporter: Philipp Moritz / @pcmoritz
Note: This issue was originally created as ARROW-1391. Please see the migration documentation for further details.