Open
Labels
Component: C++ · Status: stale-warning (issues and PRs flagged as stale, due to be closed if no indication otherwise) · Type: enhancement
Description
One of the things we want to be able to do in the streaming execution engine is process data in small, L2-sized batches. Based on the literature, we would like to use batches somewhere in the range of 1k to 16k rows. In ARROW-16014 we created a benchmark to measure the performance of ExecuteScalarExpression as the size of our batches got smaller. We observed two things:
- Something is causing thread contention. We should be able to get pretty close to perfect linear speedup when we are evaluating scalar expressions and each batch fits entirely into L2. We are not seeing that.
- The overhead of ExecuteScalarExpression is too high when processing small batches. Even when the expression is doing real work (e.g. copies, comparisons), the execution time starts to be dominated by overhead with batches of ~10k rows.
Reporter: Weston Pace / @westonpace
Subtasks:
- [C++] Overhead of std::shared_ptr copies is causing thread contention
- [C++] Avoid copying shared_ptr in Expression::type()
- [C++] Avoid slicing array inputs in ExecBatchIterator that would result in one slice
- [C++] Implementation of ExecuteScalarExpressionOverhead benchmarks without Arrow for comparison
Note: This issue was originally created as ARROW-16138. Please see the migration documentation for further details.