perf: faster execution ctx and no opt #7597
Signed-off-by: Joe Isaacs <joe.isaacs@live.co.uk>
Merging this PR will degrade performance by 15.29%

| | Mode | Benchmark | BASE | HEAD | Efficiency |
|---|---|---|---|---|---|
| ⚡ | Simulation | chunked_bool_canonical_into[(10, 1000)] | 948.1 µs | 779.1 µs | +21.69% |
| ⚡ | Simulation | chunked_bool_canonical_into[(100, 100)] | 118.3 µs | 100.1 µs | +18.23% |
| ⚡ | Simulation | chunked_opt_bool_canonical_into[(10, 1000)] | 1,142.9 µs | 915 µs | +24.91% |
| ⚡ | Simulation | chunked_opt_bool_canonical_into[(1000, 10)] | 62.4 µs | 55.9 µs | +11.51% |
| ⚡ | Simulation | chunked_opt_bool_into_canonical[(1000, 10)] | 70 µs | 61.9 µs | +12.98% |
| ⚡ | Simulation | chunked_opt_bool_into_canonical[(10, 1000)] | 1.3 ms | 1 ms | +32.88% |
| ⚡ | Simulation | chunked_opt_bool_into_canonical[(100, 100)] | 277 µs | 220.3 µs | +25.77% |
| ⚡ | Simulation | chunked_varbinview_opt_into_canonical[(100, 100)] | 563.3 µs | 503.7 µs | +11.84% |
| ⚡ | Simulation | chunked_varbinview_opt_canonical_into[(10, 1000)] | 2.8 ms | 2.3 ms | +19.57% |
| ⚡ | Simulation | chunked_varbinview_opt_into_canonical[(10, 1000)] | 3 ms | 2.4 ms | +21.5% |
| ⚡ | Simulation | chunked_varbinview_into_canonical[(10, 1000)] | 1.9 ms | 1.7 ms | +13.64% |
| ⚡ | Simulation | chunked_varbinview_opt_canonical_into[(100, 100)] | 544.1 µs | 490 µs | +11.05% |
| ⚡ | Simulation | chunked_opt_bool_canonical_into[(100, 100)] | 252.1 µs | 206.1 µs | +22.35% |
| ❌ | Simulation | old_bp_prim_test_between[i64, 16384] | 176.4 µs | 202.8 µs | -13.01% |
| ⚡ | Simulation | new_bp_prim_test_between[i16, 32768] | 134.5 µs | 121.4 µs | +10.77% |
| ❌ | Simulation | old_bp_prim_test_between[i64, 32768] | 252.5 µs | 298.1 µs | -15.29% |
| ⚡ | Simulation | decompress[u8, (1000, 16)] | 19 µs | 16.7 µs | +13.87% |
| ⚡ | Simulation | decompress[u16, (1000, 256)] | 18 µs | 16.3 µs | +10.38% |
| ⚡ | Simulation | decompress[u8, (1000, 256)] | 16.6 µs | 15 µs | +10.73% |
| ⚡ | Simulation | null_count_run_end[(10000, 256, 0.01)] | 4.7 µs | 4.2 µs | +11.59% |
| ... | ... | ... | ... | ... | ... |

ℹ️ Only the first 20 benchmarks are displayed. Go to the app to view all benchmarks.
Comparing ji/faster-execute (4dc7602) with develop (bfb5dba)
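For anyone reading the Efficiency column: the sketch below recomputes it from the BASE and HEAD times. It assumes the column is `(BASE - HEAD) / HEAD`, which is not stated in the report but matches the displayed rows up to rounding. The function name `efficiency_pct` is made up for this illustration. Rows whose times are shown rounded to whole milliseconds cannot be reproduced accurately from the displayed values.

```python
def efficiency_pct(base_us: float, head_us: float) -> float:
    """Relative change of HEAD against BASE in percent.

    Assumed formula: (BASE - HEAD) / HEAD. Positive means HEAD is faster,
    negative means HEAD is slower (a regression).
    """
    return (base_us - head_us) / head_us * 100.0


# Times in microseconds, copied from two rows of the table above.
rows = {
    "chunked_bool_canonical_into[(10, 1000)]": (948.1, 779.1),
    "old_bp_prim_test_between[i64, 32768]": (252.5, 298.1),
}

for name, (base, head) in rows.items():
    print(f"{name}: {efficiency_pct(base, head):+.2f}%")
```

Under this assumption the headline figure of 15.29% corresponds to the worst single regression in the table (`old_bp_prim_test_between[i64, 32768]`), not to an average over all benchmarks.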
Benchmarks: PolarSignals Profiling
Vortex (geomean): 1.038x ➖
datafusion / vortex-file-compressed (1.038x ➖, 0↑ 2↓)
File Sizes: PolarSignals Profiling
No file size changes detected.
Benchmarks: FineWeb NVMe
Verdict: No clear signal (low confidence)
datafusion / vortex-file-compressed (1.028x ➖, 0↑ 1↓)
datafusion / vortex-compact (1.032x ➖, 0↑ 0↓)
datafusion / parquet (1.023x ➖, 0↑ 0↓)
duckdb / vortex-file-compressed (0.992x ➖, 1↑ 0↓)
duckdb / vortex-compact (0.989x ➖, 1↑ 0↓)
duckdb / parquet (1.037x ➖, 0↑ 1↓)
File Sizes: FineWeb NVMe
No file size changes detected.
Benchmarks: TPC-H SF=1 on NVME
Verdict: No clear signal (low confidence)
datafusion / vortex-file-compressed (1.127x ❌, 0↑ 15↓)
datafusion / vortex-compact (1.088x ➖, 0↑ 7↓)
datafusion / parquet (1.068x ➖, 1↑ 7↓)
datafusion / arrow (1.064x ➖, 0↑ 4↓)
duckdb / vortex-file-compressed (1.092x ➖, 0↑ 7↓)
duckdb / vortex-compact (1.081x ➖, 0↑ 6↓)
duckdb / parquet (1.035x ➖, 0↑ 1↓)
duckdb / duckdb (1.074x ➖, 0↑ 4↓)
File Sizes: TPC-H SF=1 on NVME
No file size changes detected.
Benchmarks: TPC-DS SF=1 on NVME
Verdict: No clear signal (low confidence)
datafusion / vortex-file-compressed (0.966x ➖, 2↑ 0↓)
datafusion / vortex-compact (0.982x ➖, 1↑ 0↓)
datafusion / parquet (0.973x ➖, 1↑ 1↓)
duckdb / vortex-file-compressed (0.982x ➖, 3↑ 2↓)
duckdb / vortex-compact (0.978x ➖, 4↑ 0↓)
duckdb / parquet (0.991x ➖, 2↑ 1↓)
duckdb / duckdb (0.984x ➖, 2↑ 1↓)
File Sizes: TPC-DS SF=1 on NVME
No file size changes detected.
Benchmarks: FineWeb S3
Verdict: No clear signal (environment too noisy confidence)
datafusion / vortex-file-compressed (1.199x ➖, 0↑ 2↓)
datafusion / vortex-compact (1.151x ➖, 0↑ 2↓)
datafusion / parquet (1.088x ➖, 0↑ 1↓)
duckdb / vortex-file-compressed (1.225x ➖, 0↑ 4↓)
duckdb / vortex-compact (1.039x ➖, 1↑ 0↓)
duckdb / parquet (1.083x ➖, 0↑ 0↓)
Benchmarks: Random Access
Vortex (geomean): 0.885x ✅
unknown / unknown (0.909x ➖, 19↑ 0↓)
Benchmarks: TPC-H SF=10 on NVME
Verdict: No clear signal (low confidence)
datafusion / vortex-file-compressed (0.949x ➖, 0↑ 0↓)
datafusion / vortex-compact (0.953x ➖, 0↑ 0↓)
datafusion / parquet (0.955x ➖, 0↑ 0↓)
datafusion / arrow (0.944x ➖, 0↑ 0↓)
duckdb / vortex-file-compressed (0.954x ➖, 0↑ 0↓)
duckdb / vortex-compact (0.964x ➖, 0↑ 0↓)
duckdb / parquet (0.978x ➖, 1↑ 0↓)
duckdb / duckdb (0.973x ➖, 0↑ 0↓)
File Sizes: TPC-H SF=10 on NVME
No file size changes detected.
Benchmarks: Statistical and Population Genetics
Verdict: No clear signal (low confidence)
duckdb / vortex-file-compressed (0.991x ➖, 1↑ 0↓)
duckdb / vortex-compact (1.017x ➖, 0↑ 0↓)
duckdb / parquet (1.016x ➖, 0↑ 0↓)
File Sizes: Statistical and Population Genetics
No file size changes detected.
Benchmarks: Clickbench on NVME
Verdict: No clear signal (low confidence)
datafusion / vortex-file-compressed (1.008x ➖, 2↑ 1↓)
datafusion / parquet (1.002x ➖, 0↑ 0↓)
duckdb / vortex-file-compressed (1.031x ➖, 0↑ 3↓)
duckdb / parquet (1.000x ➖, 0↑ 0↓)
duckdb / duckdb (1.005x ➖, 0↑ 1↓)
File Sizes: Clickbench on NVME
File Size Changes (1 files changed, -0.0% overall, 0↑ 1↓)
Totals:
Benchmarks: TPC-H SF=1 on S3
Verdict: No clear signal (environment too noisy confidence)
datafusion / vortex-file-compressed (1.053x ➖, 0↑ 4↓)
datafusion / vortex-compact (1.201x ➖, 1↑ 9↓)
datafusion / parquet (1.304x ❌, 1↑ 10↓)
duckdb / vortex-file-compressed (1.045x ➖, 0↑ 1↓)
duckdb / vortex-compact (1.056x ➖, 0↑ 3↓)
duckdb / parquet (1.086x ➖, 0↑ 0↓)
Benchmarks: Compression
Vortex (geomean): 0.992x ➖
unknown / unknown (0.967x ➖, 13↑ 1↓)
Benchmarks: TPC-H SF=10 on S3
Verdict: No clear signal (environment too noisy confidence)
datafusion / vortex-file-compressed (1.336x ❌, 0↑ 13↓)
datafusion / vortex-compact (1.201x ➖, 0↑ 8↓)
datafusion / parquet (1.139x ➖, 0↑ 4↓)
duckdb / vortex-file-compressed (1.116x ➖, 0↑ 4↓)
duckdb / vortex-compact (1.045x ➖, 0↑ 1↓)
duckdb / parquet (1.061x ➖, 0↑ 1↓)
Makes the `ExecutionCtx` faster to create and destroy.
Removes the unconditional optimize from execute.