Description
The benchmark results feel increasingly messy due to a number of issues in how the benchmark is organized and how the results are presented.
- As the title says, Cold Run results make no sense with in-memory and remote storage, so ideally they should not be shown on the generated HTML page.
- The benchmark rules mention the following:

  > run.sh: a loop for running the queries; every query is run three times; if it's a database with local on-disk storage, the first query should be run after dropping the page cache;

  This rule is not applied to some of the contestants that run on local disk, such as Arc, which produces wrong results in the Combined and Cold Run categories. For instance, Arc uses DuckDB as its query engine: it outperforms DuckDB+Parquet in Cold Run by 3.08x, yet it is slower than DuckDB in Hot Run by 2.7x - that's quite puzzling. Moreover, many of Arc's Cold Run query execution times are worse than the corresponding Hot Run times.
- The benchmark rules should specify the disk/volume type and its settings. The machine field used to say gp2 500GB in the text - now it doesn't, so it looks like contestants can use any volume type, e.g. io2, which makes the Cold Run results questionable.
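For reference, the run.sh rule quoted above can be sketched as a minimal loop. This is an assumed structure, not any contestant's actual script: `run_query` is a hypothetical stub, and the page-cache drop only takes effect when running as root on a local-disk setup.

```shell
#!/bin/bash
# Minimal sketch of the run.sh loop described by the rule; the real
# per-contestant scripts vary. run_query below is a hypothetical stub.
TRIES=3  # every query is run three times

run_benchmark() {
    while read -r query; do
        # For local on-disk storage, drop the OS page cache before the
        # first (cold) run so it actually reads from disk. Needs root;
        # skipping it (as Arc appears to) makes Cold Run times invalid.
        if [ "$(id -u)" -eq 0 ]; then
            sync
            echo 3 > /proc/sys/vm/drop_caches
        fi
        for _ in $(seq 1 "$TRIES"); do
            run_query "$query"
        done
    done
}

# Hypothetical runner; a real harness would time the engine here.
run_query() { echo "ran: $1"; }

printf 'SELECT 1\nSELECT 2\n' | run_benchmark
```

With two input queries, the loop executes each of them three times; only the first of the three runs per query is "cold" when the cache drop is actually performed.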