[Graph][Benchmark] Update benchmark function #363

Aalanli · 2023-10-10T16:41:11Z

The old benchmarking function did not clear the l2 cache, so repeated runs are biased.
This is especially prevalent in tuning for parallel-k parts, which always selects k_parts=1 due to l2 cache hits, even when it is not the fastest implementation.

yaoyaoding · 2023-10-10T18:12:09Z

Hi Allan,

The PR looks good to me. But before merging, it is better to have some demos on the improvement of the accuracy on the selection of parallel k when we clear the L2 cache.

Aalanli · 2023-10-11T15:17:14Z

After some further investigation, it appears that the clearing of the l2 cache is not the greatest contributor, but the usage of torch.cuda.Event, which I assume to be more accurate than time.time().
Here is my benchmarking script for reference: https://gist.github.com/Aalanli/b81d1a751a78ea72b491d872aa993f9e

Aalanli · 2023-10-11T15:19:08Z

orig-latency is the original benchmark function
new-latency is the benchmark function used by this pr
orig-latency-with-event is a benchmark function that uses torch.cuda.Event, but does not clear l2 cache.

Aalanli · 2023-10-11T16:24:45Z

I just removed the torch dependencies.

python/hidet/cuda/device.py

Allan Lin and others added 9 commits July 22, 2023 12:55

add new test

b2223e8

Merge branch 'hidet-org:main' into main

72b64fd

Merge branch 'hidet-org:main' into main

77ee2c1

fix test

c9061ac

Merge branch 'main' of https://github.com/Aalanli/hidet

e014161

Merge branch 'main' of https://github.com/Aalanli/hidet

7663a2d

Merge branch 'hidet-org:main' into main

a81233e

Merge branch 'main' of https://github.com/Aalanli/hidet into main

b643413

remove old benchmarking function

ac72cda

remove torch reliance

70d14ff

yaoyaoding approved these changes Oct 11, 2023

View reviewed changes

python/hidet/cuda/device.py Outdated Show resolved Hide resolved

use event class

495ca95

Aalanli merged commit 82ddb8c into hidet-org:main Oct 12, 2023
2 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Graph][Benchmark] Update benchmark function #363

[Graph][Benchmark] Update benchmark function #363

Aalanli commented Oct 10, 2023

yaoyaoding commented Oct 10, 2023

Aalanli commented Oct 11, 2023

Aalanli commented Oct 11, 2023

Aalanli commented Oct 11, 2023

[Graph][Benchmark] Update benchmark function #363

[Graph][Benchmark] Update benchmark function #363

Conversation

Aalanli commented Oct 10, 2023

yaoyaoding commented Oct 10, 2023

Aalanli commented Oct 11, 2023

Aalanli commented Oct 11, 2023

Aalanli commented Oct 11, 2023