I'm using release 1.3.0 with the HIP backend and the PMT observer. In the output, every entry in "times" carries an extra zero, e.g.:
"32,16,1,256,256,1": {"block_size_x": 32, "block_size_y": 16, "block_size_z": 1, "M_PER_BLOCK": 256, "N_PER_BLOCK": 256, "NBUFFER": 1, "amdsmi_energy": 56.82277934357711, "amdsmi_power": 256.77714592177557, "time": 221.29219927106584, "times": [[0, 443.0099792480469], [0, 443.59039306640625], [0, 442.9225158691406], [0, 441.43218994140625], [0, 442.8607177734375], [0, 442.8009948730469], [0, 441.4739990234375]], "compile_time": 1048.8195329999144, "verification_time": 0, "benchmark_time": 5309.674527001334, "TFLOPS": 19.874385656571352, "N_PER_WARP": 16, "M_PER_WARP": 256, "Watt": 256.77714592177557, "strategy_time": 0, "framework_time": 0.21460700008901767, "timestamp": "2025-09-30 11:13:25.134561+00:00"}
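The reported "time" (221.29) is the mean over all values in "times", zeros included, so it is half the mean of the actual measurements (about 442.58). This makes all performance measurements wrong; luckily they are all off by the same factor of 2, so they can be corrected.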
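As a stopgap, the factor can be checked and corrected from the record itself. Below is a minimal sketch, assuming the record layout shown above, where each entry in "times" is a `[0, t]` pair; `flatten` and `corrected_time` are hypothetical helpers written for this illustration and are not part of Kernel Tuner.

```python
from statistics import mean

# "time" and "times" copied from the record above.
record = {
    "time": 221.29219927106584,
    "times": [
        [0, 443.0099792480469],
        [0, 443.59039306640625],
        [0, 442.9225158691406],
        [0, 441.43218994140625],
        [0, 442.8607177734375],
        [0, 442.8009948730469],
        [0, 441.4739990234375],
    ],
}


def flatten(times):
    """Flatten the [0, t] pairs into one list of values, zeros included."""
    return [value for pair in times for value in pair]


def corrected_time(times):
    """Mean of the second element of each pair, skipping the leading zeros."""
    return mean(t for _, t in times)


# Mean over everything, zeros included: this reproduces the reported "time".
flat_mean = mean(flatten(record["times"]))

# Mean over the actual measurements only.
fixed = corrected_time(record["times"])

print(f"reported: {record['time']:.4f}")
print(f"mean incl. zeros: {flat_mean:.4f}")
print(f"corrected: {fixed:.4f} (ratio {fixed / record['time']:.4f})")
```

The same correction applies to anything derived from "time"; this only patches the output after the fact and does not address where the zeros come from.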
I haven't checked where this is coming from; perhaps it is something specific to the ContinuousObserver used by the PMTObserver?