Conversation
ea4601c to
922f2c2
Compare
tests/tt_metal/microbenchmarks/ethernet/test_fabric_edm_bandwidth.py
Outdated
Show resolved
Hide resolved
SeanNijjar
left a comment
There was a problem hiding this comment.
Overall looks good - minor suggestion
tt_metal/hw/inc/dataflow_api.h
Outdated
There was a problem hiding this comment.
I think we should take the write_cmd_buf here so this can be useable by the write to local noc and the forward to next EDM paths (I believe those use different cmd bufs atm?)
There was a problem hiding this comment.
you mean expose the cmd_buf as input arg?
forward to next edm is not using this api (use the stateful one).
There was a problem hiding this comment.
yes that's what I meant. And good point, I forgot about that detail.
There was a problem hiding this comment.
Just wanted to highlight that on BH we only do inline writes to stream registers. Due to HW bug inline writes to L1 are emulated by first writing the value into src L1 and then doing a Noc async write
update fabric perf tests to use IRAM by defaults and update targets accordingly. Note this has additional changes which are from this pr: #19440 I have built this branch on top because that will be merged soon and it updates the targets Additionally, several additional unicast tests were added (and a small performance bug in the test kernel resolved). New numbers are as follows (for 4k packet size): | Test Case | Bandwidth GB/s/dir | Comment | |-|-|-| |  | 11.03 | This was the old unicast test but performance has improved due to fix in worker kernel, unicast 1 hop | |  | 9.74 | Unicast, multiple producers/dir, multiple hops, bidir | |  | 9.35 | Currently unclear why this performs worse than above. There should be enough buffering for latency hiding but maybe not | Unicast, single producer/dir, multiple hops, bidir | ### Checklist - [x] [All post commit](https://github.com/tenstorrent/tt-metal/actions/workflows/all-post-commit-workflows.yaml) CI: https://github.com/tenstorrent/tt-metal/actions/runs/14010116751 - [x] New/Existing tests provide coverage for changes
Use stateful API in fabric worker sync path to save some cycles, also remove sw counter update in noc txn.
some perf numbers
no IRAM

with IRAM

Checklist