Skip to content

Conversation

@aagarwalTT
Copy link
Contributor

@aagarwalTT aagarwalTT commented Apr 10, 2025

Ticket

Link to Github Issue

Problem description

Currently the ops and tests utilize the fabric setup done via sub-devices.

What's changed

  1. Migrated ops and tests to rather rely on the fabric setup during device init and hence use the appropriate APIs to setup connections with the fabric kernels
  2. Cleaned-up tests to get rid of options/vars to setup/teardown fabric
  3. Cleaned up the enable_persistent_fabric_mode arg indicating that the ops will run in persistent fabric mode by default.

Checklist

@@ -0,0 +1,55 @@
// SPDX-FileCopyrightText: © 2025 Tenstorrent Inc.
Copy link
Contributor

@tt-asaigal tt-asaigal Apr 11, 2025

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

For clarity, the functions below are exposed to TTNN CCL ops. They are hence in the API folder.
cc: @pgkeller

Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Further note - these will be deleted soon as a part of merging fabric code-bases and migrating the "1d" portion of that impl to the proper control plane based APIs

@aagarwalTT aagarwalTT added this pull request to the merge queue Apr 16, 2025
github-merge-queue bot pushed a commit that referenced this pull request Apr 16, 2025
### Ticket
[Link to Github
Issue](#19961)

### Problem description
Currently the ops and tests utilize the fabric setup done via
sub-devices.

### What's changed
1. Migrated ops and tests to rather rely on the fabric setup during
device init and hence use the appropriate APIs to setup connections with
the fabric kernels
2. Cleaned-up tests to get rid of options/vars to setup/teardown fabric
3. Cleaned up the `enable_persistent_fabric_mode` arg indicating that
the ops will run in persistent fabric mode by default.

### Checklist
- [x] [All post
commit](https://github.com/tenstorrent/tt-metal/actions/workflows/all-post-commit-workflows.yaml)
CI passes
(https://github.com/tenstorrent/tt-metal/actions/runs/14481011716)
- [ ] [Blackhole Post
commit](https://github.com/tenstorrent/tt-metal/actions/workflows/blackhole-post-commit.yaml)
CI passes (if applicable)
- [ ] [Model
regression](https://github.com/tenstorrent/tt-metal/actions/workflows/perf-models.yaml)
CI passes (if applicable)
- [ ] [Device performance
regression](https://github.com/tenstorrent/tt-metal/actions/workflows/perf-device-models.yaml)
CI passes (if applicable)
- [ ] **(For models and ops writers)** Full [new models
tests](https://github.com/tenstorrent/tt-metal/actions/workflows/full-new-models-suite.yaml)
CI passes (if applicable)
- [ ] New/Existing tests provide coverage for changes
- [ ] [TG Unit
Tests](https://github.com/tenstorrent/tt-metal/actions/workflows/tg-unit-tests.yaml)
(https://github.com/tenstorrent/tt-metal/actions/runs/14447931719)
- [x] [TG
Quick](https://github.com/tenstorrent/tt-metal/actions/workflows/tg-quick-trigger.yaml)
(https://github.com/tenstorrent/tt-metal/actions/runs/14447914998)
- [ ] [TG Demo
Test](https://github.com/tenstorrent/tt-metal/actions/workflows/tg-demo-tests.yaml)
(https://github.com/tenstorrent/tt-metal/actions/runs/14447942813)
- [ ] [TG
Nightly](https://github.com/tenstorrent/tt-metal/actions/workflows/tg-nightly-tests.yaml)
(https://github.com/tenstorrent/tt-metal/actions/runs/14447925563)
- [ ] [TG Frequent
Tests](https://github.com/tenstorrent/tt-metal/actions/workflows/tg-frequent-tests.yaml)
(https://github.com/tenstorrent/tt-metal/actions/runs/14447936614)
- [ ] [TG Model Perf
Tests](https://github.com/tenstorrent/tt-metal/actions/workflows/tg-model-perf-tests.yaml)
(https://github.com/tenstorrent/tt-metal/actions/runs/14447904540)
- [ ] [TG
Stress](https://github.com/tenstorrent/tt-metal/actions/workflows/tg-stress-trigger.yaml)
(https://github.com/tenstorrent/tt-metal/actions/runs/14447919423)
- [ ] [T3K Unit]
(https://github.com/tenstorrent/tt-metal/actions/runs/14409722583)
- [x] [T3K Nightly]
(https://github.com/tenstorrent/tt-metal/actions/runs/14409735747)
- [ ] [T3K Frequent]
(https://github.com/tenstorrent/tt-metal/actions/runs/14409732454)
- [x] [T3K Multiple Pipelines]
(https://github.com/tenstorrent/tt-metal/actions/runs/14481016916)

---------

Co-authored-by: asaigal <asaigal@tenstorrent.com>
@github-merge-queue github-merge-queue bot removed this pull request from the merge queue due to failed status checks Apr 16, 2025
@tt-asaigal tt-asaigal enabled auto-merge April 16, 2025 02:19
@tt-asaigal tt-asaigal added this pull request to the merge queue Apr 16, 2025
@tt-asaigal tt-asaigal removed this pull request from the merge queue due to a manual request Apr 16, 2025
@tt-asaigal tt-asaigal enabled auto-merge April 16, 2025 02:57
@tt-asaigal tt-asaigal added this pull request to the merge queue Apr 16, 2025
Merged via the queue into main with commit 959239f Apr 16, 2025
28 checks passed
@tt-asaigal tt-asaigal deleted the aagarwal/ccl-port branch April 16, 2025 03:47
djordje-tt added a commit that referenced this pull request Apr 16, 2025
johanna-rock-tt added a commit that referenced this pull request Apr 16, 2025
tt-asaigal added a commit that referenced this pull request Apr 16, 2025
tt-asaigal added a commit that referenced this pull request Apr 16, 2025
roseli-TT pushed a commit that referenced this pull request Apr 16, 2025
This reverts commit 959239f.

Changes getting reverted
[here](2665505)
(due to broken APC) make this commit non-functional on a 6U.

The 6U changes need to go in first.

### Ticket
Link to Github Issue

### Problem description
Provide context for the problem.

### What's changed
Describe the approach used to solve the problem.
Summarize the changes made and its impact.

### Checklist
- [ ] [All post
commit](https://github.com/tenstorrent/tt-metal/actions/workflows/all-post-commit-workflows.yaml)
CI passes
- [ ] [Blackhole Post
commit](https://github.com/tenstorrent/tt-metal/actions/workflows/blackhole-post-commit.yaml)
CI passes (if applicable)
- [ ] [Model
regression](https://github.com/tenstorrent/tt-metal/actions/workflows/perf-models.yaml)
CI passes (if applicable)
- [ ] [Device performance
regression](https://github.com/tenstorrent/tt-metal/actions/workflows/perf-device-models.yaml)
CI passes (if applicable)
- [ ] **(For models and ops writers)** Full [new models
tests](https://github.com/tenstorrent/tt-metal/actions/workflows/full-new-models-suite.yaml)
CI passes (if applicable)
- [ ] New/Existing tests provide coverage for changes
johanna-rock-tt added a commit that referenced this pull request Apr 17, 2025
ssinghalTT pushed a commit that referenced this pull request Apr 23, 2025
### Ticket
[Link to Github
Issue](#19961)

### Problem description
Currently the ops and tests utilize the fabric setup done via
sub-devices.

### What's changed
1. Migrated ops and tests to rather rely on the fabric setup during
device init and hence use the appropriate APIs to setup connections with
the fabric kernels
2. Cleaned-up tests to get rid of options/vars to setup/teardown fabric
3. Cleaned up the `enable_persistent_fabric_mode` arg indicating that
the ops will run in persistent fabric mode by default.

### Checklist
- [x] [All post
commit](https://github.com/tenstorrent/tt-metal/actions/workflows/all-post-commit-workflows.yaml)
CI passes
(https://github.com/tenstorrent/tt-metal/actions/runs/14481011716)
- [ ] [Blackhole Post
commit](https://github.com/tenstorrent/tt-metal/actions/workflows/blackhole-post-commit.yaml)
CI passes (if applicable)
- [ ] [Model
regression](https://github.com/tenstorrent/tt-metal/actions/workflows/perf-models.yaml)
CI passes (if applicable)
- [ ] [Device performance
regression](https://github.com/tenstorrent/tt-metal/actions/workflows/perf-device-models.yaml)
CI passes (if applicable)
- [ ] **(For models and ops writers)** Full [new models
tests](https://github.com/tenstorrent/tt-metal/actions/workflows/full-new-models-suite.yaml)
CI passes (if applicable)
- [ ] New/Existing tests provide coverage for changes
- [ ] [TG Unit
Tests](https://github.com/tenstorrent/tt-metal/actions/workflows/tg-unit-tests.yaml)
(https://github.com/tenstorrent/tt-metal/actions/runs/14447931719)
- [x] [TG
Quick](https://github.com/tenstorrent/tt-metal/actions/workflows/tg-quick-trigger.yaml)
(https://github.com/tenstorrent/tt-metal/actions/runs/14447914998)
- [ ] [TG Demo
Test](https://github.com/tenstorrent/tt-metal/actions/workflows/tg-demo-tests.yaml)
(https://github.com/tenstorrent/tt-metal/actions/runs/14447942813)
- [ ] [TG
Nightly](https://github.com/tenstorrent/tt-metal/actions/workflows/tg-nightly-tests.yaml)
(https://github.com/tenstorrent/tt-metal/actions/runs/14447925563)
- [ ] [TG Frequent
Tests](https://github.com/tenstorrent/tt-metal/actions/workflows/tg-frequent-tests.yaml)
(https://github.com/tenstorrent/tt-metal/actions/runs/14447936614)
- [ ] [TG Model Perf
Tests](https://github.com/tenstorrent/tt-metal/actions/workflows/tg-model-perf-tests.yaml)
(https://github.com/tenstorrent/tt-metal/actions/runs/14447904540)
- [ ] [TG
Stress](https://github.com/tenstorrent/tt-metal/actions/workflows/tg-stress-trigger.yaml)
(https://github.com/tenstorrent/tt-metal/actions/runs/14447919423)
- [ ] [T3K Unit]
(https://github.com/tenstorrent/tt-metal/actions/runs/14409722583)
- [x] [T3K Nightly]
(https://github.com/tenstorrent/tt-metal/actions/runs/14409735747)
- [ ] [T3K Frequent]
(https://github.com/tenstorrent/tt-metal/actions/runs/14409732454)
- [x] [T3K Multiple Pipelines]
(https://github.com/tenstorrent/tt-metal/actions/runs/14481016916)

---------

Co-authored-by: asaigal <asaigal@tenstorrent.com>
ssinghalTT pushed a commit that referenced this pull request Apr 23, 2025
This reverts commit 959239f.

Changes getting reverted
[here](2665505)
(due to broken APC) make this commit non-functional on a 6U.

The 6U changes need to go in first.

### Ticket
Link to Github Issue

### Problem description
Provide context for the problem.

### What's changed
Describe the approach used to solve the problem.
Summarize the changes made and its impact.

### Checklist
- [ ] [All post
commit](https://github.com/tenstorrent/tt-metal/actions/workflows/all-post-commit-workflows.yaml)
CI passes
- [ ] [Blackhole Post
commit](https://github.com/tenstorrent/tt-metal/actions/workflows/blackhole-post-commit.yaml)
CI passes (if applicable)
- [ ] [Model
regression](https://github.com/tenstorrent/tt-metal/actions/workflows/perf-models.yaml)
CI passes (if applicable)
- [ ] [Device performance
regression](https://github.com/tenstorrent/tt-metal/actions/workflows/perf-device-models.yaml)
CI passes (if applicable)
- [ ] **(For models and ops writers)** Full [new models
tests](https://github.com/tenstorrent/tt-metal/actions/workflows/full-new-models-suite.yaml)
CI passes (if applicable)
- [ ] New/Existing tests provide coverage for changes
djordje-tt pushed a commit that referenced this pull request Apr 29, 2025
This reverts commit 959239f.

Changes getting reverted
[here](2665505)
(due to broken APC) make this commit non-functional on a 6U.

The 6U changes need to go in first.

### Ticket
Link to Github Issue

### Problem description
Provide context for the problem.

### What's changed
Describe the approach used to solve the problem.
Summarize the changes made and its impact.

### Checklist
- [ ] [All post
commit](https://github.com/tenstorrent/tt-metal/actions/workflows/all-post-commit-workflows.yaml)
CI passes
- [ ] [Blackhole Post
commit](https://github.com/tenstorrent/tt-metal/actions/workflows/blackhole-post-commit.yaml)
CI passes (if applicable)
- [ ] [Model
regression](https://github.com/tenstorrent/tt-metal/actions/workflows/perf-models.yaml)
CI passes (if applicable)
- [ ] [Device performance
regression](https://github.com/tenstorrent/tt-metal/actions/workflows/perf-device-models.yaml)
CI passes (if applicable)
- [ ] **(For models and ops writers)** Full [new models
tests](https://github.com/tenstorrent/tt-metal/actions/workflows/full-new-models-suite.yaml)
CI passes (if applicable)
- [ ] New/Existing tests provide coverage for changes
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.