-
Notifications
You must be signed in to change notification settings - Fork 23
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
tt_lib.tensor.sum operation with dim=2 failed with low PCC [Grayskull] #7006
Comments
@nemanjagrujic @banekg @jliangTT Reduce op is failing in
PFA |
@umadevimcw is your test case for bfloat8_b? Could you check if 21e0671 fixes it? If it does you can cherry-pick it to your pr to merge and enable the tests for it. |
I merged my commit to main so you can rebase to pick it up instead of cherry-picking |
Sure. Will check it out. |
@tt-aho In the recent changes, the test cases with bf8 and tile layout passed. But the
|
What is the specific test to repro the error? |
@tt-aho |
I don't see an error when changing the test to that shape. Could you rebuild/retry? |
@tt-aho Cloned the repo and compiled it. Now I am not getting the above error |
Closing as the test passes with the updated config |
Describe the bug
ttlib.sum_2 operation breaks with low PCC value error in some test cases. And with BFLOAT8_B in many test cases. In BFLOAT8_B operation fails on both Grayskull and Wormhole cards.
To Reproduce
Steps to reproduce the behavior:
main
branchtest_sum_2.py
using this command:pytest tests/tt_eager/python_api_testing/non_working_unit_tests/grayskull/test_sum_2.py
Expected behavior
There is a test case presented in the unit test
tests/tt_eager/python_api_testing/non_working_unit_tests/grayskull/test_sum_2.py
and it is are expected to fail with low PCC value.Getting Additional info for the operation under test and its behavior
To get additional information and results for different combinations of input shapes, types, layouts and memory configs for which this operation was tested you can also run locally sweeps for tt_lib.tensor.sum and check the results. To do this you should:
Getting Started
page to setup the repo, environment variables andpython-env
source build/python_env/bin/activate
python tests/tt_eager/python_api_testing/sweep_tests/run_pytorch_test.py -i tests/tt_eager/python_api_testing/sweep_tests/test_configs/ci_sweep_tests_broken/grayskull/ttlib_sum_2_test.yaml -o ./result-sweeps
sum_2_sweep.csv
which holds all executed sweeps, among which you can also find the ones that failed and were recreated by the unit test, which you can get by searching uniquedata_seed
field.The text was updated successfully, but these errors were encountered: