-
Notifications
You must be signed in to change notification settings - Fork 5.5k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
random failure in processing seqpool_concat_fuse_pass when doing stress test for test_analyzer_small_dam #16586
Comments
请用```或者`来标注代码块,issue格式略乱。 |
How to confirm failed inside seqpool_concat_fuse_pass? Do you have trackback? |
can't confirm, just from log analysis, it doesn't enter next fuse pass processing. |
I still did not reproduce it with both release and debug build. Does it related with #16688 ? |
quit hard to reproduce, and not sure if it is related with #16688 |
I think it is due to some system issues, I ran 3 processes to do same test simultaneously, and these tests hit same failure almost within closed 1 second.
|
I ran 10000 times in both release and debug model with command |
my cmd is "ctest -R test_analyzer_small_dam -V" in a batch run script, it needs a test suite sequence instead of just one case, otherwise some failures can't reproduce. |
Is it possible related with some env exports? |
Since |
OK for me, we can only enable But the problem is still there as @LeoZhao-Intel said |
I suspect this is a kind of system issue, and I see it impacts all test processes in same time not just one, and reproduce rate is much much low, I suggest we can keep here to monitor later. |
Attach more tests running all night, |
what's seqpool_concat_fuse used for? From log, seems this pass takes much time on graph pattern detection. |
Paddle/paddle/fluid/framework/ir/seqpool_concat_fuse_pass.h Lines 26 to 39 in 1c8b34d
Because pattern is very large and support at most 200 inputs. |
Thus, could you create a PR to disable |
Yes, WIP. |
System information
-PaddlePaddle version: develop branch
-CPU: CPUMKL ON/OFF
-GPU: No
-OS Platform: Linux
-Python version: N/A
-Cmake orders
-C++version.txt
-API information
To Reproduce
Run "ctest -R test_analyzer_small_dam -V" for more than 5000 tests
Describe your current behavior
Random failure in seqpool_concat_fuse_pass processing in both builds with cmake option -DWITH_MKL=ON/OFF
Code to reproduce the issue
Other info / logs
The text was updated successfully, but these errors were encountered: