runtime_context_cache_pass breaks accuracy of FP32 C-API inference of ResNet50 and MobileNet-v1 #16609
Comments
Could you tell me how to reproduce it? Could you paste the exact command here?
It is already reproduced on CI (http://ci.paddlepaddle.org/viewLog.html?buildId=79052&buildTypeId=Paddle_PrCi) in the PR_CI (Paddle) build for #16559. Otherwise, build #16559 and run the test
or
I have modified the test so it runs 2 iterations. With the
I have reproduced it and am debugging it now.
Do you have any observation why with
https://github.com/PaddlePaddle/Paddle/compare/develop...luotao1:int8_cache?expand=1
Some observations:
The runtime_context_cache_pass breaks the accuracy of inference of (at least) ResNet50 and MobileNet-v1 if the inference requires more than 1 iteration. INT8 inference works fine.
Can be verified using the test in PR #16559.
With a single iteration, the test passes. With multiple iterations, the test fails.
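The symptom described above (one iteration passes, several fail) can be checked by comparing every iteration's output against a reference, not just the first. Below is a minimal Python sketch of such a check. Everything in it is hypothetical and independent of Paddle: `first_divergent_iteration`, `reference_model` and `StaleCachePredictor` are illustrative names, and the toy predictor's stale state only simulates a failure with the same shape. It is not a diagnosis of what runtime_context_cache_pass actually does.

```python
from typing import Callable, List, Optional, Sequence


def first_divergent_iteration(
    run_once: Callable[[Sequence[float]], List[float]],
    batches: Sequence[Sequence[float]],
    reference: Callable[[Sequence[float]], List[float]],
    tol: float = 1e-6,
) -> Optional[int]:
    """Return the index of the first iteration whose output differs
    from the reference by more than tol, or None if all match."""
    for i, batch in enumerate(batches):
        out = run_once(batch)
        expected = reference(batch)
        if any(abs(a - b) > tol for a, b in zip(out, expected)):
            return i
    return None


def reference_model(batch: Sequence[float]) -> List[float]:
    """Hypothetical reference: the expected output for a batch."""
    return [2.0 * x for x in batch]


class StaleCachePredictor:
    """Hypothetical predictor that wrongly reuses state captured on
    its first call (an assumed failure mode, for illustration only)."""

    def __init__(self) -> None:
        self._cached: Optional[List[float]] = None

    def run(self, batch: Sequence[float]) -> List[float]:
        if self._cached is None:
            self._cached = list(batch)  # captured once, never refreshed
        return [2.0 * x for x in self._cached]


batches = [[1.0, 2.0], [3.0, 4.0]]

# A single iteration passes: no divergence is found.
print(first_divergent_iteration(StaleCachePredictor().run, batches[:1], reference_model))

# Two iterations fail: the second one (index 1) diverges.
print(first_divergent_iteration(StaleCachePredictor().run, batches, reference_model))
```

Reporting the first divergent iteration, instead of a single pass/fail result, makes it easier to tell a bug that only appears after the first run from one that affects every run.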