
Different result on GPU and CPU #12027

Closed
bahar3474 opened this issue Apr 29, 2024 · 3 comments

@bahar3474 commented Apr 29, 2024

Hello everyone,

I'm currently facing an issue and would greatly appreciate any assistance you can offer.

I have a paddle model that I'm serving through a Docker image based on version 2.5.1 of the paddlepaddle/paddle image. On one workstation, it works well with the 'use_gpu' attribute set to True or False. However, on another workstation, the outputs of the model are incorrect when it runs on GPU. I have attached the results of the model in both situations.
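For reference, here is a minimal sketch of how a setup like this typically toggles the flag, assuming the model is driven through PaddleOCR's Python API; the image path and language are illustrative placeholders, not taken from the original report:

```python
# Minimal sketch: same pipeline run on CPU and GPU via PaddleOCR's Python API.
# "sample.jpg" and lang="en" are placeholders for illustration only.
from paddleocr import PaddleOCR

img_path = "sample.jpg"

# CPU pipeline
ocr_cpu = PaddleOCR(lang="en", use_gpu=False)
cpu_result = ocr_cpu.ocr(img_path)

# GPU pipeline -- per the report, only this path produces wrong detections
ocr_gpu = PaddleOCR(lang="en", use_gpu=True)
gpu_result = ocr_gpu.ocr(img_path)

print("CPU detections:", len(cpu_result[0]))
print("GPU detections:", len(gpu_result[0]))
```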

CPU result: (screenshot attached)

GPU result: (screenshot attached)
It appears that the computation of the model is incorrect when it's running on GPU. It's important to note that I'm not encountering any errors or warnings, just inaccurate results.
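As a quick sanity check (not part of the original report), a small script like the one below can show whether even basic GPU kernels diverge from the CPU on the affected workstation; the tolerance mentioned in the comment is an assumption:

```python
# Sanity check: compare a plain float32 matmul on CPU vs. GPU.
# A large difference here would point at the GPU/CUDA setup rather than the model itself.
import numpy as np
import paddle

x = np.random.rand(512, 512).astype("float32")

paddle.set_device("cpu")
cpu_out = paddle.matmul(paddle.to_tensor(x), paddle.to_tensor(x)).numpy()

paddle.set_device("gpu")
gpu_out = paddle.matmul(paddle.to_tensor(x), paddle.to_tensor(x)).numpy()

# float32 matmuls on CPU and GPU should agree to roughly 1e-4;
# anything far larger suggests a broken GPU/CUDA environment.
print("max abs diff:", np.abs(cpu_out - gpu_out).max())
```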

Some additional context:

  • This issue did not occur with version 2.4 of the Paddle library, but I need to upgrade my Paddle version.
  • I have attempted to make the environments of the two workstations as similar as possible, but because they have different GPUs (3090 Ti and Tesla P40), achieving complete parity is not feasible. And since everything runs in Docker, I'm not sure how much the host can affect the model's results inside the container.
  • The images above show the result of the text detection model, but I've had the same experience with the text recognition and layout models as well.

What could be the root cause of this inconsistency?

Thank you in advance for any insights or suggestions you can provide.

@tink2123 (Collaborator)

You can try upgrading the Paddle version to 2.5.2.

@bahar3474 (Author)

Unfortunately, updating the Paddle version to 2.5.2 doesn't fix my problem. I've added a more detailed description of the problem to the issue that Vvsmile mentioned.

@tink2123 (Collaborator) commented May 7, 2024

It's indeed very strange. Maybe the CUDA version in the Docker image does not match the installed Paddle package.
One solution is to run GPU inference through the official image; refer to: https://www.paddlepaddle.org.cn/en

For example, if your environment is CUDA 11.7, please use the following commands:

nvidia-docker pull registry.baidubce.com/paddlepaddle/paddle:2.6.1-gpu-cuda11.7-cudnn8.4-trt8.4

nvidia-docker run --name paddle -it -v $PWD:/paddle registry.baidubce.com/paddlepaddle/paddle:2.6.1-gpu-cuda11.7-cudnn8.4-trt8.4 /bin/bash
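To confirm whether a CUDA/cuDNN mismatch is really the culprit, a short check can be run inside the container; this is a generic sketch, not something taken from the original thread:

```python
# Report the CUDA/cuDNN versions Paddle was built against and run Paddle's
# built-in installation check; mismatches with the container's runtime
# libraries are a common cause of silently wrong GPU results.
import paddle

print("compiled with CUDA:", paddle.device.is_compiled_with_cuda())
print("CUDA version (build):", paddle.version.cuda())
print("cuDNN version (build):", paddle.version.cudnn())

# Runs a small end-to-end check and reports whether the GPU setup works.
paddle.utils.run_check()
```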
