
Incorrect number of batches when multiple test loaders are used and test_percent_check is specified #1899

Closed
binshengliu opened this issue May 20, 2020 · 5 comments · Fixed by #1920 or #2226

@binshengliu commented May 20, 2020

🐛 Bug

When there are multiple test dataloaders and test_percent_check is specified, the estimated total number of batches is incorrect and the progress bar doesn't display properly.

For example, when I specify two dataloaders of 100 batches each and test_percent_check=0.1, the expected total is 200 * 0.1 = 20 batches, but 40 batches are actually run.

At this line, num_batches is the global number of batches and will be assigned to self.num_test_batches. https://github.com/PyTorchLightning/pytorch-lightning/blob/3459a546672303204a4ae6efcc2613a90f003903/pytorch_lightning/trainer/data_loading.py#L243

In the evaluation loop, however, max_batches is treated as the number of batches for a single dataloader.
https://github.com/PyTorchLightning/pytorch-lightning/blob/3459a546672303204a4ae6efcc2613a90f003903/pytorch_lightning/trainer/evaluation_loop.py#L262
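
Here is a small standalone sketch of the mismatch (illustrative only; the variable names mirror the ones above, but this is not the actual Lightning source). With two loaders of 100 batches each and test_percent_check=0.1, a global cap applied per loader doubles the work:

```python
# Illustrative sketch of the bug, not the actual Lightning code.
num_loaders = 2
batches_per_loader = 100
test_percent_check = 0.1

# data_loading.py computes a single global cap ...
num_test_batches = int(num_loaders * batches_per_loader * test_percent_check)  # 20

# ... but evaluation_loop.py applies that cap to *each* dataloader:
total_run = 0
for dataloader_idx in range(num_loaders):
    for batch_idx in range(batches_per_loader):
        if batch_idx >= num_test_batches:  # global cap used as a per-loader cap
            break
        total_run += 1

print(total_run)  # 40, twice the expected 20
```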

To Reproduce

Steps to reproduce the behavior:

  1. Return multiple dataloaders from test_dataloaders()
  2. Specify test_percent_check.
  3. Run trainer.test()
  4. Observe expected_batches * num_loaders batches being run (see the sketch below). The progress bar also stops updating after expected_batches, since the count exceeds its specified total steps.
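
A minimal sketch of these steps, assuming the pytorch-lightning 0.7.6 API reported below (where test_percent_check is a Trainer argument); the model and data are placeholders:

```python
# Minimal reproduction sketch, assuming the pytorch-lightning 0.7.6 API
# (test_percent_check was renamed in later releases). Model/data are dummies.
import torch
from torch.utils.data import DataLoader, TensorDataset
import pytorch_lightning as pl


class Model(pl.LightningModule):
    def __init__(self):
        super().__init__()
        self.layer = torch.nn.Linear(32, 2)

    def forward(self, x):
        return self.layer(x)

    def test_step(self, batch, batch_idx, dataloader_idx=0):
        x, y = batch
        return {"test_loss": torch.nn.functional.cross_entropy(self(x), y)}

    def test_dataloader(self):
        # Two loaders with 100 batches each (batch_size defaults to 1).
        ds = TensorDataset(torch.randn(100, 32), torch.randint(0, 2, (100,)))
        return [DataLoader(ds), DataLoader(ds)]

    def configure_optimizers(self):
        return torch.optim.Adam(self.parameters())


trainer = pl.Trainer(test_percent_check=0.1)
trainer.test(Model())
# Expected: 200 * 0.1 = 20 batches in total; observed: 40 (the 20-batch cap
# is applied to each loader), and the progress bar stalls at 20/20.
```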

Expected behavior

The correct number of batches is run (20 in the example above).

Environment

  • CUDA:
    • GPU:
    • available: False
    • version: 10.2
  • Packages:
    • numpy: 1.18.4
    • pyTorch_debug: False
    • pyTorch_version: 1.5.0
    • pytorch-lightning: 0.7.6
    • tensorboard: 2.2.0
    • tqdm: 4.45.0
  • System:
    • OS: Linux
    • architecture:
      • 64bit
    • processor:
    • python: 3.7.6
    • version: #1 SMP Debian 4.19.118-2 (2020-04-29)
@binshengliu added the bug (Something isn't working) and help wanted (Open to be worked on) labels on May 20, 2020
@awaelchli (Member) commented May 20, 2020

Just had a look at this. The problem is in the trainer as you say, not the progress bar.
There are two loops: the outer iterates over the dataloaders and the inner loop runs through each one.

So max_batches should be the number of batches to run in each dataloader, not in total.

We can easily fix this.
There should really be a test. There seems to be no test that checks that *_percent_check uses the correct amount of data; we should definitely add these tests.

@awaelchli self-assigned this May 20, 2020
@rohitgr7 (Contributor) commented

The same thing might be happening with val_dataloaders. I suggest max_batches should be a list.

@awaelchli (Member) commented

That's true, yes, I agree, because they could have different lengths.
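
A sketch of what that could look like (illustrative names, not the actual patch): compute one cap per dataloader and keep them as a list, so loaders of different lengths are truncated independently:

```python
# Sketch of the suggested fix (illustrative, not the actual patch):
# one cap per dataloader, kept as a list.
import torch
from torch.utils.data import DataLoader, TensorDataset

test_percent_check = 0.1
loaders = [
    DataLoader(TensorDataset(torch.randn(100, 4))),  # 100 batches (batch_size=1)
    DataLoader(TensorDataset(torch.randn(50, 4))),   # 50 batches, different length
]

# A list of per-loader caps instead of a single global number:
num_test_batches = [int(len(dl) * test_percent_check) for dl in loaders]  # [10, 5]

total_run = 0
for max_batches, dataloader in zip(num_test_batches, loaders):
    for batch_idx, batch in enumerate(dataloader):
        if batch_idx >= max_batches:
            break
        total_run += 1  # the evaluation step would run here

print(total_run)  # 15 == sum(num_test_batches)
```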

@rohitgr7 (Contributor) commented

@awaelchli Is anyone working on this, or should I submit a PR? I need this fixed for debugging and testing in a personal project.

@awaelchli (Member) commented

If you like, that would help us a lot :)
I could help with the tests if you need :)
