Set total number of batches in progress bar while testing #425

kuynzereb · 2019-10-24T11:56:14Z

Now progress bar doesn't show total number of batches in test mode. This PR fixes it.

Borda

please provide more information to reproduce not showing the total number of iterations, Thx

kuynzereb · 2019-10-24T20:19:10Z

Yeah, here is a dummy example:

from time import sleep
import torch
from torch.utils.data import DataLoader, Dataset

import pytorch_lightning as pl


class DummyDataset(Dataset):
    def __init__(self):
        super().__init__()

    def __len__(self):
        return 10

    def __getitem__(self, idx):
        return torch.rand(1)


class CoolSystem(pl.LightningModule):
    def __init__(self):
        super(CoolSystem, self).__init__()

    def forward(self, x):
        return 0

    def training_step(self, batch, batch_nb):
        return {}

    def test_step(self, batch, batch_nb):
        sleep(1)
        return {}

    def test_end(self, outputs):
        return {}

    def configure_optimizers(self):
        return []

    @pl.data_loader
    def train_dataloader(self):
        return []

    @pl.data_loader
    def test_dataloader(self):
        return DataLoader(DummyDataset(), batch_size=1)

model = CoolSystem()
trainer = pl.Trainer(weights_summary=None, nb_sanity_val_steps=0)
trainer.test(model)

If you run this code with current master you will obtain the following output:
3it [00:03, 1.00s/it]

Whereas with this PR you will obtain:
30%|██████████████▋ | 3/10 [00:03<00:08, 1.00s/it]

Borda

tested and it looks good to me, @williamFalcon

kuynzereb · 2019-10-25T09:30:07Z

I have just realized that it is slightly more complicated. This PR only fixes the problem when .test(model) is called without .fit(). But if you call .test() after .fit(model) there again will be strange behavior. For example, run the following code:

from time import sleep
import torch
from torch.utils.data import DataLoader, Dataset

import pytorch_lightning as pl


class DummyDataset(Dataset):
    def __init__(self, n):
        super().__init__()
        self.n = n

    def __len__(self):
        return self.n

    def __getitem__(self, idx):
        return torch.rand(10)


class CoolSystem(pl.LightningModule):
    def __init__(self):
        super(CoolSystem, self).__init__()
        self.layer = torch.nn.Linear(10, 10)

    def forward(self, x):
        return self.layer(x)

    def training_step(self, batch, batch_nb):
        # REQUIRED
        sleep(1)
        return {'loss': torch.mean(self.forward(batch) ** 2)}

    def test_step(self, batch, batch_nb):
        # OPTIONAL
        sleep(1)
        return {}

    def test_end(self, outputs):
        # OPTIONAL
        return {}

    def configure_optimizers(self):
        # REQUIRED
        # can return multiple optimizers and learning_rate schedulers
        # (LBFGS it is automatically supported, no need for closure function)
        return [torch.optim.Adam(self.layer.parameters())]

    @pl.data_loader
    def train_dataloader(self):
        # REQUIRED
        return DataLoader(DummyDataset(10), batch_size=1)

    @pl.data_loader
    def test_dataloader(self):
        # OPTIONAL
        return DataLoader(DummyDataset(5), batch_size=1)

model = CoolSystem()
trainer = pl.Trainer(weights_summary=None, nb_sanity_val_steps=0, early_stop_callback=False,
                     check_val_every_n_epoch=100, max_nb_epochs=1)
trainer.fit(model)
trainer.test()

It will end with

15it [00:15,  1.01s/it, batch_nb=9, epoch=0, loss=0.107, v_nb=26]

We can reset the progress bar in that case too and it will show correct total number of iterations. But then this testing progress bar will show old postfixes from the training. So it seems that actually we should distinguish between train_progress_bar and test_progress_bar. In that sense it seems related to #420.

Borda · 2019-10-25T09:44:20Z

maybe think about moving from tqdm to enlighten
https://github.com/Rockhopper-Technologies/enlighten
the advantage is that the progress bar is not affected by mean-time messages
also see: https://pydigger.com/keyword/bar

williamFalcon · 2019-10-25T10:22:01Z

let’s keep it tqdm for now. we can consider this in a separate PR

williamFalcon · 2019-10-25T21:15:33Z

should we just have the following bar setup?
train bar
val bar
test bar

each shown on top of each other depending on what's happening?

kuynzereb · 2019-10-26T05:17:10Z

Yes, it sounds good. I like your idea that main train bar should have total number of batches (train + val) and that validation bar just pop ups as additional bar. And I just point out that test bar seems to be totally independent of the main train bar.

williamFalcon · 2019-10-30T16:14:42Z

@kuynzereb thanks! want to do a PR for splitting the bars?

kuynzereb · 2019-10-30T16:42:50Z

Yeah, I can give it a try!

Set total number of batches in progress bar while testing

2e6c87c

Borda requested changes Oct 24, 2019

View reviewed changes

Borda approved these changes Oct 24, 2019

View reviewed changes

williamFalcon merged commit f79bdf2 into Lightning-AI:master Oct 30, 2019

kuynzereb mentioned this pull request Nov 1, 2019

Split progress bar #449

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Set total number of batches in progress bar while testing #425

Set total number of batches in progress bar while testing #425

Uh oh!

kuynzereb commented Oct 24, 2019

Uh oh!

Borda left a comment •

edited

Loading

Uh oh!

kuynzereb commented Oct 24, 2019

Uh oh!

Borda left a comment •

edited

Loading

Uh oh!

kuynzereb commented Oct 25, 2019

Uh oh!

Borda commented Oct 25, 2019

Uh oh!

williamFalcon commented Oct 25, 2019

Uh oh!

williamFalcon commented Oct 25, 2019 •

edited

Loading

Uh oh!

kuynzereb commented Oct 26, 2019

Uh oh!

williamFalcon commented Oct 30, 2019

Uh oh!

kuynzereb commented Oct 30, 2019

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Set total number of batches in progress bar while testing #425

Set total number of batches in progress bar while testing #425

Uh oh!

Conversation

kuynzereb commented Oct 24, 2019

Uh oh!

Borda left a comment • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

kuynzereb commented Oct 24, 2019

Uh oh!

Borda left a comment • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

kuynzereb commented Oct 25, 2019

Uh oh!

Borda commented Oct 25, 2019

Uh oh!

williamFalcon commented Oct 25, 2019

Uh oh!

williamFalcon commented Oct 25, 2019 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

kuynzereb commented Oct 26, 2019

Uh oh!

williamFalcon commented Oct 30, 2019

Uh oh!

kuynzereb commented Oct 30, 2019

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Borda left a comment •

edited

Loading

Borda left a comment •

edited

Loading

williamFalcon commented Oct 25, 2019 •

edited

Loading