
Dict specified val_dataloader, test_dataloader and predict_dataloader #16015

Closed
kotchin opened this issue Dec 12, 2022 · 10 comments · Fixed by #17163 or #17402
Labels
data handling Generic data-related topic discussion In a discussion stage feature Is an improvement or enhancement

Comments

kotchin commented Dec 12, 2022

Description & Motivation

Conveniently, a DataModule's train_dataloader() can return a dict, where each key names a dataset and the value is its dataloader.
This is unfortunately not possible with val_dataloader(), test_dataloader(), and predict_dataloader(). Yet it could be particularly useful from a logging perspective, letting the DataModule name the datasets and the LitModule reuse those names when logging.

Perhaps there is a specific reason for why this hasn't been implemented or can't be implemented, but it's not obvious to me.

I have tested the dict-based approaches above, and the documentation for DataModule.from_datasets() also reflects this limitation.

Pitch

No response

Alternatives

No response

Additional context

No response

Edit: typo

cc @Borda @justusschock @awaelchli

@kotchin kotchin added the needs triage Waiting to be triaged by maintainers label Dec 12, 2022
@awaelchli (Member)
Hi, thanks for your interest in this feature!

Perhaps there is a specific reason for why this hasn't been implemented or can't be implemented, but it's not obvious to me.

Yes, sort of. Multiple-dataloader support for validation existed before it was added to the training loop. Until then, validation ran sequentially over the multiple dataloaders, which made the most sense because validation was typically just about collecting metrics. When multiple-dataloader support was considered for training_step, we found that the most practical behavior would be to fetch data from the dataloaders individually and give the collated batches to training_step. This is why there is a difference.

As you say, perhaps in some cases it also makes sense to do the same for validation, where you could log the outputs side-by-side as we iterate through the dataloaders jointly.

@justusschock probably knows a bit more about this and the edge cases here.

@awaelchli awaelchli added discussion In a discussion stage data handling Generic data-related topic feature Is an improvement or enhancement and removed needs triage Waiting to be triaged by maintainers labels Dec 13, 2022
stale bot commented Jan 21, 2023

This issue has been automatically marked as stale because it hasn't had any recent activity. This issue will be closed in 7 days if no further activity occurs. Thank you for your contributions - the Lightning Team!


stale bot commented Apr 14, 2023

This issue has been automatically marked as stale because it hasn't had any recent activity. This issue will be closed in 7 days if no further activity occurs. Thank you for your contributions - the Lightning Team!

@stale stale bot added the won't fix This will not be worked on label Apr 14, 2023

awaelchli commented Apr 14, 2023

Support for dictionaries in the eval steps was added in #16726 :)
(release notes: https://github.com/Lightning-AI/lightning/releases/tag/2.0.0)

@awaelchli awaelchli reopened this Apr 14, 2023
@stale stale bot removed the won't fix This will not be worked on label Apr 14, 2023
@awaelchli (Member)

Support for this will be added with #17163
cc @carmocca

@carmocca (Contributor)

It's not clear to me what this issue requests. Support for multiple dataloaders has existed for a while, but they are run sequentially during evaluation.

#17163 removes this sequential constraint for .validate() and .test()

But the top post doesn't explicitly state that the problem is the sequential execution.


kotchin commented Apr 17, 2023

@carmocca this issue requests the same dict-based dataloader support for val_dataloader, test_dataloader, and predict_dataloader as exists for train_dataloader.

Specifically, when setting up a LightningDataModule, train_dataloader can return a dict where each key corresponds to a dataloader.
For val_dataloader, test_dataloader, and predict_dataloader, this is not possible. At best, one can return a list of dataloaders, but that loses the ability to name each dataloader in the list.

For example, if I want my val_dataloader to contain three different dataloaders, "val_dataloader A", "val_dataloader B", and "val_dataloader C", all I can return is a list of dataloaders. I can't attach a name or definition to each one, so if the order of the dataloaders ever changes, this can't be automatically detected or accounted for by the LightningModule.

This is why the dict-based train_dataloader is convenient: it allows returning

{"dataloader A": dataloader_A, "dataloader B": dataloader_B, "dataloader C": dataloader_C}

instead of
[dataloader_A, dataloader_B, dataloader_C]

The question is mostly why this dict based dataloader organization is only available for train_dataloader and not the other methods.
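The ordering fragility described above can be illustrated without Lightning at all; the placeholder lists below stand in for real DataLoader objects:

```python
# Positional access: reordering a list silently changes what each
# index refers to. (Placeholder lists stand in for real DataLoaders.)
loaders_v1 = [["imgs"], ["text"], ["audio"]]
loaders_v2 = [["text"], ["imgs"], ["audio"]]  # A and B were swapped
assert loaders_v1[0] != loaders_v2[0]  # index 0 now means a different dataset

# Named access: the key travels with the loader, so consumers that
# look things up by name are unaffected by the ordering.
named_v1 = {"A": ["imgs"], "B": ["text"], "C": ["audio"]}
named_v2 = {"B": ["text"], "A": ["imgs"], "C": ["audio"]}
assert named_v1["A"] == named_v2["A"]
```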

Edit: This is also reflected in the documentation for the LightningDataModule:
https://lightning.ai/docs/pytorch/latest/api/lightning.pytorch.core.LightningDataModule.html#lightning.pytorch.core.LightningDataModule.from_datasets

Here val_dataset, test_dataset, and predict_dataset can't be of type Mapping[str, Dataset], unlike train_dataset.

Edit2: this person is also reporting the limitation I'm talking about: #16830 (comment)

@carmocca (Contributor)

A dict can be returned at least in version 2.0. Try playing around with this script:

import os

import torch
from torch.utils.data import DataLoader, Dataset

from lightning.pytorch import LightningModule, Trainer


class RandomDataset(Dataset):
    def __init__(self, size, length):
        self.len = length
        self.data = torch.randn(length, size)

    def __getitem__(self, index):
        return self.data[index]

    def __len__(self):
        return self.len


class BoringModel(LightningModule):
    def __init__(self):
        super().__init__()
        self.layer = torch.nn.Linear(32, 2)

    def forward(self, x):
        return self.layer(x)

    def test_step(self, batch, batch_idx, dataloader_idx=0):
        print(dataloader_idx, batch_idx, batch)

    def test_dataloader(self):
        return {
            "A": DataLoader(RandomDataset(32, 64), batch_size=2),
            "B": DataLoader(RandomDataset(32, 64), batch_size=2),
        }


def run():
    model = BoringModel()
    trainer = Trainer(
        default_root_dir=os.getcwd(),
        limit_test_batches=1,
        barebones=True,
    )
    trainer.test(model)


if __name__ == "__main__":
    run()

But the dataloaders will be consumed sequentially (during validate, test and predict), and the dictionary keys are not included in the batch. Perhaps you meant that you are interested in consuming them using a different mode: https://lightning.ai/docs/pytorch/stable/data/iterables.html#multiple-iterables

Example:

    def test_dataloader(self):
        from lightning.pytorch.utilities import CombinedLoader
        dataloaders = {"A": DataLoader(RandomDataset(32, 64), batch_size=2), "B": DataLoader(RandomDataset(32, 64), batch_size=2)}
        return CombinedLoader(dataloaders, mode="max_size_cycle")

Which in 2.0.1 raises:

ValueError: `trainer.test()` only supports the `CombinedLoader(mode="sequential")` mode.

But I just implemented support on master with #17163, in case this is blocking you.
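For intuition, the max_size_cycle mode mentioned above can be sketched in plain Python. This is a simplification of the idea, not CombinedLoader's actual implementation, and the names are illustrative:

```python
from itertools import cycle, islice

def max_size_cycle(iterables):
    """Yield dicts of batches, one per named iterable, cycling the
    shorter iterables until the longest one is exhausted -- a sketch of
    the idea behind CombinedLoader(mode="max_size_cycle")."""
    longest = max(len(it) for it in iterables.values())
    cycled = {name: islice(cycle(it), longest) for name, it in iterables.items()}
    return [{name: next(it) for name, it in cycled.items()} for _ in range(longest)]

batches = max_size_cycle({"A": [1, 2, 3], "B": [10, 20]})
# -> [{'A': 1, 'B': 10}, {'A': 2, 'B': 20}, {'A': 3, 'B': 10}]
```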


carmocca commented Apr 17, 2023

The from_datasets method needs an update, though. Good catch. Opened #17402


kotchin commented Aug 23, 2023

@carmocca it seems your proposal goes in the direction I'm pitching.

If I can indeed consume the validation and test data as ("dataloader name", batch), that works for me. It helps, when validating and testing on multiple dataloaders, to know which dataloader each batch comes from, and it lets the logging reuse the dataloader's name.

Thank you for your help.

Edit: we may close this issue as resolved.
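As a closing note, one way to recover a dataloader's name from the positional dataloader_idx in an eval step relies on Python dicts preserving insertion order, which matches the sequential iteration order. The names and stand-in lists below are illustrative, not part of the Lightning API:

```python
# Stand-ins for real DataLoader objects, keyed by name.
loaders = {"val_A": ["a1", "a2"], "val_B": ["b1"], "val_C": ["c1", "c2"]}

# Python dicts preserve insertion order, so positional indices can be
# mapped back to the names the loaders were registered under.
idx_to_name = dict(enumerate(loaders))  # {0: "val_A", 1: "val_B", 2: "val_C"}

def validation_step(batch, batch_idx, dataloader_idx=0):
    # Derive a per-dataloader metric key from the registered name.
    return f"{idx_to_name[dataloader_idx]}/loss"

print(validation_step("c1", 0, dataloader_idx=2))  # -> val_C/loss
```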
