Queue length modification with the use of DDP #1127

Merged: 7 commits into fepegar:main on Nov 24, 2023
Conversation

@haughty-yeon (Contributor) commented Nov 23, 2023

Modified num_subjects() and iterations_per_epoch().

Fixes #1125.

Description
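
For context, here is a minimal sketch of the intended behavior, assuming the two properties defer to the per-rank subject sampler under DDP (a hypothetical reconstruction; attribute names mirror tio.Queue, but this is not the merged diff):

class QueueSketch:
    """Hypothetical sketch of the two Queue properties this PR touches."""

    def __init__(self, subjects_dataset, samples_per_volume, subject_sampler=None):
        self.subjects_dataset = subjects_dataset
        self.samples_per_volume = samples_per_volume
        self.subject_sampler = subject_sampler  # e.g. a DistributedSampler under DDP

    @property
    def num_subjects(self) -> int:
        # Under DDP, the subject sampler yields only this rank's shard of the
        # subjects, so it, rather than the full dataset, defines the epoch size
        if self.subject_sampler is not None:
            return len(self.subject_sampler)
        return len(self.subjects_dataset)

    @property
    def iterations_per_epoch(self) -> int:
        # Patches produced per rank per epoch
        return self.num_subjects * self.samples_per_volume

With a DistributedSampler over 6 subjects and 3 ranks, len(subject_sampler) is 2, so each rank expects 2 * samples_per_volume iterations: the per-rank queue length shrinks by the world size, as the linked issue requests.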

Checklist

  • I have read the CONTRIBUTING docs and have a developer setup (especially important are pre-commit and pytest)
  • Non-breaking change (would not break existing functionality)
  • Breaking change (would cause existing functionality to change)
  • Tests added or modified to cover the changes
  • Integration tests passed locally by running pytest
  • In-line docstrings updated
  • Documentation updated, tested running make html inside the docs/ folder
  • This pull request is ready to be reviewed

haughty-yeon and others added 2 commits on November 22, 2023 17:03: num_subjects() and iterations_per_epoch() modified.
@fepegar (Owner) commented Nov 24, 2023

This makes total sense. I've added some minor readability changes and tested the new implementation as follows:

import os

import torch
import torch.distributed as dist
import torchio as tio
from loguru import logger


num_subjects = 6
samples_per_volume = 2
max_length = 1000

subjects = []
tensor = torch.ones(1, 16, 16, 16)
# Toy subjects whose voxel intensities encode the subject index
for i in range(num_subjects):
    subject = tio.Subject(
        image=tio.ScalarImage(tensor=i * tensor),
        id=i,
    )
    subjects.append(subject)
dataset = tio.SubjectsDataset(subjects)

# WORLD_SIZE is set by the launcher (torchrun), so this branch runs only under DDP
is_distributed = bool(os.environ.get('WORLD_SIZE'))
if is_distributed:
    dist.init_process_group()
    # Each rank draws a disjoint shard of the subjects
    subject_sampler = torch.utils.data.distributed.DistributedSampler(
        dataset,
        shuffle=False,
    )
    rank = dist.get_rank()
else:
    subject_sampler = None
    rank = 0

patch_sampler = tio.sampler.UniformSampler(patch_size=2)

queue = tio.Queue(
    dataset,
    max_length,
    sampler=patch_sampler,
    samples_per_volume=samples_per_volume,
    num_workers=0,
    shuffle_subjects=False,
    shuffle_patches=False,
    subject_sampler=subject_sampler,  # each rank's queue loads only its own subjects
)

loader = torch.utils.data.DataLoader(
    queue,
    batch_size=1,
    num_workers=0,  # must be 0 when loading from a Queue
    shuffle=False,
    collate_fn=lambda x: x[0],  # unwrap the single patch from its singleton batch
)

for i, patch in enumerate(loader):
    logger.info(f'Rank {rank} | Batch {i} | Subject {patch["id"]}')

Run with

torchrun --nproc_per_node=3 /tmp/ddp.py
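
torchrun sets WORLD_SIZE (here, 3) and RANK in each of the three processes it spawns, so the script takes the distributed branch and initializes the process group.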

Output:

2023-11-23 16:19:14.933 | INFO     | __main__:<module>:57 - Rank 1 | Batch 0 | Subject 1
2023-11-23 16:19:14.933 | INFO     | __main__:<module>:57 - Rank 1 | Batch 1 | Subject 1
2023-11-23 16:19:14.933 | INFO     | __main__:<module>:57 - Rank 1 | Batch 2 | Subject 4
2023-11-23 16:19:14.933 | INFO     | __main__:<module>:57 - Rank 1 | Batch 3 | Subject 4
2023-11-23 16:19:14.935 | INFO     | __main__:<module>:57 - Rank 0 | Batch 0 | Subject 0
2023-11-23 16:19:14.935 | INFO     | __main__:<module>:57 - Rank 0 | Batch 1 | Subject 0
2023-11-23 16:19:14.935 | INFO     | __main__:<module>:57 - Rank 0 | Batch 2 | Subject 3
2023-11-23 16:19:14.935 | INFO     | __main__:<module>:57 - Rank 0 | Batch 3 | Subject 3
2023-11-23 16:19:14.947 | INFO     | __main__:<module>:57 - Rank 2 | Batch 0 | Subject 2
2023-11-23 16:19:14.947 | INFO     | __main__:<module>:57 - Rank 2 | Batch 1 | Subject 2
2023-11-23 16:19:14.947 | INFO     | __main__:<module>:57 - Rank 2 | Batch 2 | Subject 5
2023-11-23 16:19:14.947 | INFO     | __main__:<module>:57 - Rank 2 | Batch 3 | Subject 5
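
Each rank emits 2 × 2 = 4 patches, as expected: 6 subjects sharded across 3 ranks leaves 2 subjects per rank, each sampled twice. With shuffling disabled, the DistributedSampler assigns subjects in strides of the world size, so rank r gets subjects r and r + 3.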

@haughty-yeon reopened this Nov 24, 2023
@fepegar merged commit befd121 into fepegar:main Nov 24, 2023
23 of 24 checks passed
@fepegar (Owner) commented Nov 24, 2023

Thanks for your contribution, @haughty-yeon!

@allcontributors please add @haughty-yeon for bug

@allcontributors (bot) commented

@fepegar I couldn't determine any contributions to add. Did you specify any contributions? Please make sure to use valid contribution names.

I've put up a pull request to add @haughty-yeon! 🎉

Successfully merging this pull request may close these issues: Halve queue length when using DDP (#1125).