A small multi GPU training bug #5

Aotle · 2023-03-26T07:35:06Z

It's on lines 98-100 of IRRA/processor/processor.py

if args.distributed: 
     top1 = evaluator.eval(model.module.eval())
else:
     top1 = evaluator.eval(model.eval())

The text was updated successfully, but these errors were encountered:

anosorae · 2023-03-26T07:37:22Z

Thank you for pointing out this issue.

Aotle · 2023-03-27T00:36:34Z

lines 108 of IRRA/processor/processor.py

if get_rank() == 0:
    logger.info(f"best R1: {best_top1} at epoch {arguments['epoch']}")

anosorae · 2023-03-27T08:28:21Z

We have only used a single GPU in our experiments and have not verified the validity of the multi-GPU training code, so we cannot guarantee its reliability if you want to train with DDP.

Anyway, you are welcome to keep submitting bugs about multi GPU training and we will fix it as soon as possible.

Pefect96 · 2023-05-26T14:20:13Z

lines 108 of IRRA/processor/processor.py

if get_rank() == 0:
    logger.info(f"best R1: {best_top1} at epoch {arguments['epoch']}")

I want to know if the multi GPU training is work?

anosorae closed this as completed Apr 4, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

A small multi GPU training bug #5

A small multi GPU training bug #5

Aotle commented Mar 26, 2023

anosorae commented Mar 26, 2023

Aotle commented Mar 27, 2023

anosorae commented Mar 27, 2023

Pefect96 commented May 26, 2023

A small multi GPU training bug #5

A small multi GPU training bug #5

Comments

Aotle commented Mar 26, 2023

anosorae commented Mar 26, 2023

Aotle commented Mar 27, 2023

anosorae commented Mar 27, 2023

Pefect96 commented May 26, 2023