
[Bug] Duplicate logging #1012

Closed
Vozf opened this issue Sep 24, 2020 · 12 comments
Labels
bug Something isn't working help wanted Community help is wanted

Comments


Vozf commented Sep 24, 2020

🐛 Bug

Description

I am using PyTorch Lightning with Hydra, and the output from pytorch-lightning is duplicated by Hydra's logging.
Here is an example:

CometLogger will be initialized in online mode
[2020-09-24 16:04:43,389][lightning][INFO] - CometLogger will be initialized in online mode
GPU available: True, used: True
[2020-09-24 16:04:43,441][lightning][INFO] - GPU available: True, used: True
TPU available: False, using: 0 TPU cores
[2020-09-24 16:04:43,442][lightning][INFO] - TPU available: False, using: 0 TPU cores
CUDA_VISIBLE_DEVICES: [2]
[2020-09-24 16:04:43,442][lightning][INFO] - CUDA_VISIBLE_DEVICES: [2]
Vozf added the bug label Sep 24, 2020
Collaborator

omry commented Sep 24, 2020

Thanks for reporting.
You are missing a minimal repro.
Please reopen if you can provide it.

As a side tip:
You can disable Hydra's logging configuration by overriding the config group hydra/job_logging to disabled.

omry closed this as completed Sep 24, 2020

Huizerd commented Dec 24, 2020

Could this be reopened? I'm having the same issue, please see this MWE:

config.yaml:

defaults:
  - hydra/job_logging: disabled

hydra:
  output_subdir: Null
  run:
    dir: .

main.py:

import hydra
from omegaconf import OmegaConf

import pytorch_lightning as pl
import torch.nn as nn
import torch.optim as optim
import torch.nn.functional as F


class Model(pl.LightningModule):
    def __init__(self, in_size, hid_size, out_size):
        super().__init__()

        self.fc1 = nn.Linear(in_size, hid_size)
        self.fc2 = nn.Linear(hid_size, out_size)

    def forward(self, x):
        return self.fc2(self.fc1(x))

    def training_step(self, batch, batch_idx):
        x, y = batch
        logits = self(x)
        loss = F.cross_entropy(logits, y)
        return loss
    
    def configure_optimizers(self):
        return optim.Adam(self.parameters(), lr=1e-3)


@hydra.main(config_name="config")
def main(cfg):
    print(OmegaConf.to_yaml(cfg))

    model = Model(5, 10, 2)
    trainer = pl.Trainer()


if __name__ == "__main__":
    main()

Having hydra/job_logging set to disabled (which is desired in my case) leads to the following output (disregard the warning):

{}

/*/issues/venv/lib/python3.8/site-packages/pytorch_lightning/utilities/distributed.py:49: UserWarning: GPU available but not used. Set the --gpus flag when calling the script.
  warnings.warn(*args, **kwargs)

Commenting out the hydra/job_logging override leads to the following output:

{}

GPU available: True, used: False
[2020-12-24 10:30:34,753][lightning][INFO] - GPU available: True, used: False
TPU available: None, using: 0 TPU cores
[2020-12-24 10:30:34,753][lightning][INFO] - TPU available: None, using: 0 TPU cores
/home/huis/Documents/issues/venv/lib/python3.8/site-packages/pytorch_lightning/utilities/distributed.py:49: UserWarning: GPU available but not used. Set the --gpus flag when calling the script.
  warnings.warn(*args, **kwargs)

As you can see, with hydra/job_logging: disabled the output from PyTorch Lightning is also blocked in some way, whereas with Hydra logging enabled, it is printed in duplicate.

In my case, I would want Hydra logging to be disabled, while still being able to see the output from PyTorch Lightning.

pip freeze:

absl-py==0.11.0
antlr4-python3-runtime==4.8
cachetools==4.2.0
certifi==2020.12.5
chardet==4.0.0
fsspec==0.8.5
future==0.18.2
google-auth==1.24.0
google-auth-oauthlib==0.4.2
grpcio==1.34.0
hydra-core==1.0.4
idna==2.10
importlib-resources==4.0.0
Markdown==3.3.3
numpy==1.19.4
oauthlib==3.1.0
omegaconf==2.0.5
pkg-resources==0.0.0
protobuf==3.14.0
pyasn1==0.4.8
pyasn1-modules==0.2.8
pytorch-lightning==1.1.2
PyYAML==5.3.1
requests==2.25.1
requests-oauthlib==1.3.0
rsa==4.6
six==1.15.0
tensorboard==2.4.0
tensorboard-plugin-wit==1.7.0
torch==1.7.1
tqdm==4.54.1
typing-extensions==3.7.4.3
urllib3==1.26.2
Werkzeug==1.0.1

Collaborator

omry commented Dec 24, 2020

Thanks for the repro. We will take a look.

About disabled as a logging config:
The intention there is to disable logging output, not to disable the configuration of the logging.

There is a pull request that added support for not configuring the logging in Hydra 1.1 (which is not yet released).
Take a look at that PR; you should be able to apply the config there to your own application using Hydra 1.0.

Generally speaking, Hydra configures the logging by default.
You can use Hydra to configure all of the application logging. The problem is that many other libraries also configure the logging, and sometimes the two configurations clash.
My guess is that both Hydra and Lightning are adding a console appender, which is why you are seeing the log outputs twice.
I further guess that you can fix this by doing one of the following:

  1. Have Hydra not configure the logging at all (see PR above).
  2. Have Hydra configure file logging, but not add a console appender.
  3. Somehow tell Lightning to not configure the logging and trust Hydra to do it.
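
Option 2 could look something like the following sketch: a user-supplied hydra/job_logging config that keeps a file handler but omits the console one. The file name, formatter, and handler names are illustrative, following the standard logging.config.dictConfig schema that Hydra's logging configs use; treat it as a starting point, not Hydra's exact default config.

```yaml
# conf/hydra/job_logging/file_only.yaml (sketch, names illustrative)
version: 1
formatters:
  simple:
    format: '[%(asctime)s][%(name)s][%(levelname)s] - %(message)s'
handlers:
  file:
    class: logging.FileHandler
    formatter: simple
    filename: ${hydra.job.name}.log
root:
  level: INFO
  handlers: [file]  # no console handler, so only Lightning prints to the console
disable_existing_loggers: false
```

Selecting it would then be a matter of adding `- hydra/job_logging: file_only` to the defaults list.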

I am interested in community help to identify the root cause, if you want to do some digging it will help.

omry reopened this Dec 24, 2020
omry added the help wanted label Dec 24, 2020

Huizerd commented Dec 24, 2020

Thanks for the very quick response and the directions, I will try them out and get back to you with the root cause as soon as I have found it!


Huizerd commented Dec 28, 2020

Ok, multiple things to go on:

  • Configuring the configs as in "Implemented disabling logging configuration & added config file" #1130 (hydra/job_logging=none and hydra/hydra_logging=none) works perfectly; only the Lightning output is printed.
  • It seems to be related to disable_existing_loggers in hydra/job_logging. For the default.yaml config, this is set to false, which leads to duplicate output; however, setting it to true (as in disabled.yaml) leads to no output, seemingly because Hydra in that case doesn't receive anything from Lightning.

Collaborator

omry commented Dec 28, 2020

I don't want to disable existing loggers by default because there are many scenarios where you would want to preserve the existing loggers.

Did you find what in Lightning is configuring the logging?
Another idea is to explicitly configure the lightning loggers in your Hydra logging config.
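
That second idea could be sketched as an explicit loggers entry in a custom job_logging config. The schema is the standard logging.config.dictConfig format; the console handler name is an assumption about the rest of the config.

```yaml
# Sketch: route the lightning logger through a single handler explicitly.
loggers:
  lightning:
    level: INFO
    handlers: [console]  # 'console' must exist in the handlers section
    propagate: false     # don't also bubble records up to the root handlers
```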


JackUrb commented Dec 31, 2020

Running into a related issue with Mephisto as well. In my case, Hydra doesn't simply duplicate logs; it raises the root logger to the INFO level globally, enabling INFO output from every logger:

root.setLevel(logging.INFO)

This leads me to get all kinds of spam from other packages... Is this actually desired behavior? If so, I'll have to have Mephisto add:

 {"hydra/job_logging": "disabled"},
 {"hydra/hydra_logging": "disabled"},

to all defaults lists, but I'm not sure if I'm losing important logging info by doing this. Am I missing something?
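
The level-raising effect described here can be sketched with the standard logging module alone, independent of Hydra (the logger name is illustrative):

```python
import logging

# Sketch of the effect described above: a third-party logger that never
# sets its own level inherits the root logger's level. Raising root to
# INFO therefore lets INFO-level records from every such logger through.
noisy = logging.getLogger("some.chatty.dependency")  # illustrative name

# Before: root defaults to WARNING, so the dependency's INFO is filtered.
print(noisy.getEffectiveLevel() == logging.WARNING)  # True

# Stand-in for a framework calling root.setLevel(logging.INFO):
logging.getLogger().setLevel(logging.INFO)

# After: the dependency now inherits INFO and its messages start flowing.
print(noisy.getEffectiveLevel() == logging.INFO)  # True
```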

Collaborator

omry commented Dec 31, 2020

@JackUrb, since the problem you are describing is different, we should discuss it in its own issue.


Huizerd commented Jan 11, 2021

@omry it seems to be related to this issue with PyTorch Lightning, and not anything Hydra-related, so I will continue the search there.

Collaborator

omry commented Jan 11, 2021

@Huizerd, thanks for the update!
Closing this, feel free to comment here if you have any follow-up questions/comments.


hughplay commented Mar 25, 2021

I provide my observation here for anyone who encounters similar problems.

Short answer

Make sure only the root logger has a StreamHandler. For example, if you use pytorch-lightning 1.2.5, you can do:

  1. In your main function, run:
pl._logger.handlers = []
pl._logger.propagate = True
  2. Or simply move your import pytorch_lightning as pl into your main function.

Long answer and explanation

  1. Basically, this is because there are two loggers that both have a StreamHandler.
  2. In the newest version of PyTorch Lightning, they even check whether the root logger already has a handler:
# pytorch_lightning.__init__.py, version: 1.2.5
_root_logger = logging.getLogger()
_logger = logging.getLogger(__name__)
_logger.setLevel(logging.INFO)

# if root logger has handlers, propagate messages up and let root logger process them
if not _root_logger.hasHandlers():
    _logger.addHandler(logging.StreamHandler())
    _logger.propagate = False
  3. However, in most cases, we use PL and Hydra like this:
import hydra
import pytorch_lightning
from omegaconf import DictConfig

@hydra.main(config_path='conf', config_name='config')
def main(cfg: DictConfig) -> None:
    ...

if __name__ == '__main__':
    main()
  4. The problem is that the check in step 2 happens too early, before Hydra constructs its logging: _root_logger still has no handlers at import time, and Hydra only configures the root logger once hydra.main runs.
  5. Therefore, we can use either of the two ways introduced in the short answer to solve this.
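
To make the mechanics concrete, here is a minimal sketch using only the standard logging module (no PL or Hydra; the logger name and handler class are illustrative): a child logger with its own handler that still propagates to a root logger with a handler emits every record twice, and the fix from the short answer collapses that back to one.

```python
import logging

emitted = []  # messages seen by any handler, for counting duplicates

class CountingHandler(logging.Handler):
    """Stand-in for a StreamHandler that records what it would print."""
    def emit(self, record):
        emitted.append(record.getMessage())

# Stand-in for pytorch_lightning's module logger, which (in older
# versions) installed its own handler while keeping propagate=True.
child = logging.getLogger("pl_demo")
child.setLevel(logging.INFO)
child.addHandler(CountingHandler())

# Stand-in for Hydra configuring the root logger afterwards.
logging.getLogger().addHandler(CountingHandler())

child.info("GPU available: True")
print(len(emitted))  # 2 -- child's handler fires, then root's via propagation

# The fix from the short answer: drop the child's own handler and rely
# on propagation to the (root-configured) handler alone.
emitted.clear()
child.handlers = []
child.propagate = True

child.info("GPU available: True")
print(len(emitted))  # 1 -- only the root handler fires
```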

Collaborator

omry commented Mar 25, 2021

Thanks for the comment @hughplay.
I think it's not great that importing PL has a side effect of manipulating the logging subsystem.
Consider reaching out to the PL team to allow some way to opt out of this behavior (or for them to make it opt-in).

One thing you can try is to use hydra.hydra_logging.disable_existing_loggers=true.
I have yet to see a minimal repro using Hydra and PL, so if you can come up with one it will help (Please file it in a different issue to avoid overloading this one).
