Changing entry.py for MisconfigurationException error #40

Sangyoon-Bae · 2021-12-01T02:25:35Z

Hi! This is Stella from Seoul National University, I'm getting a lot of help from your code.
I have a question about entry.py line 87.
Originally it has metric = 'valid_' + get_dataset(dm.dataset_name)['metric']

but when I run model, I faced error like this:
'pytorch_lightning.utilities.exceptions.MisconfigurationException: ModelCheckpoint(monitor='valid_mae') not found in the returned metrics: ['train_loss']. HINT: Did you call self.log('valid_mae', value) in the LightningModule?'

So I changed the line 87 as metric = 'train_loss'

and it runs well.

I'm quite afraid that I'm doing something wrong, is it right way to modify the code?
Here are some useful information for my project:

task: regression
input type : integer (originally continuous value, but discretized)
target type : real value
eval metric : rmse
features from data.py

```
        'num_class': 1,
```
```
        'loss_fn': F.l1_loss,
```
```
        'metric': 'mae',
```
```
        'metric_mode': 'min',
```

The text was updated successfully, but these errors were encountered:

zhengsx · 2021-12-01T09:42:59Z

You can have a look about the usage of ModelCheckpoint in pytorch_lightning (see here). Basically it helps to monitor a self-defined metric during the training process.

Sangyoon-Bae · 2021-12-06T07:24:58Z

Thank you for kind reply! I'm always grateful for your help :)
I used shell script in training ZINC dataset, and is this script just for pre-training?
If I want to validate model and test it, then should I do transfer learning like molhiv training example with pre-trained model which is made with this script?

zhengsx · 2021-12-06T12:44:33Z

The reproduction of the ZINC results reported in our paper doesn't need pre-training. You can directly use the script. Btw, remember to test your model on ZINC testset, refer to #35 .

Sangyoon-Bae · 2021-12-07T16:14:23Z

Then shell script in training ZINC dataset do training and validation, and should I use test dataset with another script to reproduce test MAE 0.122?

then loss in log bar (it is not ZINC data) means validation loss?

Sangyoon-Bae · 2021-12-08T04:42:12Z

I solved the second problem! the loss in log bar means average loss about the past 100 batches in training.

zhengsx · 2021-12-08T10:56:02Z

Then shell script in training ZINC dataset do training and validation, and should I use test dataset with another script to reproduce test MAE 0.122?

then loss in log bar (it is not ZINC data) means validation loss?

Yes, pls save your ckpt according to best val then test it on Testset.

zhengsx · 2021-12-23T12:17:20Z

Close this issue due to inactive. Feel free to raise a new one or reopen this one for any further question.

zhengsx closed this as completed Dec 23, 2021

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Changing entry.py for MisconfigurationException error #40

Changing entry.py for MisconfigurationException error #40

Sangyoon-Bae commented Dec 1, 2021

zhengsx commented Dec 1, 2021

Sangyoon-Bae commented Dec 6, 2021

zhengsx commented Dec 6, 2021

Sangyoon-Bae commented Dec 7, 2021

Sangyoon-Bae commented Dec 8, 2021

zhengsx commented Dec 8, 2021

zhengsx commented Dec 23, 2021

Changing entry.py for MisconfigurationException error #40

Changing entry.py for MisconfigurationException error #40

Comments

Sangyoon-Bae commented Dec 1, 2021

zhengsx commented Dec 1, 2021

Sangyoon-Bae commented Dec 6, 2021

zhengsx commented Dec 6, 2021

Sangyoon-Bae commented Dec 7, 2021

Sangyoon-Bae commented Dec 8, 2021

zhengsx commented Dec 8, 2021

zhengsx commented Dec 23, 2021