This repository was archived by the owner on Aug 28, 2025. It is now read-only.

Mistake in GLUE code example #133

@Valahaar

Description

Hi, I found a mistake in how `total_steps` is computed in https://pytorch-lightning.readthedocs.io/en/stable/notebooks/lightning_examples/text-transformers.html

Specifically, in the `setup()` method, we have:

```python
# Calculate total steps
tb_size = self.hparams.train_batch_size * max(1, self.trainer.gpus)
ab_size = self.trainer.accumulate_grad_batches * float(self.trainer.max_epochs)
self.total_steps = (len(train_loader.dataset) // tb_size) // ab_size
```

If I'm not mistaken, it should be something along the lines of:

```python
# Calculate total steps
tb_size = self.hparams.train_batch_size * max(1, self.trainer.gpus)
ab_size = tb_size * self.trainer.accumulate_grad_batches
self.total_steps = int((len(train_loader.dataset) / ab_size) * float(self.trainer.max_epochs))
```

In the first version, on MRPC (3668 training instances), with 30 epochs, a batch size of 32, 1 GPU, and `accumulate_grad_batches=1`, `total_steps` comes out to 3; in the second version, it comes out to 3438.
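
For reference, here is a minimal plain-Python sketch of the two computations (no Lightning objects; the variable names are my own stand-ins for the corresponding `self.hparams` / `self.trainer` attributes) that plugs in these numbers and reproduces the 3 vs. 3438 figures:

```python
# Reproduce both formulas with the numbers quoted above
# (MRPC, 30 epochs, batch size 32, 1 GPU, accumulate_grad_batches=1).
dataset_size = 3668
train_batch_size = 32
gpus = 1
accumulate_grad_batches = 1
max_epochs = 30

# Current notebook version
tb_size = train_batch_size * max(1, gpus)
ab_size = accumulate_grad_batches * float(max_epochs)
print((dataset_size // tb_size) // ab_size)               # 3.0

# Proposed version
ab_size = tb_size * accumulate_grad_batches
print(int((dataset_size / ab_size) * float(max_epochs)))  # 3438
```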

cc @Borda

Metadata

Labels

bug / fix (Something isn't working)
