[BERT/PyTorch] Unable to increase the iteration count #472

@goyalankit

Related to Model/Framework(s)
BERT/PyTorch

Describe the bug
Increasing max_steps doesn't increase the iteration count

I am trying to increase the number of iterations for the BERT PyTorch model so that it trains for longer. However, changing max_steps leaves the iteration count unchanged at 1963. What is the right way to increase the iteration count?

To Reproduce
Increase max_steps in run_pretraining.sh for PyTorch/BERT and rerun pretraining. The iteration count remains the same.

Expected behavior
Changing max_steps should increase the iteration count.
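
To make the expectation concrete, here is a minimal sketch of a step-limited training loop. It is not the repository's code: `simulate_training` and `batches_per_shard` are hypothetical names, and the idea that the displayed count could be a per-shard progress bar length is an assumption to rule out, not a confirmed cause. The value 1963 is taken from the report purely for illustration.

```python
def simulate_training(max_steps, batches_per_shard):
    """Toy loop over data shards, stopping once max_steps is reached.

    Hypothetical illustration only. Returns a tuple of
    (global steps run, shards visited, progress bar length per shard).
    """
    if batches_per_shard <= 0:
        raise ValueError("batches_per_shard must be positive")

    global_step = 0
    shards_visited = 0
    while global_step < max_steps:
        shards_visited += 1
        # A progress bar created per shard would report this fixed length,
        # regardless of the value of max_steps.
        for _ in range(batches_per_shard):
            if global_step >= max_steps:
                break
            global_step += 1
    return global_step, shards_visited, batches_per_shard


if __name__ == "__main__":
    for steps in (4000, 8000):
        total, shards, bar_length = simulate_training(steps, 1963)
        print(f"max_steps={steps}: ran {total} steps over {shards} shards, "
              f"bar length {bar_length}")
```

In this sketch the total number of steps follows max_steps while the per-shard bar length stays fixed, so checking which of the two numbers the training log actually reports may help narrow the problem down.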

Environment
  • Container version: nvidia/pytorch:20.03-py3
  • GPUs in the system: Tesla V100
  • CUDA driver version:

Labels: bug
