
[BERT/Tensorflow] Training speed on RTX 2080 ti #762

@kwonmha

I'm pre-training BERT and looking for ways to speed it up.
I want to share the training speed I get on an RTX 2080 Ti and have it checked.
The speed looks reasonable to me, but I'd like to know of any way to improve it.

Spec:
- RTX 2080 Ti (11 GB memory)
- TensorFlow 1.14
- sequence length = 512

| n_gpu (w/ Horovod) | fp32 (batch 4) | amp+xla (batch 6) |
| --- | --- | --- |
| 1 | 13 | 32 |
| 2 | 18 | 43 |
| 4 | 59 | 77 |

The numbers in the table are throughput in examples/sec.
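
As a rough way to read these numbers, the sketch below computes multi-GPU scaling efficiency, meaning measured throughput divided by the single-GPU throughput times the number of GPUs. The throughput values are copied from the table. The `throughput` dictionary and the `scaling_efficiency` helper are hypothetical names used only for illustration, and the definition of efficiency is an assumption about how to compare the runs.

```python
# Throughput in examples/sec, copied from the table in this post.
# Keys of the inner dicts are the number of GPUs.
throughput = {
    "fp32": {1: 13, 2: 18, 4: 59},
    "amp+xla": {1: 32, 2: 43, 4: 77},
}


def scaling_efficiency(rates):
    """Return measured throughput as a fraction of ideal linear scaling.

    Ideal linear scaling is assumed to be n_gpu * single-GPU throughput.
    """
    base = rates[1]
    return {n_gpu: rate / (n_gpu * base) for n_gpu, rate in rates.items()}


if __name__ == "__main__":
    for mode, rates in throughput.items():
        for n_gpu, eff in scaling_efficiency(rates).items():
            print(f"{mode:8s} {n_gpu} GPU(s): {rates[n_gpu]:3d} ex/s, "
                  f"{eff:.0%} of linear scaling")
```

By this measure, the amp+xla runs land at roughly 67% (2 GPUs) and 60% (4 GPUs) of linear scaling. The fp32 run with 4 GPUs comes out above 100%, which may mean one of the fp32 figures is worth re-measuring.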

Is there anything more I can do to speed this up?
Unfortunately, because of the long sequence length, I can't raise the batch size to 8, which might allow Tensor Cores to be used.
I'd also like to compare these numbers with other people's results.
