
Only access loss tensor every logging_steps #6802

Merged · 23 commits · Aug 31, 2020

Commits on Aug 28, 2020

  1. Only access loss tensor every logging_steps

    * tensor.item() was being called every step. This must not be done
    for XLA:TPU tensors, as it forces TPU<>CPU communication at each
    step and is terrible for performance. On RoBERTa MLM, for example,
    it reduces step time by 30%; the gain should be larger for
    models/tasks with shorter step times.
    * Train batch size was not correct when a user uses the
    `per_gpu_train_batch_size` flag.
    * Average-reduce loss across eval shards.

    (Sketches of the loss-logging and eval-reduction patterns follow
    this day's commit list.)
    jysohn23 committed Aug 28, 2020 · 18fc69a
  2. 20f7786
  3. 3cac867
  4. 5ab21b0
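To illustrate the first bullet of commit 1 above: a minimal sketch of the pattern, not the PR's actual Trainer code. The loop structure, names, and `logging_steps` default here are illustrative.

```python
# Minimal sketch of "only access the loss tensor every logging_steps".
# Illustrative only; not the Trainer's actual code.
import torch

def train_loop(model, optimizer, dataloader, logging_steps=100):
    device = next(model.parameters()).device
    tr_loss = torch.zeros(1, device=device)  # running loss stays on device
    for step, batch in enumerate(dataloader, start=1):
        loss = model(**batch)[0]  # assumes the model returns the loss first
        loss.backward()
        optimizer.step()
        optimizer.zero_grad()
        tr_loss += loss.detach()  # tensor-to-tensor add: no TPU<>CPU sync
        if step % logging_steps == 0:
            # .item() materializes the scalar on the host; doing this only
            # every logging_steps avoids a device sync on every step.
            print(f"step {step}: avg loss {tr_loss.item() / step:.4f}")
```

And for the third bullet, a sketch of averaging the eval loss across TPU shards. `xm.mesh_reduce` is torch_xla's cross-process reduction; the PR's exact call site may differ.

```python
# Sketch of averaging an eval loss across TPU shards (third bullet above).
# mesh_reduce collects the value from every process and applies the given
# reduction on each of them.
import torch_xla.core.xla_model as xm

def reduce_eval_loss(local_eval_loss: float) -> float:
    return xm.mesh_reduce("eval_loss", local_eval_loss,
                          lambda losses: sum(losses) / len(losses))
```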

Commits on Aug 29, 2020

  1. ac47458
  2. 0f58903
  3. 22933e6

Commits on Aug 30, 2020

  1. 563485b
  2. a584761
  3. d176aaa
  4. 0eecace
  5. clearly indicate shuffle=False (huggingface#6312)

    * Clarify shuffle
    
    * clarify shuffle
    
    Co-authored-by: Kevin Canwen Xu <canwenxu@126.com>
    xujiaze13 and JetRunner committed Aug 30, 2020 · 32fe440
  6. dfa10a4
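Commit 5 above is a documentation clarification; as a hedged illustration of what spelling out `shuffle=False` looks like, a toy example (not the PR's diff):

```python
# Toy illustration of making shuffle=False explicit (not the PR's diff).
from torch.utils.data import DataLoader

eval_dataset = list(range(16))  # stand-in for a real evaluation dataset

# Passing shuffle=False explicitly documents that examples are read in
# order, rather than relying on the DataLoader default.
eval_dataloader = DataLoader(eval_dataset, batch_size=4, shuffle=False)

for batch in eval_dataloader:
    print(batch)  # tensor([0, 1, 2, 3]), tensor([4, 5, 6, 7]), ...
```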

Commits on Aug 31, 2020

  1. Style

    LysandreJik committed Aug 31, 2020 · 0e83769
  2. Patch logging issue

    LysandreJik committed Aug 31, 2020 · 05c3214
  3. 4561f05
  4. 895d394
  5. Dataset and DataCollator for BERT Next Sentence Prediction (NSP) task (huggingface#6644)
    
    * add datacollator and dataset for next sentence prediction task
    
    * bug fix (number of special tokens & sequence truncation)
    
    * bug fix (+ dict inputs support for data collator)
    
    * add padding for nsp data collator; renamed cached files to avoid conflict.
    
    * add test for nsp data collator
    
    * Style
    
    Co-authored-by: Lysandre Debut <lysandre@huggingface.co>
    Co-authored-by: Lysandre <lysandre.debut@reseau.eseo.fr>
    3 people committed Aug 31, 2020 · 2de7ee0
  6. d2f9cb8
  7. c48546c
  8. Only access loss tensor every logging_steps

    * tensor.item() was being called every step. This must not be done
    for XLA:TPU tensors, as it forces TPU<>CPU communication at each
    step and is terrible for performance. On RoBERTa MLM, for example,
    it reduces step time by 30%; the gain should be larger for
    models/tasks with shorter step times.
    * Train batch size was not correct when a user uses the
    `per_gpu_train_batch_size` flag.
    * Average-reduce loss across eval shards.
    jysohn23 committed Aug 31, 2020 · ac03af4
  9. db74df3
  10. comments

    jysohn23 committed Aug 31, 2020 · 2b981cd
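As an illustration of commit 5 above (huggingface#6644), a hedged usage sketch of the NSP dataset and data collator it describes. The class names, arguments, and input-file format below are assumptions based on the transformers API of that era, not copied from the diff; `corpus.txt` is a hypothetical path.

```python
# Hedged usage sketch for the NSP utilities described in commit 5 above.
# Class names and arguments are assumptions, not copied from the PR's diff.
from transformers import (
    BertTokenizer,
    DataCollatorForNextSentencePrediction,
    TextDatasetForNextSentencePrediction,
)

tokenizer = BertTokenizer.from_pretrained("bert-base-uncased")

# Builds sentence-pair examples (with random "not next" sentences) from a
# plain-text corpus; file format and block_size are assumptions.
dataset = TextDatasetForNextSentencePrediction(
    tokenizer=tokenizer,
    file_path="corpus.txt",  # hypothetical path
    block_size=128,
)

# Pads batches of sentence-pair examples, per the commit's description.
data_collator = DataCollatorForNextSentencePrediction(tokenizer=tokenizer)
```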