IFU-master-2022-05-10 and "stable_train_samples_per_second" addition #12

rraminen · 2022-05-11T15:21:20Z

This PR includes the following

IFU till 10 May 2022
Added "stable_train_samples_per_second" code (d5ba921)

* fix report cat path * fix report cat path Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>

* Add onnx configuration for bigbird-pegasus * Modify docs

* split single_gpu and multi_gpu * update needs in send_result Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>

… in case of overflowing tokens (huggingface#17092) * add get_overflowing_images function to ensure 1-to-1 mapping between samples and images in LayoutLMv2Processor * make style * add test for overflowing_tokens, change assert to ValueError, avoiding unrelated formatting changes * change line length by passing --preview into black

…ggingface#17123) * Add type hints for remaining BigBirdPegasus models Here I added type hints to the BigBirdPegasusForCausalLM class. * Add missing type hints for Data2VecText models Added type hints to the Data2VecTextForCausalLM, Data2VecTextForMaskedLM, Data2VecTextForMultipleChoice, Data2VecTextForQuestionAnswering, Data2VecTextForSequenceClassification, and Data2VecTextForTokenClassification classes.

* update docs of length_penalty * Revert "update docs of length_penalty" This reverts commit 466bf48. * add mobilebert onnx config * address suggestions * Update auto.mdx * Update __init__.py * Update features.py

* PyTorch FSDP integration in Trainer * reformatting make style and make quality are now compliant. * Updating dependency check * Trigger CI Co-authored-by: Sylvain Gugger <Sylvain.gugger@gmail.com>

…ith try-except (huggingface#16578) * rebase and isort * modify cookiecutter init * fix cookiecutter auto imports * fix clean_frameworks_in_init * fix add_model_to_main_init * blackify * replace unnecessary f-strings * update yolos imports * fix roberta import bug * fix yolos missing dependency * fix add_model_like and cookiecutter bug * fix repository consistency error * modify cookiecutter, fix add_new_model_like * remove stale line Co-authored-by: Dom Miketa <dmiketa@exscientia.co.uk>

…uggingface#17068) Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> - Adds auto_batch_size finder - Moves training loop to an inner training loop

…huggingface#17130) * ensure mlflow.end_run() is executed at end of training when mlflow.start_run() was executed by the callback * add debug msg * add support for MLFLOW_TAGS, MLFLOW_RUN_ID, and MLFLOW_NESTED_RUN * update to support python 3.6+ * Validate env variables using ENV_VARS_TRUE_VALUES * Empty-Commit

* LogSumExp trick `question_answering` pipeline. * Adding a failing test.

Co-authored-by: Dom Miketa <dmiketa@exscientia.co.uk>

* [trainer] sharded _load_best_model probably needs a test? * undo delete

…2695) * model zoo take 2 * add deberta * new param for zero2 * doc update * doc update * add layoutlm * bump deepspeed * add deberta-v2, funnel, longformer * new models * style * add t5_v1 * update TAPAS status * reorg problematic models * move doc to another PR * style * fix checkpoint check test * making progress on more models running * cleanup * new version * cleanup

…ingface#17162)

* add support for MLFLOW_FLATTEN_PARAMS * ensure key is str * fix style and update warning msg * Empty commit to trigger CI * fix bug in check_inits.py * add unittest for flatten_dict utils * fix 'NoneType' object is not callable on __del__ * add generic flatten_dict unittest to SPECIAL_MODULE_TO_TEST_MAP * fix style

* unhardcode pretrained model path, make it a class var * add tests for mobilebert tokenizer * allow tempfiles for vocab & merge similarity test to autodelete * add explanatory comments * remove unused imports, let make style do its.. thing * remove inheritance and use BERT tok tests for MobileBERT * Update tests/mobilebert/test_tokenization_mobilebert.py Co-authored-by: SaulLu <55560583+SaulLu@users.noreply.github.com> * amend class names, remove unused import, add fix for mobilebert's hub pathname * unhardcode pretrained model path, make it a class var * add tests for mobilebert tokenizer * allow tempfiles for vocab & merge similarity test to autodelete * add explanatory comments * remove unused imports, let make style do its.. thing * remove inheritance and use BERT tok tests for MobileBERT * Update tests/mobilebert/test_tokenization_mobilebert.py Co-authored-by: SaulLu <55560583+SaulLu@users.noreply.github.com> * amend class names, remove unused import, add fix for mobilebert's hub pathname * amend paths for model tests being in models/ subdir of /tests * explicitly rm test from prev path Co-authored-by: SaulLu <55560583+SaulLu@users.noreply.github.com>

HuggingFaceDocBuilderDev · 2022-05-11T15:40:31Z

The documentation is not available anymore as the PR was closed or merged.

rraminen · 2022-05-11T19:42:17Z

Filed #13 instead of this PR. Hence closing this.

stevhliu and others added 24 commits May 5, 2022 15:20

Fix link to example scripts (huggingface#17103)

cad61b6

Fix self-push CI report path in cat (huggingface#17111)

351cdbd

* fix report cat path * fix report cat path Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>

Added BigBirdPegasus onnx config (huggingface#17104)

215e068

* Add onnx configuration for bigbird-pegasus * Modify docs

split single_gpu and multi_gpu (huggingface#17083)

3212afa

* split single_gpu and multi_gpu * update needs in send_result Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>

add mobilebert onnx configs (huggingface#17029)

dc3645d

* update docs of length_penalty * Revert "update docs of length_penalty" This reverts commit 466bf48. * add mobilebert onnx config * address suggestions * Update auto.mdx * Update __init__.py * Update features.py

PyTorch FSDP integration in Trainer (huggingface#17136)

05fc176

* PyTorch FSDP integration in Trainer * reformatting make style and make quality are now compliant. * Updating dependency check * Trigger CI Co-authored-by: Sylvain Gugger <Sylvain.gugger@gmail.com>

Fix quality and repo consistency

7783fa6

Add the auto_find_batch_size capability from Accelerate into Trainer (h…

2fbb237

…uggingface#17068) Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> - Adds auto_batch_size finder - Moves training loop to an inner training loop

Fix all docs for accelerate install directions (huggingface#17145)

d719bcd

LogSumExp trick question_answering pipeline. (huggingface#17143)

6d80c92

* LogSumExp trick `question_answering` pipeline. * Adding a failing test.

train args defaulting None marked as Optional (huggingface#17156)

1766fa2

Co-authored-by: Dom Miketa <dmiketa@exscientia.co.uk>

[trainer] sharded _load_best_model (huggingface#17150)

9aeacfe

* [trainer] sharded _load_best_model probably needs a test? * undo delete

Fixing the output of code examples in the preprocessing chapter (hugg…

259eeb6

…ingface#17162)

missing file (huggingface#17164)

976835d

Fix template init (huggingface#17163)

4ad2f68

Add DebertaV2ForMultipleChoice (huggingface#17135)

48a8f3d

Added start_train_stable_time

d5ba921

rraminen requested a review from amathews-amd May 11, 2022 15:21

rraminen closed this May 11, 2022

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

IFU-master-2022-05-10 and "stable_train_samples_per_second" addition #12

IFU-master-2022-05-10 and "stable_train_samples_per_second" addition #12

rraminen commented May 11, 2022 •

edited

HuggingFaceDocBuilderDev commented May 11, 2022 •

edited

rraminen commented May 11, 2022

IFU-master-2022-05-10 and "stable_train_samples_per_second" addition #12

IFU-master-2022-05-10 and "stable_train_samples_per_second" addition #12

Conversation

rraminen commented May 11, 2022 • edited

HuggingFaceDocBuilderDev commented May 11, 2022 • edited

rraminen commented May 11, 2022

rraminen commented May 11, 2022 •

edited

HuggingFaceDocBuilderDev commented May 11, 2022 •

edited