Update from source #9

Pradhy729 · 2020-08-11T19:04:51Z

Has huggingface#6024 which is useful for tests.

fix typo: ckeckpoint->checkpoint

* refactor almost identical tests
* important to add a clear assert error message
* make the assert error even more descriptive than the original bt

* TFAlbertFor{TokenClassification, MultipleChoice}
* Patch models
* BERT and TF BERT info s
* Update check_repo

* Cache Github Actions CI
* Remove useless file

* Add colab button
* Add colab link for tutorials

…6377)
* correct encoder decoder model
* Apply suggestions from code review
* apply sylvains suggestions

* improve names and tests longformer
* more and better tests for longformer
* add first tf test
* finalize tf basic op functions
* fix merge
* tf shape test passes
* narrow down discrepancies
* make longformer local attn tf work
* correct tf longformer
* add first global attn function
* add more global longformer func
* advance tf longformer
* finish global attn
* upload big model
* finish all tests
* correct false any statement
* fix common tests
* make all tests pass except keras save load
* fix some tests
* fix torch test import
* finish tests
* fix test
* fix torch tf tests
* add docs
* finish docs
* Update src/transformers/modeling_longformer.py
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>
* Update src/transformers/modeling_tf_longformer.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
* apply Lysandres suggestions
* reverse to assert statement because function will fail otherwise
* applying sylvains recommendations
* Update src/transformers/modeling_longformer.py
Co-authored-by: Sam Shleifer <sshleifer@gmail.com>
* Update src/transformers/modeling_tf_longformer.py
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
Co-authored-by: Sam Shleifer <sshleifer@gmail.com>

* Chunked feed forward for Bert
  This is an initial implementation to test applying feed forward chunking for BERT. Will need additional modifications based on output and benchmark results.
* Black and cleanup
* Feed forward chunking in BertLayer class.
* Isort
* add chunking for all models
* fix docs
* Fix typo
Co-authored-by: patrickvonplaten <patrick.v.platen@gmail.com>
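
For readers unfamiliar with the term, here is a rough sketch of the general idea behind feed forward chunking. It is not the code from these commits. A position-wise feed-forward layer treats every sequence position independently, so applying it to slices of the sequence and concatenating the results should give the same output as applying it to the whole sequence, while only one slice is processed at a time. The names `feed_forward`, `apply_chunked`, and `chunk_size` are hypothetical, plain Python lists stand in for tensors, and treating `chunk_size == 0` as "no chunking" is an assumption of this sketch.

```python
from typing import Callable, List, Sequence

Vector = List[float]


def feed_forward(hidden: Sequence[Vector]) -> List[Vector]:
    """Toy position-wise layer: each position is transformed independently."""
    return [[max(0.0, 2.0 * x + 1.0) for x in vec] for vec in hidden]


def apply_chunked(
    fn: Callable[[Sequence[Vector]], List[Vector]],
    hidden: Sequence[Vector],
    chunk_size: int,
) -> List[Vector]:
    """Apply fn to consecutive slices of the sequence and concatenate the results.

    chunk_size == 0 means "no chunking" (an assumption made for this sketch).
    """
    if chunk_size < 0:
        raise ValueError("chunk_size must be non-negative")
    if chunk_size == 0:
        return fn(hidden)
    output: List[Vector] = []
    for start in range(0, len(hidden), chunk_size):
        # Only one slice of the sequence is handed to fn at a time.
        output.extend(fn(hidden[start:start + chunk_size]))
    return output


hidden_states = [[-1.0, 0.5], [2.0, -3.0], [0.0, 1.0]]
full = feed_forward(hidden_states)
chunked = apply_chunked(feed_forward, hidden_states, chunk_size=2)
```

In this toy setting `full` and `chunked` hold the same values; the trade-off in a real model would be lower peak memory against more, smaller calls.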

* add pl_glue example test
* for now just test that it runs, next validate results of eval or predict?
* complete the run_pl_glue test to validate the actual outcome
* worked on my machine, CI gets less accuracy - trying higher epochs
* match run_pl.sh hparms
* more epochs?
* trying higher lr
* for now just test that the script runs to a completion
* correct the comment
* if cuda is available, add --fp16 --gpus=1 to cover more bases
* style

* testing utils: capturing std streams context manager
* style
* missing import
* add the origin of this code
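
As an illustration of what a context manager for capturing std streams can look like, here is a minimal sketch built only on the standard library. It is not the helper added in this commit; `capture_std` and `CapturedStreams` are hypothetical names chosen for the example.

```python
import contextlib
import io
import sys


class CapturedStreams:
    """Holds whatever was written to stdout and stderr inside the block."""

    def __init__(self) -> None:
        self.out = ""
        self.err = ""


@contextlib.contextmanager
def capture_std():
    """Redirect sys.stdout and sys.stderr into in-memory buffers."""
    captured = CapturedStreams()
    out_buf, err_buf = io.StringIO(), io.StringIO()
    with contextlib.redirect_stdout(out_buf), contextlib.redirect_stderr(err_buf):
        try:
            yield captured
        finally:
            # Copy buffer contents out even if the block raised.
            captured.out = out_buf.getvalue()
            captured.err = err_buf.getvalue()


with capture_std() as cs:
    print("hello")
    print("oops", file=sys.stderr)
```

After the block, `cs.out` and `cs.err` can be compared against expected output in a test. Note that this sketch only intercepts writes that go through `sys.stdout` and `sys.stderr`; output written directly to the underlying file descriptors is not captured.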

* fix tokenizer saving and loading bugs when adding AddedToken to additional special tokens
* Add tokenizer test
* Style
* Style 2
Co-authored-by: Lysandre <lysandre.debut@reseau.eseo.fr>

* Warn if debug requested without TPU fixes (#6308)
  Check whether a PyTorch-compatible TPU is available before attempting to print TPU metrics after training has completed. This way, users who apply `--debug` without reading the documentation aren't surprised by a stack trace.
* Style
Co-authored-by: Lysandre <lysandre.debut@reseau.eseo.fr>

* Optimized banned token masking
* Avoid duplicate EOS masking if in bad_words_ids
* Updated mask generation to handle empty banned token list
* Addition of unit tests for the updated bad_words_ids masking
* Updated timeout handling in `test_postprocess_next_token_scores_large_bad_words_list` unit test
* Updated timeout handling in `test_postprocess_next_token_scores_large_bad_words_list` unit test (timeout does not work on Windows)
* Moving Marian import to the test context to allow TF only environments to run
* Moving imports to torch_available test
* Updated operations device and test
* Updated operations device and test
* Added docstring and comment for in-place scores modification
* Moving test to own test_generation_utils, use of lighter models for testing
* removed unneeded imports in test_modeling_common
* revert formatting change for ModelTesterMixin
* Updated caching, simplified eos token id test, removed unnecessary @require_torch
* formatting compliance
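
To make the banned token masking idea concrete, here is a small sketch in plain Python. It is not the optimized implementation from these commits: it only shows the behavior such a mask aims for, where the score of every banned token id is set to negative infinity so that token cannot be selected. The function name `mask_banned_tokens` and the nested-list representation of scores are assumptions of this example.

```python
import math
from typing import List


def mask_banned_tokens(
    scores: List[List[float]], banned_tokens: List[List[int]]
) -> List[List[float]]:
    """Set the score of each banned token id to -inf, one row per batch item.

    The scores are modified in place and also returned for convenience.
    An empty banned list leaves its row untouched.
    """
    if len(scores) != len(banned_tokens):
        raise ValueError("need one banned-token list per batch row")
    for row, banned in zip(scores, banned_tokens):
        # set() drops duplicates, e.g. an EOS id that was listed twice.
        for token_id in set(banned):
            row[token_id] = -math.inf
    return scores


batch_scores = [[0.1, 0.2, 0.3], [0.4, 0.5, 0.6]]
mask_banned_tokens(batch_scores, [[1, 1], []])
```

Here the first row has token id 1 masked and the second row, whose banned list is empty, keeps its original scores.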

* Create README.md
* Update model_cards/akhooli/gpt2-small-arabic-poetry/README.md
* Update model_cards/akhooli/gpt2-small-arabic-poetry/README.md
* Update model_cards/akhooli/gpt2-small-arabic-poetry/README.md
* Update model_cards/akhooli/gpt2-small-arabic-poetry/README.md
Co-authored-by: Julien Chaumond <chaumond@gmail.com>

* Create README.md
  Model card for https://huggingface.co/akhooli/gpt2-small-arabic
* Update model_cards/akhooli/gpt2-small-arabic/README.md
Co-authored-by: Julien Chaumond <chaumond@gmail.com>

Co-authored-by: Jingqing Zhang <jingqing.zhang15@imperial.ac.uk>

LysandreJik and others added 28 commits August 10, 2020 08:11

Temporarily de-activate TPU CI

1bbc54a

Update modeling_tf_utils.py (#6372)

3a556b0

the test now works again (#6371)

0830e79

correct pl link in readme (#6364)

35eb96d

refactor almost identical tests (#6339)

1429b92

Small docfile fixes (#6328)

6028ed9

Patch models (#6326)

b99098a

Ci GitHub caching (#6382)

79588e6

Colab button (#6389)

3e0fe3c

Fix links for open in colab (#6391)

06bc347

[EncoderDecoderModel] add a add_cross_attention boolean to config (#6377)

3425936

[s2s] Script to save wmt data to disk (#6403)

b9ecd92

Add missing docker arg for TPU CI. (#6393)

f65ac1f

Add TPU testing once again

8a3db6b

testing utils: capturing std streams context manager (#6231)

83984a6

Fix tokenizer saving and loading error (#6026)

cdf1f7e

switch Hindi-BERT to S3 README (#6396)

3ae3078

Create README.md (#6413)

00ce881

Create Model Card File (#6357)

1d1d5be

pl version: examples/requirements.txt is single source of truth (#6309)

7c6a085

[s2s] wmt download script use less ram (#6405)

f6cb0f8

PegasusForConditionalGeneration (torch version) (#6340)

66fa8ce

Pradhy729 merged commit b18cb64 into Pradhy729:master Aug 11, 2020
