Skip to content

Conversation

@ydshieh
Copy link
Collaborator

@ydshieh ydshieh commented Mar 8, 2022

What does this PR do?

Use tiny models for get_pretrained_model in TFEncoderDecoderModelTest.

This is originally for avoiding GPU OOM for TFRembertEncoderDecoderModelTest on CI daily testing.
But @patrickvonplaten suggests that we should actually use the small model in the following quote:

... think we can rename it to test_model_save_loaf_from_pretrained(...) 😉 I think this "real" name was propragated since the first encoder-decoder tests existed in PyTorch. Since the test does no integration testing (e.g. checking if the output corresponds to something reasonable) it makes 0 difference whether we use dummy weights or no dummy weights here ...

@HuggingFaceDocBuilderDev
Copy link

HuggingFaceDocBuilderDev commented Mar 8, 2022

The documentation is not available anymore as the PR was closed or merged.

@ydshieh ydshieh marked this pull request as ready for review March 8, 2022 14:57
@ydshieh ydshieh changed the title [WIP] Use tiny models for get_pretrained_model in TFEncoderDecoderModelTest Use tiny models for get_pretrained_model in TFEncoderDecoderModelTest Mar 8, 2022
Copy link
Member

@LysandreJik LysandreJik left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Looks good to me, thank you @ydshieh!

Copy link
Contributor

@patrickvonplaten patrickvonplaten left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Perfect!

@ydshieh ydshieh merged commit b7fa1e3 into huggingface:master Mar 9, 2022
@ydshieh ydshieh deleted the use_tiny_model_for_tf_enc_dec_test branch March 9, 2022 16:16
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

4 participants