Skip to content

Conversation

chenmoneygithub
Copy link
Contributor

  1. Fix the curl command, and add curl instruction for vocab downloading.
  2. Add output flag for glue_fine_tuning script.

Copy link
Member

@mattdangerw mattdangerw left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Looks good! Few nits.

[here](https://storage.googleapis.com/tensorflow/keras-nlp/examples/bert/bert_vocab_uncased.txt),
or in your terminal run:
```shell
curl https://storage.googleapis.com/tensorflow/keras-nlp/examples/bert/bert_vocab_uncased.txt \
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

same here curl -O url to keep this a one liner.

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I am doing a renaming here - in the README we are mixing vocab.txt with bert_vocab_uncased.txt, so to unify them I am only keeping vocab.txt at every reference.


```shell
python3 examples/bert/run_glue_finetuning.py \
--saved_model_input path/to/model/ \
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Can probably just leave this as is?

We don't really need to the output model from fine-tuning to be saved anywhere usually. Really what we would want is to save some predictions in the GLUE submission format (tracked here #115).

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Yea, I feel that users may find the need to reevaluate the model? If we do not save the model then people won't have access to the finetuned model except finetuning again.


```shell
curl https://dumps.wikimedia.org/enwiki/latest/enwiki-latest-pages-articles.xml.bz2
curl -O https://dumps.wikimedia.org/enwiki/latest/enwiki-latest-pages-articles.xml.bz2
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

looks like trailing space here.

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

good catch!

@chenmoneygithub chenmoneygithub merged commit 4ba8729 into keras-team:master May 10, 2022
@chenmoneygithub chenmoneygithub deleted the small-fix branch May 10, 2022 20:47
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants