Skip to content

Commit

Permalink
rename
Browse files Browse the repository at this point in the history
  • Loading branch information
zheyuye committed Jun 30, 2020
1 parent fa011aa commit 74178e2
Show file tree
Hide file tree
Showing 2 changed files with 1 addition and 1 deletion.
2 changes: 1 addition & 1 deletion scripts/pretraining/README.md
Original file line number Diff line number Diff line change
Expand Up @@ -3,7 +3,7 @@
Following the instruction of [Prepare OpenWebTextCorpus](../datasets/pretrain_corpus#openwebtext), download and prepare the dataset, obtaining a total of 20610 text files in the folder `prepared_owt`.

```bash
python preprocesse_owt.py --input prepared_owt --output preprocessed_owt --max_seq_length 128
python data_preprocessing.py --input prepared_owt --output preprocessed_owt --max_seq_length 128
```
The above command allows us to generate the preprocessed Numpy features saved in `.npz`.
# Pretrain Model
Expand Down
File renamed without changes.

0 comments on commit 74178e2

Please sign in to comment.