
--cache_dir argument in run_lm_finetuning.py not used at all #1623

Closed

mpavlovic opened this issue Oct 24, 2019 · 0 comments
🐛 Bug

Model I am using (Bert, XLNet....): GPT-2

Language I am using the model on (English, Chinese....): English

The problem arises when using:

  • the official example scripts: run_lm_finetuning.py

The task I am working on is:

  • my own task or dataset: Language model finetuning on custom dataset from human resources domain

To Reproduce

Steps to reproduce the behavior:

  1. Clone the repo
  2. Navigate to transformers/examples directory
  3. Prepare custom train and test datasets (.txt files)
  4. Create ./cache directory
  5. Run the following command in a terminal (replacing the custom_ placeholders):
python run_lm_finetuning.py \
   --output_dir=<custom_output_dir_path> \
   --model_type=gpt2 \
   --model_name_or_path=gpt2 \
   --do_train \
   --train_data_file=<custom_train_data_file> \
   --do_eval \
   --eval_data_file=<custom_eval_data_file> \
   --per_gpu_eval_batch_size=1   \
   --per_gpu_train_batch_size=1   \
   --save_total_limit=2 \
   --num_train_epochs=1 \
   --cache_dir=./cache
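
As a quick sanity check that .from_pretrained() does honor cache_dir when it is passed explicitly, here is a minimal sketch (the paths are illustrative, not taken from the script):

# Hypothetical check: download the GPT-2 tokenizer files into ./cache by
# passing cache_dir directly to from_pretrained().
import os
from transformers import GPT2Tokenizer

os.makedirs("./cache", exist_ok=True)
tokenizer = GPT2Tokenizer.from_pretrained("gpt2", cache_dir="./cache")
print(os.listdir("./cache"))  # the downloaded vocab/merges files should land here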

Expected behavior

When the model is downloaded from S3, it is stored in the default cache directory under <user_home>/.cache/transformers/ instead of in ./cache, as specified by the --cache_dir argument. It seems the --cache_dir argument isn't passed to the .from_pretrained() calls on lines 472, 473, and 477 of the run_lm_finetuning.py script.
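
A likely fix is to thread args.cache_dir through those three calls. A sketch, assuming the script's usual config_class / tokenizer_class / model_class setup (variable names are based on the script's structure and may differ slightly):

# Pass the user-supplied cache directory through to each from_pretrained()
# call; from_pretrained() accepts a cache_dir keyword argument.
config = config_class.from_pretrained(
    args.config_name if args.config_name else args.model_name_or_path,
    cache_dir=args.cache_dir if args.cache_dir else None,
)
tokenizer = tokenizer_class.from_pretrained(
    args.tokenizer_name if args.tokenizer_name else args.model_name_or_path,
    cache_dir=args.cache_dir if args.cache_dir else None,
)
model = model_class.from_pretrained(
    args.model_name_or_path,
    from_tf=bool(".ckpt" in args.model_name_or_path),
    config=config,
    cache_dir=args.cache_dir if args.cache_dir else None,
)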

Environment

  • OS: Ubuntu 18.04
  • Python version: 3.6.6
  • PyTorch version: 1.3
  • PyTorch Transformers version (or branch): 2.1.1
  • Using GPU? Yes
  • Distributed or parallel setup? No
  • Any other relevant information:

Additional context

@mpavlovic changed the title from "cache_dir argument in run_lm_finetuning.py not used at all" to "--cache_dir argument in run_lm_finetuning.py not used at all" on Oct 24, 2019
thomwolf added a commit that referenced this issue Nov 5, 2019
graehl added a commit to graehl/pytorch-transformers that referenced this issue Nov 6, 2019
* upstream/master:
  Add RoBERTa-based GPT-2 Output Detector from OpenAI
  Fix other PyTorch models
  Fix BERT
  [tests] Flag to test on cuda
  [tests] get rid of warning
  [run_tf_glue] Add comment for context
  misc doc
  Updating docblocks in optimizers.py
  GPT-2 XL
  add authors for models
  Fix huggingface#1686
  add progress bar for convert_examples_to_features
  [inputs_embeds] All PyTorch models
  docstring + check
  model forwards can take an inputs_embeds param
  Fix huggingface#1623
  Fixing mode in evaluate during training
  Add speed log to examples/run_squad.py