ConnectionError: Couldn't reach https://raw.githubusercontent.com #2787

Closed
jinec opened this issue Aug 11, 2021 · 9 comments
Labels
bug Something isn't working

Comments

jinec commented Aug 11, 2021

Hello,
I am trying to run run_glue.py and it gives me this error:

Traceback (most recent call last):
  File "E:/BERT/pytorch_hugging/transformers/examples/pytorch/text-classification/run_glue.py", line 546, in <module>
    main()
  File "E:/BERT/pytorch_hugging/transformers/examples/pytorch/text-classification/run_glue.py", line 250, in main
    datasets = load_dataset("glue", data_args.task_name, cache_dir=model_args.cache_dir)
  File "C:\install\Anaconda3\envs\huggingface\lib\site-packages\datasets\load.py", line 718, in load_dataset
    use_auth_token=use_auth_token,
  File "C:\install\Anaconda3\envs\huggingface\lib\site-packages\datasets\load.py", line 320, in prepare_module
    local_path = cached_path(file_path, download_config=download_config)
  File "C:\install\Anaconda3\envs\huggingface\lib\site-packages\datasets\utils\file_utils.py", line 291, in cached_path
    use_auth_token=download_config.use_auth_token,
  File "C:\install\Anaconda3\envs\huggingface\lib\site-packages\datasets\utils\file_utils.py", line 623, in get_from_cache
    raise ConnectionError("Couldn't reach {}".format(url))
ConnectionError: Couldn't reach https://raw.githubusercontent.com/huggingface/datasets/1.7.0/datasets/glue/glue.py

The command I am running is:

    python run_glue.py --model_name_or_path bert-base-cased --task_name mrpc --do_train --do_eval --max_seq_length 128 --per_device_train_batch_size 32 --learning_rate 2e-5 --num_train_epochs 3 --output_dir ./tmp/mrpc/

Is this something on my end? From what I can tell, this was fixed by @fullyz a few months ago.
Thank you!

jinec added the bug label Aug 11, 2021
Author

jinec commented Aug 11, 2021

The failing call is located here:

    if data_args.task_name is not None:
        # Downloading and loading a dataset from the hub.
        datasets = load_dataset("glue", data_args.task_name, cache_dir=model_args.cache_dir)

Member

albertvillanova commented Aug 11, 2021

Hi @jinec,

From time to time we get this kind of ConnectionError coming from the github.com website: https://raw.githubusercontent.com

Normally, it should work if you wait a little and then retry.

Could you please confirm if the problem persists?
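
For readers who want to automate the "wait a little and then retry" advice, here is a minimal sketch of a retry loop with exponential backoff. The helper name `retry` and its parameters are hypothetical and not part of the datasets library. It assumes the failure surfaces as Python's built-in ConnectionError, which is what the traceback above suggests.

```python
import time


def retry(operation, attempts=4, base_delay=1.0, sleep=time.sleep):
    """Call operation() until it succeeds or the attempts run out.

    Waits base_delay, 2 * base_delay, 4 * base_delay, ... seconds between tries.
    `retry` is a hypothetical helper, not a datasets API.
    """
    for attempt in range(attempts):
        try:
            return operation()
        except ConnectionError:
            if attempt == attempts - 1:
                raise  # out of attempts: surface the original error
            sleep(base_delay * 2 ** attempt)


# Possible usage, assuming the datasets library is installed:
# from datasets import load_dataset
# glue = retry(lambda: load_dataset("glue", "mrpc"))
```

Retrying only helps with transient outages. If the host is unreachable from your network at all, the last attempt will raise the same error.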

Author

jinec commented Aug 12, 2021

I cannot connect, even with a web browser. Please check whether there is a problem.

@albertvillanova
Member

renmada commented Aug 17, 2021

I can access https://raw.githubusercontent.com/huggingface/datasets/1.7.0/datasets/glue/glue.py without problem...

I cannot access https://raw.githubusercontent.com/huggingface/datasets either. I am in China.

Author

jinec commented Aug 18, 2021

Finally I can access it, using the superfast software. Thanks.

@jinec jinec closed this as completed Aug 18, 2021
@RuizhuoXu

Finally I can access it, using the superfast software. Thanks.

Excuse me, I have the same problem as you, could you please tell me how to solve it?
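
The thread does not say what the "superfast software" is. Assuming it is a proxy or VPN-style tool that exposes a local HTTP proxy, one common way to route Python downloads through it is to set the standard HTTP_PROXY and HTTPS_PROXY environment variables before loading the dataset. The helper name `use_proxy` and the proxy address in the usage comment are hypothetical; substitute whatever your own tool reports.

```python
import os


def use_proxy(proxy_url, env=os.environ):
    """Point HTTP(S) traffic at proxy_url via the standard environment variables.

    `use_proxy` is a hypothetical helper. Libraries such as requests read
    these variables by default when making connections.
    """
    for name in ("HTTP_PROXY", "HTTPS_PROXY"):
        env[name] = proxy_url
    return env


# Possible usage (the address is a placeholder, not a real recommendation):
# use_proxy("http://127.0.0.1:7890")
# from datasets import load_dataset
# glue = load_dataset("glue", "mrpc")
```

The variables must be set before the download starts, so call the helper at the top of the script or export the variables in the shell instead.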

mikechen66 commented Sep 12, 2023

It is not related to the region. The ConnectionError with http://raw.githubusercontent.com has persisted with the load_data function in the datasets module. However, you can work around it by downloading the dataset from GitHub with either wget or an ssl snippet, as follows.

$ wget https://raw.githubusercontent.com/... --no-check-certificate

or, for tfds, nltk, or pandas.read_csv downloads, as follows:

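import ssl

# Warning: this disables HTTPS certificate verification for the whole process.
try:
    _create_unverified_https_context = ssl._create_unverified_context
except AttributeError:
    # Older Python versions that do not verify HTTPS certificates by default.
    pass
else:
    # Make unverified contexts the default for urllib-based HTTPS requests.
    ssl._create_default_https_context = _create_unverified_https_context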

So it is most probably a problem on GitHub's side rather than on the users' side.
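
As a rough Python counterpart to the wget approach, here is a minimal sketch that saves a file to disk using only the standard library. The helper name `fetch_to_file` is hypothetical. It only helps if the URL is reachable through some route (for example another network or a proxy). Pointing load_dataset at the saved script is an assumption about the datasets version in use; script loading worked this way in the 1.x releases shown in the traceback.

```python
import shutil
import urllib.request
from pathlib import Path


def fetch_to_file(url, destination):
    """Download url to destination and return the local path.

    `fetch_to_file` is a hypothetical helper, not part of any library.
    """
    destination = Path(destination)
    destination.parent.mkdir(parents=True, exist_ok=True)
    with urllib.request.urlopen(url) as response, open(destination, "wb") as out:
        shutil.copyfileobj(response, out)  # stream the body to disk
    return destination


# Possible usage, assuming a datasets release that accepts a local script path:
# script = fetch_to_file(
#     "https://raw.githubusercontent.com/huggingface/datasets/1.7.0/datasets/glue/glue.py",
#     "glue.py",
# )
# from datasets import load_dataset
# glue = load_dataset(str(script), "mrpc")
```

Note that the loading script still downloads the actual data files from their own hosts, so those must be reachable as well.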

@JackeyWang777

I can access https://raw.githubusercontent.com/huggingface/datasets/1.7.0/datasets/glue/glue.py without problem...

I cannot access https://raw.githubusercontent.com/huggingface/datasets either. I am in China.

So, how did you solve this problem?
