Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

The problem about the third step:Download Dependancy #1

Closed
tangyuhao2016 opened this issue Apr 18, 2021 · 5 comments
Closed

The problem about the third step:Download Dependancy #1

tangyuhao2016 opened this issue Apr 18, 2021 · 5 comments

Comments

@tangyuhao2016
Copy link

Thank you for sharing such great work.

When I run the sh get_checkpoint.sh, I get the mistake like below:

Resolving icbu-ensa-sc.oss-cn-zhangjiakou.aliyuncs.com (icbu-ensa-sc.oss-cn-zhangjiakou.aliyuncs.com)... 47.92.17.218
Connecting to icbu-ensa-sc.oss-cn-zhangjiakou.aliyuncs.com (icbu-ensa-sc.oss-cn-zhangjiakou.aliyuncs.com)|47.92.17.218|:80... connected.
HTTP request sent, awaiting response... 403 Forbidden.

And when I click the link directly, I get the mistake like below:

This XML file does not appear to have any style information associated with it. The document tree is shown below.

AccessDenied
You have no right to access this object because of bucket acl.
607C319BB6DA383338EC6AFD
icbu-ensa-sc.oss-cn-zhangjiakou.aliyuncs.com

May you provide the solution?

@mczhuge
Copy link
Owner

mczhuge commented Apr 18, 2021 via email

@tangyuhao2016
Copy link
Author

Thank you for your help, the problem has been solved.
However, the download speed is so slow, I have tried several times and different networks are used. It's only tens of K per second, and it's very easy to interrupt.
Could you upload the source data to the Baidu cloud disk or Google cloud disk?

@mczhuge
Copy link
Owner

mczhuge commented Apr 19, 2021

Since the limits of authority, we cannot directly put these data in Baidu or Google disk right now.
But there may have two solutions:

  1. First, you can download these datasets by Xunlei or some other similar tools. I just copy the links to Xunlei, such as http://icbu-ensa-sc.oss-cn-zhangjiakou.aliyuncs.com/mingchen.zgmc/KaleidoBERT_TF_CODE/datasets/checkpoint/kaleidobert.ckpt-50683.data-00000-of-00001, and the download speed can achieve 10MB/s.
  2. Waiting for Alibaba Disk ^_^

I wish it could be helpful for you.

@tangyuhao2016
Copy link
Author

Thank you for your help. Even though I use Xunlei with the vip, the speed is only 200-300k/s. And the FashionGen dataset is very large, one download link is about 16.5g and there are several links.

@mczhuge
Copy link
Owner

mczhuge commented Apr 20, 2021

Yes. The pre-processed datasets are large.
I did not use the Xunlei VIP but also get a 7MB/s download speed.
Can you add my WeChat? ID: tjpxiaoming

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants