Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Corrupted files on google drive #12

Open
npmhung opened this issue Jun 24, 2022 · 13 comments
Open

Corrupted files on google drive #12

npmhung opened this issue Jun 24, 2022 · 13 comments

Comments

@npmhung
Copy link

npmhung commented Jun 24, 2022

Hello,

When unzipping files TNL2K_test_subset_p5.zip, TNL2K_test_subset_p3.zip, TNL2K_test_subset_p2.zip, I got the following error:

End-of-central-directory signature not found. Either this file is not
a zipfile, or it constitutes one disk of a multi-part archive. In the
latter case the central directory and zipfile comment will be found on
the last disk(s) of this archive.

Could you check again if you have uploaded everything for the test folder?

I used "unzip -qq FILENAME" to unzip the files. Do you use different package as gunzip or tar?

Thank you!

@wangxiao5791509
Copy link
Owner

Hi, these files are packed using zip tools in the windows system. can you try to unzip them in the windows OS? @npmhung

@npmhung
Copy link
Author

npmhung commented Jun 24, 2022

Besides, p1 and p7 are fine. Are they created on linux or window too?

The files in p4 and p6 are extracted without problems, but their created dates show the time when they are unzipped not the time they were created (I know this by checking unzip -l FILENAME to show the folder structure), so I'm concerned that whether or not those files are correctly zipped.

@wangxiao5791509
Copy link
Owner

Currently, I am not sure whether these files on google-drive are fine or not. My friends tell me the files downloaded from Baidu disk are OK. We use the 360-zip (link) to pack these folders. You can try this software on a PC with windows OS.

@wangxiao5791509
Copy link
Owner

If it still not working, please let me know. I will upload these files again. Thanks.

@npmhung
Copy link
Author

npmhung commented Jun 24, 2022

There are 2 problems:

  • Currently, I'm working on a remote linux server with strict security policy so it's impossible for me to use the 360zip or software from windows OS.
  • I checked the total size of all files on google drive (33.88GB). It's different from that on onedrive (roughly 84.89GB). Therefore, I think there is something wrong with the dataset on google drive.

So if it's possible, could you update the google drive? It's easy for me to download from that server.

Thank you.

@wangxiao5791509
Copy link
Owner

@npmhung Thanks for your feedback. I will check and update the link for the google drive.

@npmhung
Copy link
Author

npmhung commented Jun 24, 2022

Besides, it would be great if you could zip everything into 1 file or each train/test folder into a zip file (e.g. train.zip and test.zip), because google drive kinda limits the number large files that can be download within 24 hours range.

Thank you!

@wangxiao5791509
Copy link
Owner

@npmhung I will try this.

@npmhung
Copy link
Author

npmhung commented Jun 28, 2022

@wangxiao5791509 Do you have any updates on the issue?

Thanks!

@wangxiao5791509
Copy link
Owner

@npmhung Not yet. I will update this link in the following two days.

@wangxiao5791509
Copy link
Owner

@npmhung Hi, I have updated the links for the onedrive.

You can download it from the onedrive now. The googledrive version is still uploading, as it is very slow ...

@npmhung
Copy link
Author

npmhung commented Jul 8, 2022

@wangxiao5791509 Hi, thank you for your update.
Hope you still upload to google drive. I haven't figured out how to download from onedrive to my server yet, but I will give it another try.

@wangxiao5791509
Copy link
Owner

@npmhung Ok, I will continue to update the link for googledrive.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants