-
Notifications
You must be signed in to change notification settings - Fork 1.5k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Better Drive files download failure #1482
Comments
@Conchylicultor : do we need to just change the error message in this ticket? could you guide me a little bit so that I could work on it. |
Is there a way for users to e.g. make a copy of the files into our own Google Drive for a dataset, manually download them to the correct location, and proceed from there? Or any other manual workaround using the publicly-available celeba data? |
For drive links downloads can be done by extracting the For now
|
celeb_a tfrecord files: |
Is there an easier way to install the celeb_a dataset? I am trying the "download" manually method, but it is not helping at all. |
Has anyone tried executing the code cell twice? |
Looking for a workaround for this issue, I ended up finding a routine in the library that checks for the files on your own machine before attempting to download them. It was The database I intended to use was the CaltechBirds2010 and I found the Drive link here. |
I get this below error while I was trying to use deep_weeds dataset with this code " data_train, info = tfds.load("deep_weeds", with_info=True, split='train[:60%]',as_supervised=True) " NonMatchingChecksumError Traceback (most recent call last) 20 frames NonMatchingChecksumError: Artifact https://drive.google.com/uc?export=download&id=1xnK3B6K6KekDI55vwJ0vnc2IGoDga9cj, downloaded to /root/tensorflow_datasets/downloads/ucexport_download_id_1xnK3B6K6KekDI55vwJ0vnc2ITDlCjLc2rcwnx4HX2m4DkEyLfA722UJqaLRkfNhB6ec.tmp.68dd982dd0fd4809b12f3ef885ebe32f/download, has wrong checksum:
|
Download of drive urls sometimes fails with
NonMatchingChecksumError: Artifact https://drive.google.com/... has wrong checksum.
Explanation: Drive sometimes reject the download attempt, and the rejection page is downloaded instead of the data:
The best solution currently is to manually download the data (https://www.tensorflow.org/datasets/overview#manual_download_if_download_fails), rather than using the automated download which got rejected by drive.
Otherwise:
Not sure there can be a solution on Google Drive side, while preventing abuse.
On TFDS side, we could make the error message more explicit when we detect a drive URL.
The text was updated successfully, but these errors were encountered: