Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

How2 dataset downloading link unavailable #4767

Closed
chlorane opened this issue Nov 15, 2022 · 5 comments
Closed

How2 dataset downloading link unavailable #4767

chlorane opened this issue Nov 15, 2022 · 5 comments
Labels
Bug bug should be fixed

Comments

@chlorane
Copy link

Describe the bug
A clear and concise description of what the bug is.
I'm using your example program in eg2/how2_2000h, but when I run local/run_asr.sh --asr_tag asr_pretrain, the command line shows that the link to the dataset is corrupted.

Access denied with the following error:

    Cannot retrieve the public link of the file. You may need to change
    the permission to 'Anyone with the link', or have had many accesses.

You may still be able to access the file from the browser:

     https://drive.google.com/uc?id=sharing

Can you provide a new link?

@chlorane chlorane added the Bug bug should be fixed label Nov 15, 2022
@sw005320
Copy link
Contributor

Thanks for the report!
@roshansh-cmu, could you check the status?

@roshansh-cmu
Copy link
Contributor

roshansh-cmu commented Nov 15, 2022

Thanks for the question and interest in the How2 data. The data will be available for access within the next few days through a Google form. I will update here once that happens, and make a PR to the ESPNet How2 recipe page.

@chlorane
Copy link
Author

chlorane commented Nov 18, 2022

If I have the dataset, where shall I put under the directory, and which file shall I put?

@roshansh-cmu
Copy link
Contributor

Please request the dataset using the data release form from the How2 data repository : https://github.com/srvk/how2-dataset.

We follow ESPNet2/ Kaldi format for the training and inference. Please refer to documentation for ESPnet2.

Thank you for your patience

@roshansh-cmu
Copy link
Contributor

You may refer to our PR #4805- this when merged should be used to prepare data from the downloaded dataset bz2 file.

Thank you!

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Bug bug should be fixed
Projects
None yet
Development

No branches or pull requests

3 participants