Cannot download models locally #1070

Aksh97 · 2021-10-27T05:29:38Z

Hi, I tried to run the commands, which worked perfectly file.


from speechbrain.pretrained import EncoderDecoderASR

asr_model = EncoderDecoderASR.from_hparams(source="speechbrain/asr-crdnn-rnnlm-librispeech")

But when I looked over the model files, the files downloaded are just 158 kbs, and not the full files are downloaded.

Also, when I tried to download the files directly from Files and versions, the ckpt files are downloaded as zip. which cant be used later for inference. IS there any way to resolve it or it has been purposefully made like that.

The text was updated successfully, but these errors were encountered:

Gastron · 2021-10-27T11:45:07Z

The full files are downloaded, but if they're from HuggingFace Hub like here, the full files go the HuggingFace cache, and just symlinks are created, is this what you're actually seeing?

This was just recently discussed in #1055. As this surprises people, maybe we should change it. The HuggingFace cache is the standard location for HuggingFace downloads and the point of a cache is of course to avoid multiple copies of the same files. I'd like to know, though, what do you want to do with the full files - is it not sufficient to have symlinks?

Aksh97 · 2021-10-29T02:58:03Z

Hi @Gastron, thanks for the reply. I wanted to store those files and containerize them so that it does not needs to be downloaded whenever I run the container.
But unlike other repositories, where the full files are downloaded by default, instead of cache. Maybe for speechbrain we can put an arguement symlinks=true, to just add the symlinks and keep that by default. But if we need that to download full files, we can pass the arguement symlinks=false, we can at least download them.

mravanelli · 2021-10-29T03:04:44Z

Right, I think we should find a solution like that. It looks like users are a bit surprized by the default behavior and we might want to change it.

…

On Thu, Oct 28, 2021, 10:58 PM Akshay Sachdeva ***@***.***> wrote: Hi @Gastron <https://github.com/Gastron>, thanks for the reply. I wanted to store those files and containerize them so that it does not needs to be downloaded whenever I run the container. But unlike other repositories, where the full files are downloaded by default, instead of cache. Maybe for speechbrain we can put an arguement symlinks=true, to just add the symlinks and keep that by default. But if we need that to download full files, we can pass the arguement symlinks=false, we can at least download them. — You are receiving this because you are subscribed to this thread. Reply to this email directly, view it on GitHub <#1070 (comment)>, or unsubscribe <https://github.com/notifications/unsubscribe-auth/AEA2ZVXB7JAXDGJ2FFBKBBLUJIEUXANCNFSM5GZNSIFQ> .

Gastron · 2021-10-29T09:55:45Z

Right, noted.

Aksh97 · 2021-11-03T08:58:42Z

Thanks @Gastron and @mravanelli For the consideration.

Aksh97 · 2021-11-17T02:34:40Z

Hi There, Any further improvement or solution for this or it will take some time to implement the changes?

mravanelli · 2021-11-17T02:55:49Z

@Gastron, do we want to change the download folder in the end?

Gastron · 2021-11-17T11:48:47Z

Let's try to briefly discuss in the SpeechBrain core-team meeting today.

MClarkTurner · 2022-02-16T20:57:08Z

Since, this issue is still open and it has been about 4 months. I also wanted to comment that the current implementation is unintuitive. This is especially egregious given the standard set by PyTorch, upon which speechbrain is built and what an outsider would consider, at a glance, to be standard practice. Furthermore, it was only after stumbling upon this issue board and the comments in #1055 that I was able to resolve changing the location of the cache placement (which is an otherwise undocumented process). For those interested, the solution is the following code:

export HUGGINGFACE_HUB_CACHE=<new_directory>

otherwise the code will default to placing a ./cache/huggingface under your /home directory.

Adel-Moumen · 2023-09-01T14:16:43Z

Hello,

Is the issue still up? We merged a PR #1817 which is modifying a lot of things related to HuggingFace, Pretrainer and so on. Could you please let me know if now everything is working as intended?

Thanks.

Aksh97 closed this as completed Nov 15, 2021

Aksh97 reopened this Nov 17, 2021

mravanelli assigned Gastron Nov 17, 2021

anautsch added this to To do in CI/CD via automation Apr 21, 2022

anautsch moved this from To do to Performance & housekeeping in CI/CD Apr 21, 2022

anautsch mentioned this issue May 19, 2022

bug fix Pretrained.load_audio #1303

Open

anautsch mentioned this issue Jan 25, 2023

Extending pretrained interface for different fetching modes #1817

Merged

8 tasks

asumagic linked a pull request Mar 27, 2024 that will close this issue

Allow not using symlinks when fetching files #2476

Draft

13 tasks

asumagic added the bug Something isn't working label Jul 19, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Cannot download models locally #1070

Cannot download models locally #1070

Aksh97 commented Oct 27, 2021 •

edited

Loading

Gastron commented Oct 27, 2021

Aksh97 commented Oct 29, 2021

mravanelli commented Oct 29, 2021 via email

Gastron commented Oct 29, 2021

Aksh97 commented Nov 3, 2021

Aksh97 commented Nov 17, 2021

mravanelli commented Nov 17, 2021

Gastron commented Nov 17, 2021

MClarkTurner commented Feb 16, 2022

Adel-Moumen commented Sep 1, 2023

Cannot download models locally #1070

Cannot download models locally #1070

Comments

Aksh97 commented Oct 27, 2021 • edited Loading

Gastron commented Oct 27, 2021

Aksh97 commented Oct 29, 2021

mravanelli commented Oct 29, 2021 via email

Gastron commented Oct 29, 2021

Aksh97 commented Nov 3, 2021

Aksh97 commented Nov 17, 2021

mravanelli commented Nov 17, 2021

Gastron commented Nov 17, 2021

MClarkTurner commented Feb 16, 2022

Adel-Moumen commented Sep 1, 2023

Aksh97 commented Oct 27, 2021 •

edited

Loading