An error was encountered while loading "pyannote/speaker-diarization" #1128

Zpadger · 2022-10-29T02:46:45Z

Hello，when i run the code :

from pyannote.audio import Pipeline
pipeline = Pipeline.from_pretrained("pyannote/speaker-diarization",
                                    use_auth_token="my_token")

I get an error :

Traceback (most recent call last):
  File "/home/dg/anaconda3/envs/pyannote/lib/python3.8/site-packages/huggingface_hub/utils/_errors.py", line 213, in hf_raise_for_status
    response.raise_for_status()
  File "/home/dg/anaconda3/envs/pyannote/lib/python3.8/site-packages/requests/models.py", line 1021, in raise_for_status
    raise HTTPError(http_error_msg, response=self)
requests.exceptions.HTTPError: 403 Client Error: Forbidden for url: https://huggingface.co/pyannote/segmentation/resolve/2022.07/pytorch_model.bin

whether I use the read token role or the write token role.
Anyone else know how to fix it? Thx.

The text was updated successfully, but these errors were encountered:

micahjon · 2022-10-29T05:53:10Z

Thanks for posting! I'm running into a similar error when using a read token:

pipeline = Pipeline.from_pretrained("pyannote/speaker-diarization",
                                            use_auth_token="hf_E...rest of token here")

401 Client Error: Repository Not Found for url: https://huggingface.co/pyannote/segmentation/resolve/2022.07/pytorch_model.bin. 
If the repo is private, make sure you are authenticated

I'm able to access the segmentation repository just fine in my browser and agreed to the license, but for some reason the token isn't working.

pooya-mohammadi · 2022-10-29T15:06:29Z

the same error here

mezaros · 2022-10-29T15:28:55Z

Same error here, whatever is going on with the tokens is broken and the project has been unusable since 10/27.

pooya-mohammadi · 2022-10-29T20:35:36Z

@micahjon
@Zpadger
@mezaros
use_auth_token works fine. You simply need to update your pyannote.audio because in the new version get_model takes use_auth-token as an input and it works fine. This is added in the new release, so a simple update to the latest version would solve the problem!

JoFrhwld · 2022-10-29T22:14:52Z

It looks like you need to fill out the user agreement on both hf.co/pyannote/speaker-diarization and hf.co/pyannote/segmentation in order to use either one.

If I read the rationale for gating the model on HF correctly, this is strictly a data gathering exercise. It introduces too high a friction for me to recommend the system, or to utilize it as a dependency in any way.

raulqf · 2022-10-30T05:45:22Z

It seems to be working. I've installed a new environment and reproduced the diarization example.

subtyping · 2022-10-30T21:40:01Z

Installed new environment and was able to get things working. I was still encountering the error after accepting both agreements listed by @JoFrhwld - so not sure if agreeing to both of those is needed or not. Either way, fresh install works!

MrEdwards007 · 2022-10-30T21:59:04Z

It works now.

I had uninstalled and reinstalled a few times but that did not by itself resolve the issue.
When this was previously working a few days ago before the update, I had already accepted the diarization agreement.
What did resolve the issue was also accepting the segmentation agreement, as @JoFrhwld suggested.

mezaros · 2022-10-31T03:34:46Z

I'm running the correct 2.1 version, I did go through both diarization and segmentation gateways even though it wasn't specified, I have updated my token a hundred times — and, with nothing changing in my environment, the error has migrated from 403 forbidden documented above to a new SemVer error. Sounds like I need to completely wipe my environment and start over, even though I was already on the correct versions.

Bugs I understand. But why this friction in the first place? It feels developer and user hostile.

Frascth · 2022-10-31T03:42:00Z

same problem, solved with
in my case, agreeing all model in hugging face pyannote.audio, use the read token, huggingface-cli login using created token, and finnaly upgrade the pyannote.audio library to 2.1.1 using (pip install --upgrade pyannote.audio) solved my problem

cetiny · 2022-10-31T19:01:34Z

Agreeing to all the models on Huggingface resolved the issue for me (thanks @JoFrhwld ). Any tips how to save the model locally and access from cache? I don't want my code to be broken like this again.

MrEdwards007 · 2022-10-31T23:39:49Z

Yes, I thought the model was downloaded and really want this to work offline.

shashankmc · 2022-11-01T11:00:57Z

Updating to the latest version of the library and generating use_auth_token for both speaker-diarization and segmentation seems to do the trick. Tested if using certain auth token would create an issue but it doesn't matter which access token is provided once both are generated.

mezaros · 2022-11-01T19:20:57Z

Got it working with a full environment reset.

But, no longer enthused about testing this. We could never, ever rely on it or ask anyone else to, after this experience. The models need to work offline.

pranjal-zipteams · 2022-11-03T08:56:50Z

It works now.

I had uninstalled and reinstalled a few times but that did not by itself resolve the issue. When this was previously working a few days ago before the update, I had already accepted the diarization agreement. What did resolve the issue was also accepting the segmentation agreement, as @JoFrhwld suggested.

This fixes the issue for me.

plandrobe · 2022-11-09T21:48:16Z

https://huggingface.co/pyannote/segmentation/resolve/2022.07/pytorch_model.bin.

Same here. I have already accepted the terms in both repos, download a new token, updated hugging face hub and pyannote.audio to the last versión. The api response is generic for when the url or model is not found (it doesn't matter if the problem is the token or not). Look at the url of the pretrained model in the error that ends with bin. (a dot at the end). If you try the url without the dot in the browser you can download the file fine. Looks like a bug, at some point the code is adding a . (dot) in the file request.

makkasu · 2022-11-24T17:09:13Z

Some variant on the issue here: I've signed up on the hub to the terms for both models, generated the auth token and I can use it locally on my laptop. However, when I try to run it in an Azure VM, it just hangs on pipeline = Pipeline.from_pretrained('pyannote/speaker-diarization', use_auth_token=... until the max HTTPS retries is exceeded. Quite frustrating, can't really proceed with development using this tool until this hurdle is removed. Note that I can load other huggingface models just fine, but they don't use the auth token business.

* related bugs: #1119 #1128 #1130 * related discussions: #1123 #1103 #1126 #1121

hbredin · 2022-11-29T17:46:21Z

I have just updated the FAQ with instructions on how to use pretrained models and pipelines offline.

pri1712 · 2024-07-09T10:19:36Z

Upgrading my pyannote.audio version worked for me as well.

JoFrhwld mentioned this issue Oct 29, 2022

Integrate automated speech recognition transcription as pre-processor option Forced-Alignment-and-Vowel-Extraction/alignedTextGrid#1

Closed

BenoitWang mentioned this issue Nov 7, 2022

[Bug]: Can't access repository even with access token speechbrain/speechbrain#1687

Closed

hbredin added a commit that referenced this issue Nov 29, 2022

doc: describe offline use (#1169)

a1e99ee

* related bugs: #1119 #1128 #1130 * related discussions: #1123 #1103 #1126 #1121

hbredin closed this as completed Nov 29, 2022

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

An error was encountered while loading "pyannote/speaker-diarization" #1128

An error was encountered while loading "pyannote/speaker-diarization" #1128

Zpadger commented Oct 29, 2022 •

edited

Loading

micahjon commented Oct 29, 2022

pooya-mohammadi commented Oct 29, 2022

mezaros commented Oct 29, 2022

pooya-mohammadi commented Oct 29, 2022 •

edited

Loading

JoFrhwld commented Oct 29, 2022 •

edited

Loading

raulqf commented Oct 30, 2022

subtyping commented Oct 30, 2022

MrEdwards007 commented Oct 30, 2022

mezaros commented Oct 31, 2022

Frascth commented Oct 31, 2022 •

edited

Loading

cetiny commented Oct 31, 2022

MrEdwards007 commented Oct 31, 2022

shashankmc commented Nov 1, 2022

mezaros commented Nov 1, 2022

pranjal-zipteams commented Nov 3, 2022

plandrobe commented Nov 9, 2022

makkasu commented Nov 24, 2022 •

edited

Loading

hbredin commented Nov 29, 2022

pri1712 commented Jul 9, 2024

An error was encountered while loading "pyannote/speaker-diarization" #1128

An error was encountered while loading "pyannote/speaker-diarization" #1128

Comments

Zpadger commented Oct 29, 2022 • edited Loading

micahjon commented Oct 29, 2022

pooya-mohammadi commented Oct 29, 2022

mezaros commented Oct 29, 2022

pooya-mohammadi commented Oct 29, 2022 • edited Loading

JoFrhwld commented Oct 29, 2022 • edited Loading

raulqf commented Oct 30, 2022

subtyping commented Oct 30, 2022

MrEdwards007 commented Oct 30, 2022

mezaros commented Oct 31, 2022

Frascth commented Oct 31, 2022 • edited Loading

cetiny commented Oct 31, 2022

MrEdwards007 commented Oct 31, 2022

shashankmc commented Nov 1, 2022

mezaros commented Nov 1, 2022

pranjal-zipteams commented Nov 3, 2022

plandrobe commented Nov 9, 2022

makkasu commented Nov 24, 2022 • edited Loading

hbredin commented Nov 29, 2022

pri1712 commented Jul 9, 2024

Zpadger commented Oct 29, 2022 •

edited

Loading

pooya-mohammadi commented Oct 29, 2022 •

edited

Loading

JoFrhwld commented Oct 29, 2022 •

edited

Loading

Frascth commented Oct 31, 2022 •

edited

Loading

makkasu commented Nov 24, 2022 •

edited

Loading