`SpeechRecognitionModel.is_finetuned(self)` returns `True` for the base model `facebook/wav2vec2-base`, which is not fine-tuned.

How to reproduce:
The issue seems to be that `Wav2Vec2Processor.from_pretrained()` does return a processor with a `PreTrainedTokenizer`:

![image](https://user-images.githubusercontent.com/93617195/186763378-a58e69d2-a0e5-49c3-8273-6597a82ac991.png)

Therefore `is_finetuned()` yields `True`: https://github.com/jonatasgrosman/huggingsound/blob/main/huggingsound/speech_recognition/model.py#L58-L78
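
One possible way to avoid depending on whether a processor loads would be to inspect the `architectures` entry of the model's `config.json`. The following is only a minimal sketch, assuming that fine-tuned speech recognition checkpoints declare an architecture whose name ends in `ForCTC` (for example `Wav2Vec2ForCTC`) and that pretrained-only checkpoints do not. The helper name `looks_finetuned_for_ctc` is hypothetical and is not part of huggingsound.

```python
from typing import Any, Mapping


def looks_finetuned_for_ctc(config: Mapping[str, Any]) -> bool:
    """Guess from a parsed config.json whether a checkpoint has a CTC head.

    Assumption: fine-tuned speech recognition checkpoints list an
    architecture such as "Wav2Vec2ForCTC"; pretrained-only ones do not.
    """
    architectures = config.get("architectures") or []
    return any(str(name).endswith("ForCTC") for name in architectures)


# Hand-written example configs for illustration, not copied from the hub.
pretrained_only = {"architectures": ["Wav2Vec2ForPreTraining"]}
ctc_finetuned = {"architectures": ["Wav2Vec2ForCTC"]}

print(looks_finetuned_for_ctc(pretrained_only))
print(looks_finetuned_for_ctc(ctc_finetuned))
```

A check like this reads only the configuration, so it would not be affected by extra tokenizer files in a repository. Whether it fits the library's intent is for the maintainers to decide.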
Possibly this is not an issue with this repo but with the files on the Hugging Face model hub:
https://huggingface.co/facebook/wav2vec2-base/tree/main
Is there a reason why this model has `preprocessor_config.json` and `tokenizer_config.json` files?

If you look at `facebook/wav2vec2-large`, this one doesn't have these files, and therefore `Wav2Vec2Processor.from_pretrained()` won't return a processor/tokenizer.
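
To compare the two repositories, it may help to check a file listing for the two file names mentioned above. This is a rough sketch: the helper `processor_files_present` and the example listings are hypothetical, and the listings are not the actual contents of either repository.

```python
from typing import Dict, Iterable

# File names taken from the question above.
PROCESSOR_FILES = ("preprocessor_config.json", "tokenizer_config.json")


def processor_files_present(repo_files: Iterable[str]) -> Dict[str, bool]:
    """Map each processor-related file name to whether it is in the listing."""
    # Compare on the base name so nested paths are handled as well.
    names = {path.rsplit("/", 1)[-1] for path in repo_files}
    return {name: name in names for name in PROCESSOR_FILES}


# Hypothetical listings for illustration only. A real listing could be
# obtained with huggingface_hub.list_repo_files(repo_id).
with_processor = ["config.json", "preprocessor_config.json", "tokenizer_config.json"]
without_processor = ["config.json", "pytorch_model.bin"]

print(processor_files_present(with_processor))
print(processor_files_present(without_processor))
```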