windows opus-fast-mosestokenizer installation #20

Closed
tatarmb4s opened this issue Oct 21, 2023 · 6 comments

Comments

@tatarmb4s

Hello! When I try to run the whisper_online_server I get this error:

Exception has occurred: FileNotFoundError
[WinError 2] The system cannot find the file specified
File "S:\AI\Testing\HuggingFace\openai\whisper_streaming\whisper_online.py", line 436, in create_tokenizer
return MosesTokenizer(lan)
^^^^^^^^^^^^^^^^^^^
File "S:\AI\Testing\HuggingFace\openai\whisper_streaming\whisper_online_server.py", line 64, in <module>
online = OnlineASRProcessor(asr,create_tokenizer(tgt_language))
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^

Everything is at the defaults; I just cloned the repo and installed all dependencies. How can I solve this?

I'm on Windows, using Python 3.11.3 in a .venv.

@Gldkslfmsd
Collaborator

> installed all dependencies

Are you sure that you installed opus-fast-mosestokenizer correctly?

Also, can you confirm that it works on Windows with pip install sacremoses, without any changes to the code?
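
One way to answer the installation question above is to check, from inside the same .venv, whether pip registered the distribution and whether its module can be found. This is a minimal sketch using only the standard library. The import name `mosestokenizer` is an assumption (the traceback shows only the class `MosesTokenizer`), and `check_install` is a hypothetical helper, not part of whisper_streaming.

```python
import importlib.util
from importlib import metadata


def check_install(dist_name: str, module_name: str) -> dict:
    """Report whether a pip distribution and its import module are visible.

    dist_name is the name used with `pip install`; module_name is the
    top-level name used with `import`. The two can differ.
    """
    try:
        version = metadata.version(dist_name)
    except metadata.PackageNotFoundError:
        version = None  # pip has no record of this distribution here

    # find_spec returns None when a top-level module cannot be located
    importable = importlib.util.find_spec(module_name) is not None
    return {"distribution": dist_name, "version": version, "importable": importable}


if __name__ == "__main__":
    # Assumed import name for opus-fast-mosestokenizer: "mosestokenizer"
    print(check_install("opus-fast-mosestokenizer", "mosestokenizer"))
    print(check_install("sacremoses", "sacremoses"))
```

A `version` of `None` would suggest the package was never installed into the active environment. Note that this check only confirms the package is present; it does not prove that any native libraries it loads at runtime are available.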

@Gldkslfmsd changed the title from "Exception has occurred: FileNotFoundError [WinError 2] The system cannot find the file specified" to "windows opus-fast-mosestokenizer installation" on Nov 28, 2023
@white1-doggy

white1-doggy commented Dec 8, 2023

But it doesn't work with the MosesTokenizer in sacremoses:

AttributeError: 'MosesTokenizer' object has no attribute 'split'

@Gldkslfmsd
Collaborator

yes, sacremoses does not have sentence segmentation
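
The AttributeError above suggests that the calling code expects the tokenizer object to provide a `split` method for sentence segmentation. As an illustration only, the sketch below shows a naive regex-based stand-in with that one method, plus a check that fails early with a clearer message. `NaiveSentenceSplitter` and `require_split` are hypothetical names, the `split(text) -> list of sentences` shape is inferred from the error message, and the regex is far cruder than a real Moses sentence splitter.

```python
import re


class NaiveSentenceSplitter:
    """Hypothetical stand-in exposing split(text) -> list of sentences."""

    # Break on whitespace that follows '.', '!' or '?'.
    # This mishandles abbreviations such as "e.g." and is for illustration only.
    _boundary = re.compile(r"(?<=[.!?])\s+")

    def split(self, text: str) -> list[str]:
        return [s for s in self._boundary.split(text.strip()) if s]


def require_split(tokenizer):
    """Return the tokenizer unchanged, or raise if it cannot segment sentences."""
    if not callable(getattr(tokenizer, "split", None)):
        raise TypeError(
            f"{type(tokenizer).__name__} has no split() method, "
            "so it cannot be used for sentence segmentation"
        )
    return tokenizer


if __name__ == "__main__":
    splitter = require_split(NaiveSentenceSplitter())
    print(splitter.split("Hello there. How are you? Fine!"))
```

Passing an object without `split`, such as a word-level tokenizer, to `require_split` raises a `TypeError` at setup time instead of an `AttributeError` later during processing.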

@white1-doggy

> yes, sacremoses does not have sentence segmentation

So, could you please tell me how I can solve this problem?

@Gldkslfmsd
Collaborator

resolved -- same reason as #23
