Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[Feature Request] Support Japanese language #18

Open
AtsunoriFujita opened this issue Jul 19, 2021 · 1 comment
Open

[Feature Request] Support Japanese language #18

AtsunoriFujita opened this issue Jul 19, 2021 · 1 comment

Comments

@AtsunoriFujita
Copy link

In some cases, dedicated libraries(e.g. fugashi, ipadic) are required for Japanese tokenizers.
Currently, these libraries are not included in the inference container.
Is it possible to include these libraries or to have an option in the transformers installation?

For example, if we can rewrite the Dockerfile like this, we can handle it.
transformers[sentencepiece] transformers[ja]

Currently, if we deploy from S3, we can work around it with requirements.txt and an empty inference.py, but if we deploy from HF Hub, we don't have a workaround.

Thanks!

@philschmid
Copy link
Collaborator

@AtsunoriFujita thank you for the feature request. We are going to look into it.

@AtsunoriFujita AtsunoriFujita changed the title [Feature Request] Support Japanese language [Feature Request] Supports Japanese language Jul 19, 2021
@AtsunoriFujita AtsunoriFujita changed the title [Feature Request] Supports Japanese language [Feature Request] Support Japanese language Jul 19, 2021
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants