[Feature Request] Support Japanese language #18

AtsunoriFujita · 2021-07-19T06:23:46Z

In some cases, dedicated libraries(e.g. fugashi, ipadic) are required for Japanese tokenizers.
Currently, these libraries are not included in the inference container.
Is it possible to include these libraries or to have an option in the transformers installation?

For example, if we can rewrite the Dockerfile like this, we can handle it.
transformers[sentencepiece] → transformers[ja]

Currently, if we deploy from S3, we can work around it with requirements.txt and an empty inference.py, but if we deploy from HF Hub, we don't have a workaround.

Thanks!

The text was updated successfully, but these errors were encountered:

philschmid · 2021-07-19T07:02:02Z

@AtsunoriFujita thank you for the feature request. We are going to look into it.

AtsunoriFujita changed the title ~~[Feature Request] Support Japanese language~~ [Feature Request] Supports Japanese language Jul 19, 2021

AtsunoriFujita changed the title ~~[Feature Request] Supports Japanese language~~ [Feature Request] Support Japanese language Jul 19, 2021

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Feature Request] Support Japanese language #18

[Feature Request] Support Japanese language #18

AtsunoriFujita commented Jul 19, 2021

philschmid commented Jul 19, 2021

[Feature Request] Support Japanese language #18

[Feature Request] Support Japanese language #18

Comments

AtsunoriFujita commented Jul 19, 2021

philschmid commented Jul 19, 2021