Load HuggingFace tokenizer with SentencePiece files #2249

Answered by frankfliu
xudongguan202 asked this question in Q&A

That means there is no fast tokenizer implementation. You would have to port the Python code to Java.
You might want to take a look at our SentencePiece extension and see if you can use it.
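
To tell which situation applies before porting anything, a quick look at the downloaded model folder can help. The sketch below uses only the Java standard library and rests on two assumptions that are not stated in this thread: a fast tokenizer ships as a `tokenizer.json` file, and a SentencePiece model is any file matching `*.model` (for example `spiece.model`). The class name `TokenizerFiles` and its `Kind` enum are hypothetical helpers, not part of DJL.

```java
import java.io.IOException;
import java.nio.file.DirectoryStream;
import java.nio.file.Files;
import java.nio.file.Path;
import java.nio.file.Paths;

/** Hypothetical helper: reports which tokenizer files a local model folder contains. */
public final class TokenizerFiles {

    public enum Kind { FAST, SENTENCEPIECE_ONLY, NONE }

    private TokenizerFiles() {}

    /**
     * Assumption: "tokenizer.json" marks a fast tokenizer, and any "*.model"
     * file is treated as a SentencePiece model.
     */
    public static Kind detect(Path modelDir) throws IOException {
        if (Files.isRegularFile(modelDir.resolve("tokenizer.json"))) {
            return Kind.FAST;
        }
        // The glob is matched against each entry's file name.
        try (DirectoryStream<Path> models = Files.newDirectoryStream(modelDir, "*.model")) {
            return models.iterator().hasNext() ? Kind.SENTENCEPIECE_ONLY : Kind.NONE;
        }
    }

    public static void main(String[] args) throws IOException {
        Path dir = Paths.get(args.length > 0 ? args[0] : ".");
        System.out.println(dir + " -> " + detect(dir));
    }
}
```

If the result is `SENTENCEPIECE_ONLY`, the SentencePiece extension mentioned above is probably the first thing to try. Any pre- or post-processing that the Python tokenizer adds on top of the raw SentencePiece model would still need to be ported by hand.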

Answer selected by xudongguan202