Open
Description
Amazing work on bpe
!
Wanted to see if there were any known Python bindings a la the snippets from tiktoken below?
import tiktoken
enc = tiktoken.get_encoding("o200k_base")
assert enc.decode(enc.encode("hello world")) == "hello world"
# To get the tokeniser corresponding to a specific model in the OpenAI API:
enc = tiktoken.encoding_for_model("gpt-4o")
Metadata
Metadata
Assignees
Labels
No labels