Skip to content

ibukisaar/ChatGPTTokenizer

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

12 Commits
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

ChatGPTTokenizer

示例代码

using ChatGPTTokenizer;

using var tokenizer = new BpeTokenizer(File.ReadAllText("merges.txt"));
string text = """
    print("Hello world!")
    """;
var tokens = tokenizer.Encode(text);
Console.WriteLine($"count: {tokens.Length}"); // count: 6
Console.WriteLine(string.Join(',', tokens.Select(t => t.Id))); // 4798,7203,15496,995,2474,8

OpenAI

https://platform.openai.com/tokenizer

About

openai GPT2TokenizerFast的C#实现

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages