
Add a /detokenize endpoint to the example server #2802

Merged
merged 2 commits into ggerganov:master on Aug 26, 2023

Conversation

BruceMacD
Contributor

Expose the ability to convert tokens to strings in the example server.

$ curl -X 'POST' \
 -d '{"content":"hello world"}' -H 'Content-Type: application/json' \
 'http://127.0.0.1:8080/tokenize'
{"tokens":[29871,22172,3186]}%                          

$ curl -X 'POST' \
 -d '{"tokens":[29871,22172,3186]}' -H 'Content-Type: application/json' \
 'http://127.0.0.1:8080/detokenize'
{"content":"  hello world"}% 

resolves #2801

@jhen0409
Sponsor Collaborator

The CI failure needs to be fixed; everything else looks good.

@BruceMacD
Contributor Author

Thanks for taking a look. I fixed the formatting issue, so the workflows should pass now.

@jhen0409 jhen0409 merged commit c1ac54b into ggerganov:master Aug 26, 2023
25 checks passed
mattgauf added a commit to mattgauf/llama.cpp that referenced this pull request Aug 26, 2023
* master: (773 commits)
  server : add `/detokenize` endpoint (ggerganov#2802)
  convert.py : advanced option (ggerganov#2753)
  llama : use Unicode Escape Sequence to replace encoded characters (ggerganov#2814)
  flake.nix : add rocm support and cleanup (ggerganov#2808)
  llama : move #includes out of _GNU_SOURCE conditional (ggerganov#2817)
  main : fix bug (penalize_nl=false doesn't work) + suppress warning on mingw (ggerganov#1528)
  llama : use std::abs in llama_sample_tail_free (ggerganov#2800)
  k-quants : remove unnecessary tensor shape restrictions (ggerganov#2811)
  Better perplexity for 2- and 3-bit quantization for LLaMA-v2-70B (ggerganov#2807)
  Fix HellaSwag (ggerganov#2805)
  flake : build llama.cpp on Intel with nix (ggerganov#2795)
  Handle null rope scaling value (ggerganov#2793)
  Fix spm whitespaces (ggerganov#2806)
  examples : skip unnecessary external lib in server README.md how-to (ggerganov#2804)
  llama : fix struct decl (ggerganov#2790)
  Faster perplexity computation (ggerganov#2786)
  llama : add llama_beam_search() (ggerganov#2267)
  convert.py : Get rope scale from HuggingFace models (ggerganov#2772)
  llama-bench : add model sizes (ggerganov#2771)
  convert.py : export rope freq_base when converting CodeLlama from an HF model (ggerganov#2773)
  ...
akawrykow pushed a commit to akawrykow/llama.cpp that referenced this pull request Aug 29, 2023
* Add a /detokenize endpoint to the example server

* remove trailing white-space
Sam2much96 pushed a commit to Sam2much96/llama.cpp that referenced this pull request Sep 11, 2023
* Add a /detokenize endpoint to the example server

* remove trailing white-space
Linked issue: Create a /detokenize example server endpoint (#2801)