llama.cpp now supports grammars:
https://til.simonwillison.net/llms/llama-cpp-python-grammars

Is that something that will come to candle?

It sounds like the approach taken in this Python library would be straightforward:
https://github.com/1rgs/jsonformer/blob/main/jsonformer/main.py

Basically, since you know the JSON schema, you can emit the structural tokens (braces, keys, commas) directly from control flow, and only call the model for values, constraining the logits to tokens that are valid for each value's type.

I started to work on this approach in a demo codebase... I'll report back on any progress.

Curious to hear from others about how feasible the approach is.
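To make the control-flow/logit-masking split concrete, here is a minimal sketch in plain Rust. The stubbed `fake_logits` and the byte-level "tokenizer" are assumptions for illustration only; a real implementation would sit on top of candle's tokenizer and logits tensors.

```rust
// Minimal sketch of the jsonformer-style idea: structure comes from
// control flow, values come from constrained sampling. The model and
// tokenizer here are stand-ins, not candle's real API.

fn fake_logits(_prompt: &str, vocab: usize) -> Vec<f32> {
    // Stand-in for a forward pass: uniform logits over the vocabulary.
    vec![0.0; vocab]
}

/// Greedily pick the highest-logit token, considering only the allowed set.
fn sample_constrained(logits: &[f32], allowed: &[usize]) -> usize {
    *allowed
        .iter()
        .max_by(|&&a, &&b| logits[a].partial_cmp(&logits[b]).unwrap())
        .expect("the schema must always allow at least one token")
}

fn main() {
    // Toy vocabulary: one token per byte, so token id == byte value.
    let vocab = 256;
    // Schema: { "age": <number> } — field name and type known up front.
    let mut out = String::new();

    // Structural tokens come straight from the schema (control flow);
    // no model call is needed for them.
    out.push_str("{\"age\": ");

    // Typed value: ask the model, but mask the logits down to digits.
    let digits: Vec<usize> = (b'0'..=b'9').map(|b| b as usize).collect();
    for _ in 0..2 {
        let logits = fake_logits(&out, vocab);
        let tok = sample_constrained(&logits, &digits);
        out.push(tok as u8 as char);
    }

    // Close the object structurally again.
    out.push('}');
    println!("{out}"); // with the uniform stub: {"age": 99}
}
```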
👋 I wrote an implementation of constrained sampling with candle here that might be useful as a reference. A few things I found important:

- Parsing must be incremental if you want reasonable speeds on longer sequences, which makes an FSM a good choice (first sketch below).
- You can accelerate text generation by eagerly sampling the grammar: whenever it forces the next tokens, feed them into the LLM in one batch instead of one token at a time (second sketch below).
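As a sketch of the incremental-parsing point: an FSM only needs its current state to decide which tokens may come next, so each step costs O(token length) rather than a reparse of the whole generated prefix. The character-level tokens and the toy integer grammar below are assumptions for illustration.

```rust
// Incremental grammar checking with an FSM for a JSON-ish integer.
// Only the current state is kept between steps.

#[derive(Clone, Copy, PartialEq)]
enum State {
    Start,  // nothing consumed yet
    Sign,   // consumed a leading '-'
    Digits, // consumed at least one digit (accepting)
    Dead,   // no continuation can ever be valid
}

fn step(s: State, c: char) -> State {
    match (s, c) {
        (State::Start, '-') => State::Sign,
        (State::Start | State::Sign | State::Digits, '0'..='9') => State::Digits,
        _ => State::Dead,
    }
}

/// Advance a copy of the state over a candidate token; used to build the
/// set of token ids whose logits stay unmasked this step.
fn token_is_allowed(s: State, token: &str) -> bool {
    token.chars().fold(s, step) != State::Dead
}

fn main() {
    let mut state = State::Start;
    for tok in ["-", "12", "7"] {
        assert!(token_is_allowed(state, tok));
        state = tok.chars().fold(state, step); // incremental: O(|tok|)
    }
    assert!(!token_is_allowed(state, "abc")); // letters get masked out
}
```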
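And a sketch of the fast-forwarding idea, assuming a hypothetical `forced_next` (the grammar admits exactly one continuation from its current state) and `model_extend` (a stand-in for one batched forward pass): forced tokens are collected and fed to the model together instead of costing one decode step each.

```rust
// Eagerly walk the grammar and batch the forced tokens.

fn forced_next(generated: &str) -> Option<&'static str> {
    // Toy grammar: after a key's closing quote, `": "` is the only legal
    // continuation; a real FSM would answer this from its current state.
    if generated.ends_with("\"age\"") {
        Some(": ")
    } else {
        None
    }
}

fn model_extend(prompt: &mut String, forced: &str) {
    // Stand-in for one batched forward pass over all forced positions,
    // instead of |forced| separate single-token decode steps.
    prompt.push_str(forced);
}

fn main() {
    let mut out = String::from("{\"age\"");
    // Collect every token the grammar forces before touching the model.
    let mut forced = String::new();
    while let Some(tok) = forced_next(&(out.clone() + &forced)) {
        forced.push_str(tok);
    }
    model_extend(&mut out, &forced);
    println!("{out}"); // {"age":  — the value is sampled normally next
}
```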