feature: constrained grammars #354

mudler · 2023-05-23T08:26:12Z

Is your feature request related to a problem? Please describe.
Output of valid JSON, YAML, or alikes from a model is challenging. Models hallucinates, prompts needs to be fine-tuned, and there is no guarantee it spits valid output format.

Describe the solution you'd like
A way to constrain the output with a grammar. Ideally, would be great to have here as an endpoint this https://twitter.com/GrantSlatton/status/1660348210605596672 - grantslatton/llama.cpp@007e26a .

Describe alternatives you've considered
N/A

Additional context

The text was updated successfully, but these errors were encountered:

kcarnold · 2023-06-19T18:04:25Z

Related existing solution https://github.com/microsoft/guidance/

lee-b · 2023-07-02T09:20:29Z

See also: https://github.com/1rgs/jsonformer

I think there are probably two ways to accomplish this:

At the prompt/output level, generating all of the syntax between values as a prompt, then prompting for only the value, and cutting off the response when a syntactically incorrect token is generated, and so on. Pro: should work with any model / implementation. Con: while it should never generate invalid json, it probably generates empty json that might need to be thrown away at a higher-level json schema checker.
at the sampler level, with knowledge of the desired syntax, so adjusting the likelihood of generating a particular token. Benefit: should work much more reliably, with true syntax awareness. Con: is probably limited to one NN architecture/SDK, like pytorch.

Both approaches have pros and cons, and both require significant effort in parsing syntax.

JSONFormer appears to be the latter, better approach, but also refers to other implementations that seem to do the former.

mudler · 2023-07-02T10:41:34Z

I can share some updates very soon - I got it working with the llama.cpp PR which adds constrained grammars ( I don't have the link now). I could successfully implement OpenAI functions. Will follow up shortly with my experiments

mudler · 2023-07-02T15:56:44Z

When llama.cpp is going to merge this PR: ggerganov/llama.cpp#1773 constrained grammar output will be unblocked and quite easy to implement. I've hacked my way today something, will cleanup and push in a branch soon.

mudler assigned mudler and unassigned mudler May 23, 2023

mudler mentioned this issue Jun 15, 2023

feature: Chat completion functions #588

Closed

This was referenced Jul 2, 2023

wip: add constrained grammar support go-skynet/go-llama.cpp#124

Closed

feat: LocalAI functions #726

Merged

mudler linked a pull request Jul 6, 2023 that will close this issue

feat: LocalAI functions #726

Merged

1 task

mudler closed this as completed in #726 Jul 9, 2023

Atry mentioned this issue Jul 26, 2023

Guidance acceleration huggingface/text-generation-inference#505

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feature: constrained grammars #354

feature: constrained grammars #354

mudler commented May 23, 2023 •

edited

kcarnold commented Jun 19, 2023

lee-b commented Jul 2, 2023

mudler commented Jul 2, 2023

mudler commented Jul 2, 2023

feature: constrained grammars #354

feature: constrained grammars #354

Comments

mudler commented May 23, 2023 • edited

kcarnold commented Jun 19, 2023

lee-b commented Jul 2, 2023

mudler commented Jul 2, 2023

mudler commented Jul 2, 2023

mudler commented May 23, 2023 •

edited