Autogrammer constrains the output of language models (LLMs) to generate syntactically valid JSON or SQL.
By leveraging grammars, Autogrammer ensures that an LLM generates output adhering to specific structures and syntax, even with smaller models.
Autogrammer is still being actively developed and should be considered in alpha
- 💻 Live code generation in the browser
- 🗣️ Natural language to SQL conversion
- 🎇 Generating visualizations from text descriptions
- 🌳 Offline apps
- 🕵️ When you want your data staying private
-
Bring your own model — Works seamlessly with
Transformers.js
,web-llm
, and REST endpoints forllama.cpp
andllamafile
, allowing you to use your preferred LLM. -
Support for
JSON
andSQL
— With syntactic validity guaranteed by GBNF grammars. -
Schema support — Provide schemas** (database schema or JSON schema) to further restrict possible output and ensure semantic correctness.
-
Plugins — (Coming soon) Enable additional functionality, such as RAG, on-the-fly error correction, and more.
npm install autogrammer
import { pipeline } from '@xenova/transformers'
import { Autogrammer } from 'autogrammer'
// Load your preferred model
const model = pipeline('text-generation', 'Xenova/gpt2')
// Create Autogrammer for JSON output
const autogrammer = new Autogrammer({
language: 'json',
})
// Tell the model what to generate
const prompt = 'Write me JSON that captures the following address: 1600 Pennsylvania Avenue NW, Washington, DC 20500'
// Run
const response = await autogrammer.execute(prompt)
// See the generated JSON
console.log(response) // { ... json object }
Autogrammer is made up of a number of subpackages that each make up the library:
gbnf
- Parses a GBNF grammar into a graph of rules, which can be used to determine the validity of a next token. Also enables the creation of GBNF grammars dynamically.json2gbnf
- Generates a GBNF grammar for JSON, with optional JSON schemasql2gbnf
- Generates a GBNF grammar for SQL, with optional database schemacontortionist
- Implements a Logits post-processor that restricts LLM output to only include valid next tokensautogrammer
- Orchestrates support for SQL and JSON grammar generation with a variety of LLM models.