Custom Format Parsers

Currently the `output()` method on Genkit results attempts to leniently parse JSON based on whether `format` is supplied or an output schema is present. There are a variety of reasons, however, that someone might want to have greater control over output parsing than this.

Proposed is to extend the `format` option to instead be a registry of formatters with some defaults (`text` and `json` at a minimum), provided. To define a custom format parser, developers would specify it something like so:

```ts
defineFormat('csv', (req) => {
  return {
    parseResponse: (res) => {
      const toParse = extractCodeFence(res.text);
      return parseCSV(toParse);
    },
    // optional, if omitted streaming `.output()` is not supported for this format type
    parseChunk: (chunk) => {
      // here `chunk` contains only the most recent data, `partialResponse` contains
      // a Message with all chunks received so far
      const toParse = extractPartialCodeFence(chunk.accumulatedText);
      return parseCSV(toParse);
    },
    instructions: `Output should be in CSV format with the following columns:\n\n${schemaToCSVSpec(req.output.schema)}.`
  }
});
```

The parser definition semantics should be flexible enough to handle many different scenarios, including:

1. **Built-up response:** where each chunk in a stream returns an incrementally more complete response
2. **Buffered chunking:** where e.g. JSONL is streamed but only on complete objects (so `parseChunk` must have the ability to return `null` and not emit a chunk to the end user)
3. **Options/Schema:** allow a format parser to access the full request including output schema and control how output validation occurs.

To use a custom format is simple: just use the `format` option already in output.

```ts
generate({
  prompt: "Generate a contact list with 10 people.",
  output: {
    format: 'csv',
    schema: z.array(z.object({firstName: z.string(), lastName: z.string()})
  },
});
```

### Work Plan

- [x] Create an interface for formatters that can handle streaming as well as custom instructions.
- [x] Implement a base set of default formatters - `text`, `json`, `array`, `jsonl`, `enum`.
- [x] Refactor `generate` and `generateStream` to play nicely with new streaming semantics.
- [ ] Figure out how to reconcile new streaming semantics with tool loop.
- [ ] Figure out how media and multi-modal output should work.
- [x] Add `instructions?: boolean | string | (req: GenerateRequest) => string` for custom instructions control.
- [x] Add `constrained?: boolean` option to output for schema-constrained generation, and implement it for Gemini models.
- [ ] Add enum mode to Gemini models.
- [ ] Add support for `ai.defineFormat` to define custom formats.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Custom Format Parsers #708

Work Plan

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Custom Format Parsers #708

Description

Work Plan

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions