
[FR] Add support for structured extraction with Ollama models #68

Open
svilupp opened this issue Feb 5, 2024 · 4 comments

svilupp (Owner) commented Feb 5, 2024

It would be good to have aiextract enabled for Ollama models.

svilupp (Owner, Author) commented Apr 10, 2024

There is no official support yet, but you can easily build it yourself with the following guide: https://svilupp.github.io/PromptingTools.jl/dev/how_it_works#Walkthrough-Example-for-aiextract

It works well with mixtral and similar models.

cpfiffer (Collaborator) commented

I'm willing to handle this one; it should be relatively straightforward. As a clarifying question, what's the difference between a regular schema and a managed one?

@cpfiffer cpfiffer self-assigned this Apr 16, 2024
svilupp (Owner, Author) commented Apr 16, 2024

Great!

Ollama exposes several different API endpoints:

  • generate - basic text completion; you provide two fields (system, prompt) and get a single reply, with no multi-turn conversation
  • api/chat - "message"-based with multi-turn conversations, similar to the OpenAI style
  • v1/chat/completions - fully OpenAI-compatible endpoint

They map to the PromptingTools schemas as follows:

  1. generate corresponds to OllamaManagedSchema, because Ollama manages everything there. It's legacy, and I kept it because I didn't want to break too much.
  2. api/chat corresponds to OllamaSchema.
  3. v1/chat/completions is not implemented, because there was no need / no advantage (at the time).
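The difference between the endpoints is easiest to see from their request payload shapes. A minimal sketch in Python (payload construction only, no live call; the model name and prompt are illustrative, and Ollama's docs define further optional fields):

```python
# Request payload shapes for the three Ollama endpoints (illustrative).

# 1. /api/generate: single-turn, with separate `system` and `prompt` fields.
generate_payload = {
    "model": "mixtral",
    "system": "You are a terse assistant.",
    "prompt": "Name one Julia web framework.",
    "stream": False,
}

# 2. /api/chat: multi-turn, OpenAI-like `messages` list of role/content pairs.
chat_payload = {
    "model": "mixtral",
    "messages": [
        {"role": "system", "content": "You are a terse assistant."},
        {"role": "user", "content": "Name one Julia web framework."},
    ],
    "stream": False,
}

# 3. /v1/chat/completions: fully OpenAI-compatible, so the same `messages`
# shape works with any OpenAI client pointed at the Ollama server.
openai_payload = {"model": "mixtral", "messages": chat_payload["messages"]}
```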

From that perspective, I'd assume you would add aiextract for OllamaSchema, which is built around api/chat. You would merge the api_kwargs to also include format="json", and then try to convert the response to the return type in a try-catch block, similar to the OpenAI implementation.
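The suggested pattern (constrain the model to JSON output, then convert defensively) can be sketched in Python; `Person` and `try_extract` are hypothetical names for illustration, and the actual implementation would be Julia code alongside the existing OpenAI path:

```python
import json
from dataclasses import dataclass


@dataclass
class Person:  # hypothetical return type requested from the model
    name: str
    age: int


def try_extract(raw: str):
    """Parse the model's JSON output into Person; return None on any failure.

    With format="json" the model is constrained to emit syntactically valid
    JSON, but the keys and value types may still be wrong, hence the
    defensive conversion (the analogue of the try-catch block above).
    """
    try:
        data = json.loads(raw)
        return Person(name=data["name"], age=int(data["age"]))
    except (json.JSONDecodeError, KeyError, TypeError, ValueError):
        return None


ok = try_extract('{"name": "Ada", "age": 36}')   # well-formed and complete
bad = try_extract("not json at all")             # parse failure -> None
```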

I'd suggest avoiding nested return_types (they are harder for OSS models). A good model to use for aiextract is mixtral, if you can run it locally.
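To make the flat-vs-nested advice concrete, here is a hypothetical pair of return types in Python (the field names are invented for illustration): the flat one asks the model for a single level of simple fields, while the nested one forces it to emit a correctly nested JSON object, which smaller open-source models get wrong more often.

```python
from dataclasses import dataclass


# Flat return type: one level of simple fields - easier for OSS models.
@dataclass
class MovieFlat:
    title: str
    year: int
    director_name: str


# Nested return type: requires the model to produce an inner object.
@dataclass
class Director:
    name: str


@dataclass
class MovieNested:
    title: str
    year: int
    director: Director


# The JSON the model must emit for the flat type stays one level deep...
flat_target = {"title": "Alien", "year": 1979, "director_name": "Ridley Scott"}
# ...while the nested type needs a correctly nested sub-object.
nested_target = {"title": "Alien", "year": 1979, "director": {"name": "Ridley Scott"}}
```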

Does that answer your question?

svilupp (Owner, Author) commented Apr 28, 2024

Just flagging that it might be easier to tackle this after we can provide a "JSON type" representation to the open-source models (not a full JSON schema) - reference.

See #143
