Allow client/server guidance execution #586

slundberg · 2024-01-12T00:15:07Z

This work-in-progress splits the inference computation in the Model class out into separate Engine and Tokenizer classes. The goal is to completely separate the client-side manipulation of Model objects from a stateless inference computation that can be exposed through a remote server. (Note that the Server class is not yet implemented.)

It is not running yet, but I am starting to debug LlamaCpp (without any network traffic). Sharing here as a PR for visibility since it changes a lot of code.
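To make the intended split concrete, here is a minimal sketch of the idea, not the code in this PR. The class names Model, Engine and Tokenizer come from the description above; everything else (the method names, the byte-level tokenizer, and the toy "echo the last token" stand-in for inference) is an assumption made up for illustration.

```python
from dataclasses import dataclass
from typing import List


class Tokenizer:
    """Toy byte-level tokenizer (hypothetical): one token per UTF-8 byte."""

    def encode(self, text: str) -> List[int]:
        return list(text.encode("utf-8"))

    def decode(self, tokens: List[int]) -> str:
        return bytes(tokens).decode("utf-8", errors="replace")


class Engine:
    """Stateless inference: each call gets the full prompt and stores nothing."""

    def __init__(self, tokenizer: Tokenizer):
        self.tokenizer = tokenizer

    def __call__(self, prompt: str, max_tokens: int) -> str:
        tokens = self.tokenizer.encode(prompt)
        # Stand-in for a real forward pass: repeat the last prompt token.
        new_tokens = [tokens[-1]] * max_tokens if tokens else []
        return self.tokenizer.decode(new_tokens)


@dataclass(frozen=True)
class Model:
    """Client-side object: holds the text state, delegates compute to an Engine."""

    engine: Engine
    state: str = ""

    def __add__(self, text: str) -> "Model":
        # Appending text returns a new Model; the Engine is shared, not copied.
        return Model(self.engine, self.state + text)

    def gen(self, max_tokens: int) -> "Model":
        completion = self.engine(self.state, max_tokens)
        return Model(self.engine, self.state + completion)


lm = Model(Engine(Tokenizer())) + "ab"
out = lm.gen(3)
print(out.state)
```

Because the Engine in this sketch keeps no per-call state, its `__call__` could in principle sit behind a network endpoint without the client-side Model changing.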

slundberg and others added 22 commits January 12, 2024 00:14

First factorization for client/server

8d53d2a

Fix EOS bug

fa901fe

Transformers models now also converted

ee22e3c

Support for Google AI and remote bug fixes (not done)

7e9d6d0

Fix Mock and token forcing bug, also clean up model/engine variables

27e37b0

Merge branch 'main' into client-server

93f03f4

Support LiteLLM and fix Remote EOS bug

09278b3

Add Cohere and Anthropic support

514eeca

VertexAI support

b71ce24

OpenAI engine migration

862ff92

Rename Remote to Grammarless (to enable remote to mean really "remote")

973f0af

Protobuf based grammar serialization

985ca77

Better protobuf serialization

ad3375c

First working client server setup round trip

3772583

Unit server for client server

b87c1c1

Rename grammar classes

8f0e9aa

Add protobuf dependency

cbdf525

update pb2 files to new version

26b141e

Add fastapi to deps

abfe60b

Use Mock for server test

12c721d

Add remote support to Mock

4312c06

Add uvicorn

9306961

slundberg changed the title ~~WIP: Allow client/server guidance execution~~ Allow client/server guidance execution Jan 25, 2024

Expose grammar match() method

4da011f

slundberg mentioned this pull request Jan 26, 2024

Report ConstraintException when using guidance GPT-4 with ThreadPool. #601

Closed

slundberg merged commit 513966f into main Jan 31, 2024
6 checks passed

slundberg deleted the client-server branch January 31, 2024 22:08
