
Initial implementation of the inference system #869

Merged: 5 commits merged into main on Jan 21, 2023

Conversation

yk
Collaborator

@yk yk commented Jan 20, 2023

This PR introduces:

  • A server for coordination
  • A minimal worker
  • A text client

all building on Redis lists to stream data as it is produced.
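The Redis-list streaming pattern the PR describes can be sketched as follows. This is a hypothetical illustration, not the PR's actual code: a real deployment would use redis-py's `rpush`/`blpop`, but here an in-memory stand-in keeps the sketch self-contained. The `FakeRedis` class, the `chat:{id}` key scheme, and the `END` sentinel are all assumptions.

```python
# Hypothetical sketch of streaming tokens through a Redis list.
# FakeRedis stands in for a real Redis connection so the example runs
# without a server; only rpush/lpop semantics are modeled.
from collections import defaultdict, deque


class FakeRedis:
    """Minimal stand-in for the two Redis list ops the pattern needs."""

    def __init__(self):
        self.lists = defaultdict(deque)

    def rpush(self, key, value):
        # Append to the tail of the list, like Redis RPUSH.
        self.lists[key].append(value)

    def lpop(self, key):
        # Pop from the head of the list, like Redis LPOP; None if empty.
        q = self.lists[key]
        return q.popleft() if q else None


END = "<END>"  # hypothetical sentinel marking the end of a stream


def worker_produce(r, chat_id, tokens):
    # The worker pushes tokens onto the chat's list as they are generated.
    for tok in tokens:
        r.rpush(f"chat:{chat_id}", tok)
    r.rpush(f"chat:{chat_id}", END)


def client_consume(r, chat_id):
    # The text client pops tokens until the sentinel arrives.
    # A real client would block on BLPOP instead of polling.
    out = []
    while True:
        tok = r.lpop(f"chat:{chat_id}")
        if tok is None:
            continue
        if tok == END:
            break
        out.append(tok)
    return "".join(out)


r = FakeRedis()
worker_produce(r, "42", ["Hello", ", ", "world"])
print(client_consume(r, "42"))  # -> Hello, world
```

Because producer and consumer share only the list key, the coordination server never has to hold the generated text itself.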

@yk yk marked this pull request as ready for review January 21, 2023 13:53
@yk yk requested a review from andreaskoepf as a code owner January 21, 2023 13:53
Collaborator

@andreaskoepf andreaskoepf left a comment


let's get the initial impl in.

    await asyncio.sleep(1)
    continue

    chat.message_request_state = MessageRequestState.in_progress
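The excerpt above is a fragment of a poll-and-sleep dequeue loop. A minimal self-contained reconstruction might look like the following; only the `asyncio.sleep`/`continue` lines and the `MessageRequestState.in_progress` assignment come from the diff, while the enum values, `Chat` class, and surrounding loop are assumptions for illustration.

```python
# Hypothetical reconstruction of the polling dequeue loop the diff
# excerpt is taken from. Everything except the sleep/continue and the
# state assignment is an illustrative assumption.
import asyncio
import enum


class MessageRequestState(enum.Enum):
    pending = "pending"
    in_progress = "in_progress"
    complete = "complete"


class Chat:
    def __init__(self):
        self.message_request_state = MessageRequestState.pending


async def dequeue_loop(queue, stop_after=1):
    handled = 0
    while handled < stop_after:
        if not queue:
            # Nothing queued yet: back off briefly, then poll again.
            await asyncio.sleep(0.01)
            continue
        chat = queue.pop(0)
        # Mark the request as claimed so no other worker picks it up.
        chat.message_request_state = MessageRequestState.in_progress
        handled += 1
    return handled


chat = Chat()
asyncio.run(dequeue_loop([chat]))
print(chat.message_request_state)  # -> MessageRequestState.in_progress
```

The sleep bounds how hard an idle worker hammers the queue, at the cost of up to one sleep interval of added latency per request.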

If we have >1 "message-broker" instances and CHATS in db/redis, then this "dequeue" operation will become a congestion point. One idea would be to define clear "configuration" tiers, e.g. based on GPU memory requirements, and have independent task queues for them.
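The reviewer's per-tier queue idea could be sketched like this. The tier names, memory thresholds, and routing functions are all hypothetical; the point is only that jobs are routed by resource requirement and each worker drains only the queues its hardware can serve.

```python
# Hypothetical sketch of independent task queues per GPU-memory tier,
# as suggested in the review. Tier names and thresholds are invented
# for illustration.
from collections import deque

TIERS = {          # minimum GPU memory (GiB) -> queue name
    8: "tier:small",
    24: "tier:medium",
    80: "tier:large",
}

queues = {name: deque() for name in TIERS.values()}


def queue_for_model(model_mem_gib):
    """Route a job to the smallest tier whose GPUs fit the model."""
    for min_mem in sorted(TIERS):
        if model_mem_gib <= min_mem:
            return TIERS[min_mem]
    raise ValueError("model too large for any tier")


def enqueue(job, model_mem_gib):
    queues[queue_for_model(model_mem_gib)].append(job)


def worker_dequeue(worker_mem_gib):
    """A worker drains the largest tier it can serve first, then
    falls through to smaller tiers, so big GPUs stay utilized."""
    for min_mem in sorted(TIERS, reverse=True):
        if min_mem <= worker_mem_gib and queues[TIERS[min_mem]]:
            return queues[TIERS[min_mem]].popleft()
    return None


enqueue("chat-123", model_mem_gib=6)      # lands in tier:small
print(worker_dequeue(worker_mem_gib=24))  # -> chat-123
```

Separate queues also shrink the congestion point: brokers contend only within a tier rather than on one global list.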

@andreaskoepf andreaskoepf merged commit 1709dc0 into main Jan 21, 2023
@andreaskoepf andreaskoepf deleted the initial-inference branch January 21, 2023 21:38