
Streaming responses from OpenAI and GPT4All #221

Merged · 36 commits merged into main from streaming-responses on Jul 6, 2023

Conversation

raulraja (Contributor) commented Jul 3, 2023

This PR introduces the capability of streaming chat responses. It includes the following changes:

  • A new function, createChatCompletions, has been added to the Chat interface, allowing chat completions to be created from a ChatCompletionRequest. It returns a Flow<ChatCompletionChunk> representing the generated chat completions (see the sketch after this list).
  • The Chat interface now includes two overloaded versions of the promptStreaming function. These enable streaming by returning a Flow<String> that emits the generated chat responses as they become available.
  • The implementation of createChatCompletions in GPT4All uses a Flow and channels to stream chat completions (see the sketch after the example below).
  • The MockOpenAIClient has been updated to throw a NotImplementedError for chatCompletions, since it is not implemented in the mock.
  • The OpenAIClient implementation has been updated to use the OpenAI API's chat completion endpoint and map the response into the corresponding domain models (ChatCompletionChunk, ChatChunk, ChatDelta, etc.).
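
For orientation, the new surface area looks roughly like the sketch below. This is a reconstruction from the bullet points above, not the actual source: the placeholder types stand in for xef's real domain models, and the parameters of the second promptStreaming overload are an assumption (the description only states that two overloads exist).

import kotlinx.coroutines.flow.Flow

// Placeholder types; the real ones live in xef's domain model.
class ChatCompletionRequest
class ChatCompletionChunk
class Message
interface VectorStore

interface Chat {
  // Emits completion chunks as the model produces them, instead of one final response.
  fun createChatCompletions(request: ChatCompletionRequest): Flow<ChatCompletionChunk>

  // Streaming prompt helpers; each emits generated text incrementally.
  fun promptStreaming(question: String, context: VectorStore): Flow<String>
  // Second overload shown with assumed parameters.
  fun promptStreaming(messages: List<Message>, context: VectorStore): Flow<String>
}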

Example

package com.xebia.functional.xef.auto.streaming

import com.xebia.functional.xef.auto.llm.openai.OpenAI
import com.xebia.functional.xef.auto.llm.openai.OpenAIEmbeddings
import com.xebia.functional.xef.llm.Chat
import com.xebia.functional.xef.vectorstores.LocalVectorStore

suspend fun main() {
  val chat: Chat = OpenAI.DEFAULT_CHAT
  val embeddings = OpenAIEmbeddings(OpenAI.DEFAULT_EMBEDDING)
  val vectorStore = LocalVectorStore(embeddings)
  // promptStreaming returns a cold Flow<String>; collecting it starts the
  // request and prints each chunk of the response as soon as it arrives.
  chat.promptStreaming(
    question = "What is the meaning of life?",
    context = vectorStore
  ).collect {
    print(it)
  }
}
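
The GPT4All side of this (per the bullet list above) bridges a callback-based token generator into a Flow using a channel. Below is a minimal sketch of that pattern; the LocalModel.generate callback API is hypothetical and stands in for the real GPT4All binding.

import kotlinx.coroutines.Dispatchers
import kotlinx.coroutines.channels.trySendBlocking
import kotlinx.coroutines.flow.Flow
import kotlinx.coroutines.flow.channelFlow
import kotlinx.coroutines.launch

// Hypothetical blocking generator standing in for the GPT4All binding.
class LocalModel {
  fun generate(prompt: String, onToken: (String) -> Unit) {
    prompt.split(" ").forEach { onToken("$it ") } // stub: echoes the prompt token by token
  }
}

// Bridge the callback into a cold Flow: collection starts generation,
// and each callback invocation becomes an emitted element.
fun LocalModel.generateStream(prompt: String): Flow<String> =
  channelFlow {
    launch(Dispatchers.IO) {     // run the blocking generation off the collector's thread
      generate(prompt) { token ->
        trySendBlocking(token)   // push each token into the channel as it arrives
      }
    }
  }

Because channelFlow is cold, nothing runs until the returned Flow is collected, which matches the promptStreaming behavior shown in the example.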

These changes enable developers to perform streaming chat completions and receive responses incrementally, enhancing the real-time interactive chat experience.

raulraja and others added 28 commits June 21, 2023 22:22
# Conflicts:
#	core/src/commonMain/kotlin/com/xebia/functional/xef/llm/openai/OpenAIEmbeddings.kt
# Conflicts:
#	core/src/commonMain/kotlin/com/xebia/functional/xef/auto/CoreAIScope.kt
#	core/src/commonMain/kotlin/com/xebia/functional/xef/llm/models/functions/CFunction.kt
#	core/src/commonMain/kotlin/com/xebia/functional/xef/llm/openai/models.kt
#	kotlin/src/commonMain/kotlin/com/xebia/functional/xef/auto/DeserializerLLMAgent.kt
#	kotlin/src/commonMain/kotlin/com/xebia/functional/xef/auto/serialization/functions/FunctionSchema.kt
#	scala/src/main/scala/com/xebia/functional/xef/scala/auto/package.scala
… and java depends on openai module for defaults. xef core does not depend on open ai
# Conflicts:
#	core/src/commonMain/kotlin/com/xebia/functional/xef/auto/AI.kt
#	core/src/commonMain/kotlin/com/xebia/functional/xef/auto/AIRuntime.kt
#	core/src/commonMain/kotlin/com/xebia/functional/xef/auto/AiDsl.kt
#	core/src/commonMain/kotlin/com/xebia/functional/xef/auto/CoreAIScope.kt
#	core/src/commonMain/kotlin/com/xebia/functional/xef/llm/models/chat/Message.kt
#	core/src/commonMain/kotlin/com/xebia/functional/xef/llm/models/chat/Role.kt
#	core/src/commonMain/kotlin/com/xebia/functional/xef/llm/models/text/CompletionRequest.kt
#	examples/kotlin/src/main/kotlin/com/xebia/functional/xef/auto/CustomRuntime.kt
#	java/src/main/java/com/xebia/functional/xef/java/auto/AIScope.java
#	openai/src/commonMain/kotlin/com/xebia/functional/xef/auto/llm/openai/DeserializerLLMAgent.kt
#	openai/src/commonMain/kotlin/com/xebia/functional/xef/auto/llm/openai/ImageGenerationAgent.kt
#	openai/src/commonMain/kotlin/com/xebia/functional/xef/auto/llm/openai/MockAIClient.kt
#	openai/src/commonMain/kotlin/com/xebia/functional/xef/auto/llm/openai/OpenAIClient.kt
#	openai/src/commonMain/kotlin/com/xebia/functional/xef/auto/llm/openai/OpenAIEmbeddings.kt
#	openai/src/commonMain/kotlin/com/xebia/functional/xef/auto/llm/openai/OpenAIRuntime.kt
#	scala/src/main/scala/com/xebia/functional/xef/scala/auto/package.scala
… local models. Local models can be used in the AI DSL and interleaved with any model.
Base automatically changed from gpt4all-java-bindings to main July 3, 2023 18:55
# Conflicts:
#	core/src/commonMain/kotlin/com/xebia/functional/xef/llm/Chat.kt
#	examples/kotlin/src/main/kotlin/com/xebia/functional/xef/auto/gpt4all/Chat.kt
#	gpt4all-kotlin/src/jvmMain/kotlin/com/xebia/functional/gpt4all/GPT4All.kt
#	openai/src/commonMain/kotlin/com/xebia/functional/xef/auto/llm/openai/OpenAIClient.kt
raulraja commented Jul 4, 2023

@xebia-functional/team-ai

anamariamv requested review from a team, serras and javipacheco and removed request for a team July 6, 2023 09:48
Yawolf previously approved these changes Jul 6, 2023
serras previously approved these changes Jul 6, 2023
@@ -28,6 +29,9 @@ class MockOpenAIClient(
private val chatCompletion: (ChatCompletionRequest) -> ChatCompletionResponse = {
Contributor

Should we maybe split these methods into a different interface? I see more and more that a client implements the interface only partially.

Contributor Author

They've already been split into their own interfaces in main: Chat, ChatWithFunctions, etc. The remaining place that implements all of them like this is the MockClient. The AIClient interface in main is already unused and should be removed; I'll push a commit to this PR to remove it.
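
For context, the interface split referred to above looks roughly like this. Only the interface names Chat and ChatWithFunctions come from the comment; the member signatures and placeholder types are illustrative assumptions.

import kotlinx.coroutines.flow.Flow

// Illustrative placeholder types.
class ChatCompletionRequest
class ChatCompletionChunk
class FunctionCall

// Each capability lives in its own interface, so a client
// implements only what it actually supports.
interface Chat {
  fun createChatCompletions(request: ChatCompletionRequest): Flow<ChatCompletionChunk>
}

interface ChatWithFunctions : Chat {
  suspend fun promptWithFunctions(prompt: String): FunctionCall // assumed member
}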

raulraja dismissed stale reviews from serras and Yawolf via 1e25e4d July 6, 2023 12:07
raulraja merged commit 681dfc0 into main Jul 6, 2023
1 check passed
raulraja deleted the streaming-responses branch July 6, 2023 12:22