summaries: refactor to use openai api model for inference #59

saghul · 2024-02-16T08:07:51Z

No description provided.

skynet/modules/ttt/summaries/processor.py

skynet/env.py

skynet/modules/ttt/openai_api/app.py

skynet/modules/ttt/summaries/processor.py

saghul

Left some comments, PTAL. It's not clear to me how the 2 ways of running the OpenAI-compat app are attained. Wasn't the executor going to hit an internal OpenAI-compat service locally bypassing auth?

skynet/modules/ttt/openai_api/app.py

skynet/modules/ttt/summaries/jobs.py

skynet/modules/ttt/summaries/processor.py

skynet/modules/ttt/summaries/prompts/action_items.py

skynet/modules/ttt/summaries/prompts/summary.py

skynet/modules/ttt/summaries/v1/router.py

skynet/auth/bearer.py

skynet/main.py

skynet/modules/ttt/summaries/v1/models.py

skynet/modules/ttt/summaries/v1/router.py

skynet/utils.py

skynet/auth/bearer.py

* inference jobs of customers with existing OpenAI keys will run on OpenAI servers

saghul

Left some comments, PTAL!

skynet/auth/openai.py

saghul · 2024-02-29T11:12:29Z

skynet/auth/openai.py

+        open_yaml(openai_credentials_file)
+        file_watcher = FileWatcher(openai_credentials_file, lambda: open_yaml(openai_credentials_file))
+
+        file_watcher.start()


When are we going to stop this?

When the process stops. Is there any other time when we'd want it to stop?

Not really, in general I prefer explicit vs implicit. No big deal though.

skynet/auth/openai.py

skynet/modules/file_watcher.py

saghul · 2024-02-29T11:18:19Z

skynet/modules/file_watcher.py

+                log.error(f'Error while polling for file changes: {e}')
+                break
+
+    def start(self):


I guess there should be a stop here...
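To make the "explicit vs implicit" point concrete, here is a minimal sketch of a polling watcher with a matching `stop()`. It is not the actual `skynet/modules/file_watcher.py` implementation: the class name `PollingFileWatcher`, the thread-based polling, and the `interval` parameter are all assumptions made for illustration.

```python
import os
import threading


class PollingFileWatcher:
    """Hypothetical stand-in for the PR's FileWatcher, not the real skynet class."""

    def __init__(self, path, callback, interval=1.0):
        self.path = path
        self.callback = callback
        self.interval = interval
        self._stop_event = threading.Event()
        self._thread = None
        self._last_mtime = os.stat(path).st_mtime_ns

    def _poll(self):
        # Event.wait() returns True as soon as stop() sets the event,
        # which ends the loop without waiting for a full interval.
        while not self._stop_event.wait(self.interval):
            try:
                mtime = os.stat(self.path).st_mtime_ns
            except OSError as e:
                print(f'Error while polling for file changes: {e}')
                break
            if mtime != self._last_mtime:
                self._last_mtime = mtime
                self.callback()

    def start(self):
        self._stop_event.clear()
        self._thread = threading.Thread(target=self._poll, daemon=True)
        self._thread.start()

    def stop(self):
        # Explicit counterpart to start(): signal the loop and wait for it to exit.
        self._stop_event.set()
        if self._thread is not None:
            self._thread.join()
            self._thread = None
```

With something like this, the caller could pair `file_watcher.start()` with a `file_watcher.stop()` in the app's shutdown path instead of relying on process exit.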

skynet/modules/ttt/summaries/app.py

skynet/modules/ttt/summaries/jobs.py

saghul · 2024-02-29T11:37:21Z

skynet/modules/ttt/summaries/processor.py

-        n_ctx=4096,
-        n_gpu_layers=llama_n_gpu_layers,
-        n_batch=llama_n_batch,
+    global llm


Is this initialize necessary? Can't we just create it at import time?

If we move it like that, we also need to better isolate the summaries:dispatcher module to make sure it's not importing it indirectly.

Does the dispatcher import anything from the processor? It kinda shouldn't.

It imports the jobs module, which in turn has a reference to the processor.

Gotcha. No strong opinion then.

saghul

LGTM, excellent work! Since I opened the PR you need to approve it yourself lol

saghul · 2024-02-29T16:18:25Z

Please squash-merge.

saghul commented Feb 16, 2024

skynet/modules/ttt/summaries/processor.py

skynet/env.py (outdated)

skynet/modules/ttt/openai_api/app.py (outdated)

skynet/modules/ttt/summaries/processor.py (outdated)

quitrk force-pushed the tavram/openai branch from 7c24f74 to cc85d55 on February 20, 2024 11:22

quitrk changed the title from "Tavram/openai" to "summaries: refactor to use openai api model for inference" on Feb 20, 2024

quitrk force-pushed the tavram/openai branch 2 times, most recently from 9db544d to 28c7987, on February 23, 2024 14:17

quitrk added 2 commits February 26, 2024 16:36

chore: update langchain

e27d25b

summaries: refactor to use openai api model for inference

66aa934

quitrk force-pushed the tavram/openai branch from 28c7987 to 9008f9c on February 26, 2024 15:41

saghul commented Feb 27, 2024

openai: add specific handling for openai requests

aa4c9d0

quitrk force-pushed the tavram/openai branch 2 times, most recently from ea480d4 to 256d58a, on February 27, 2024 13:41

ref: code review changes

4cddaf6

quitrk force-pushed the tavram/openai branch from 256d58a to 4cddaf6 on February 27, 2024 13:42

saghul commented Feb 27, 2024

skynet/auth/bearer.py

skynet/main.py

skynet/modules/ttt/summaries/v1/models.py (outdated)

skynet/modules/ttt/summaries/v1/router.py

skynet/utils.py (outdated)

openai-api: bypass auth for requests coming from within the app

e44a338

saghul commented Feb 28, 2024

skynet/auth/bearer.py

openai: implement querying for credentials file

1bcab62

* inference jobs of customers with existing OpenAI keys will run on OpenAI servers

quitrk force-pushed the tavram/openai branch from d948388 to 1bcab62 on February 29, 2024 11:06

saghul commented Feb 29, 2024

ref: code review changes

e8c40bb

saghul commented Feb 29, 2024

quitrk merged commit a0e7683 into master Mar 1, 2024

quitrk deleted the tavram/openai branch March 1, 2024 07:48
