
remove task queue #205

Merged
merged 24 commits into from
Nov 19, 2023

Conversation

pmeier
Member

@pmeier pmeier commented Nov 17, 2023

Closes #183, closes #204. TL;DR: this PR gets rid of huey / the task queue. It has been a source of confusion, limited some nice-to-have features (looking at you, #185), and overall made the implementation more complex.

Here are the highlights:

  • Remove the task queue and huey as a dependency
  • Rename Chat.prepare and Chat.answer to Chat.aprepare and Chat.aanswer to better reflect their async nature. See remove task queue #205 (comment)
  • Re-add Chat.prepare and Chat.answer as synchronous versions of their async counterparts. See remove task queue #205 (comment)
  • Add a new ragna.local_root function that acts as global configuration for local storage
  • Remove the config as a parameter for components. If they need local storage, the new ragna.local_root function should be used.
  • Allow components to implement regular as well as async functions. A regular function will now run on a separate thread, keeping Ragna "async first"
  • Create a ragna.deploy namespace and move the config, authentication, REST API, and UI implementation to it
  • Rename config.core to config.components
  • Move authentication and document settings to the "global" config scope, i.e. config.authentication and config.document
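The sync-over-async pattern from the highlights above (async-first methods, sync counterparts re-added on top, regular component functions off-loaded to a thread) can be sketched as follows. This is an illustrative sketch only, not ragna's actual implementation; the component method `_compute` is hypothetical, and `asyncio.to_thread` stands in for the `anyio` helper the PR uses:

```python
import asyncio
import inspect


class Chat:
    """Minimal sketch of the async-first design described above."""

    async def aanswer(self, prompt: str) -> str:
        # Components may implement either a regular or an async function.
        # A regular function is off-loaded to a worker thread so the event
        # loop stays responsive ("async first").
        fn = self._compute
        if inspect.iscoroutinefunction(fn):
            return await fn(prompt)
        return await asyncio.to_thread(fn, prompt)

    def answer(self, prompt: str) -> str:
        # Synchronous counterpart re-added on top of the async version.
        return asyncio.run(self.aanswer(prompt))

    def _compute(self, prompt: str) -> str:
        # Hypothetical regular (sync) component function.
        return prompt.upper()


chat = Chat()
print(chat.answer("hello"))  # runs the async path under the hood
```

Because the dispatch inspects the function at call time, a component author can freely switch `_compute` between a plain function and a coroutine function without changing any caller.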

I didn't bother to update the documentation properly as we need to have a complete overhaul before the next release anyway.

@pmeier pmeier mentioned this pull request Nov 17, 2023
@pmeier pmeier marked this pull request as ready for review November 18, 2023 22:17
ragna/core/_rag.py (review thread, outdated)
@pmeier
Member Author

pmeier commented Nov 18, 2023

@nenb Would you be able to test this PR and see if it works as expected? Just note that I didn't implement any streaming for #185 just yet. This will come in a follow-up PR.

ragna/core/_rag.py (review thread, outdated)
@pmeier
Member Author

pmeier commented Nov 19, 2023

Sleeping on this, I think we should not block the removal of the task queue on the issues with the sync endpoints. They were only suggested by me in #183 (comment) to make it clearer that there is blocking behavior. But with this PR, there is no blocking anymore: even without workers, everything is async. Thus, we can leave the sync endpoints for later. I'll remove them again from this PR and open an issue to fix them later.

@pmeier
Member Author

pmeier commented Nov 19, 2023

@nenb I'm merging this without waiting for your reply so I can move on to the other features that build on top of this. Since we are not adding any sync endpoints for now, I'm confident that I didn't break anything. Feel free to comment here or open a new issue if you find something that I missed.

@pmeier pmeier merged commit 2c1e5c4 into main Nov 19, 2023
10 checks passed
@pmeier pmeier deleted the remove-task-queue branch November 19, 2023 14:28
@nenb
Contributor

nenb commented Nov 19, 2023

@pmeier Nice work! Sorry I haven't been terribly responsive the last few days; I should be back online from tomorrow.

From looking through the PR now, the only bit that caught my eye was anyio.to_thread.run_sync. Some of the chunking/embedding can be quite CPU-intensive, and I was wondering if it might be better to run it in a separate process (to_process from anyio). I downloaded 5 books from Project Gutenberg in .txt format and tried to do a quick profile of threads vs. processes for these 5 books. Unfortunately, I got blocked on a bunch of serialisation issues (related to the chromadb ONNX format for their default model, and then related to tiktoken), so I wasn't able to perform this comparison.

I'm mentioning this here for future reference. I know you have a bunch of work planned, and this is likely premature optimisation. But perhaps worth keeping in mind ways to mitigate the impact of CPU-heavy workloads in future designs. And also some of the defaults, like the values that are used when starting the API with uvicorn. Again, likely premature optimisation on my part!
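The thread-vs-process trade-off raised here can be sketched with stdlib equivalents of anyio's helpers: `asyncio.to_thread` for thread off-loading, and a `ProcessPoolExecutor` standing in for `anyio.to_process.run_sync`. The function `chunk_documents` below is a made-up stand-in for CPU-heavy chunking/embedding work, and the serialisation issues mentioned above show up exactly at the process boundary, where arguments and results must be picklable:

```python
import asyncio
from concurrent.futures import ProcessPoolExecutor


def chunk_documents(texts):
    # Stand-in for CPU-heavy chunking/embedding work.
    return [len(t.split()) for t in texts]


async def main():
    texts = ["the quick brown fox", "jumps over the lazy dog"]

    # Thread off-loading (what anyio.to_thread.run_sync does): keeps the
    # event loop free, but pure-Python work is still bound by the GIL.
    via_thread = await asyncio.to_thread(chunk_documents, texts)

    # Process off-loading (analogous to anyio.to_process.run_sync):
    # side-steps the GIL, but everything crossing the boundary must be
    # picklable -- which is where e.g. the chromadb ONNX model blocked.
    loop = asyncio.get_running_loop()
    with ProcessPoolExecutor() as pool:
        via_process = await loop.run_in_executor(pool, chunk_documents, texts)

    assert via_thread == via_process
    return via_thread


result = asyncio.run(main())
print(result)
```

Process off-loading only pays off once the per-call work dwarfs the pickling and process start-up overhead, which is one reason treating this as premature optimisation for now is reasonable.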

@pmeier
Member Author

pmeier commented Nov 20, 2023

Unfortunately I got blocked on a bunch of serialisation issues (related to the chromadb ONNX format for their default model, and then related to tiktoken) and so I wasn't able to perform this comparison.

Do you think this is an issue with Ragna? If so, could you write up an issue so I can have a look?

I'm mentioning this here for future reference. I know you have a bunch of work planned, and this is likely premature optimisation. But perhaps worth keeping in mind ways to mitigate the impact of CPU-heavy workloads in future designs.

Yeah, I would put it under premature optimization unless we have valid complaints about this. In that case, I would maybe go a step further and not just add the subprocess option; a task queue would also fit nicely in there.

@nenb
Contributor

nenb commented Nov 20, 2023

Do you think this is an issue with Ragna? If so, could you write up an issue so I can have a look?

I don't think it's an issue with ragna, but I will write up some details (along with a bunch of other things I haven't responded to) throughout today, so you can have a look.

@nenb
Contributor

nenb commented Nov 21, 2023

@pmeier I have opened an issue here with details.

Successfully merging this pull request may close these issues:

  • queueless Ragna?
  • Change the default queue implementation to something other than memory