knowledge retrieval embedding model is taking long to index. #54

Open
sangee2004 opened this issue Feb 14, 2024 · 4 comments
Labels: bug (Something isn't working), knowledge retrieval

Comments

@sangee2004

Creating an Assistant with files takes a very long time to finish loading the files.
In my case, it took about 8 minutes to load a 6.5 MB PDF document.

This is my Mac configuration:

[Screenshot: Screen Shot 2024-02-14 at 3 48 08 PM]

From logs:

2024-02-14 15:17:41 [2024-02-14 23:17:41,913: INFO/ForkPoolWorker-8] Task app.tasks.execute_chat_completion[ec15a362-aba8-4405-84b2-0adce83bf509] succeeded in 11.899923254997702s: None
2024-02-14 15:22:55 [2024-02-14 23:22:55,504: INFO/MainProcess] Task app.tasks.execute_asst_file_create[341a9ecc-bf19-4e6f-ab37-6071404c74f8] received
2024-02-14 15:22:55 [2024-02-14 23:22:55,515: INFO/ForkPoolWorker-8] processing file id : file_176e8b
2024-02-14 15:31:02 [2024-02-14 23:31:02,488: WARNING/ForkPoolWorker-8] [447725520764156016, 447725520764156017, 447725520764156018, 447725520764156019, 447725520764156020, 447725520764156021, 447725520764156022, 447725520764156023, 447725520764156024, 447725520764156025, 447725520764156026, 447725520764156027, 447725520764156028, 447725520764156029, 447725520764156030, 447725520764156031, 447725520764156032, 447725520764156033, 447725520764156034, 447725520764156035, 447725520764156036, 447725520764156037, 447725520764156038]
2024-02-14 15:31:02 [2024-02-14 23:31:02,490: INFO/ForkPoolWorker-8] file file_176e8b processing completed
2024-02-14 15:31:02 [2024-02-14 23:31:02,495: INFO/ForkPoolWorker-8] Task app.tasks.execute_asst_file_create[341a9ecc-bf19-4e6f-ab37-6071404c74f8] succeeded in 486.98781755700475s: None
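
As a cross-check on the reported duration, the wall-clock time can be recomputed from the log itself. The sketch below assumes the bracketed timestamps on the "received" and "succeeded" lines are in the same clock and time zone; the variable and function names are illustrative and not part of the project.

```python
from datetime import datetime

# Bracketed timestamps copied from the "received" and "succeeded" log lines.
received = "2024-02-14 23:22:55,504"
succeeded = "2024-02-14 23:31:02,495"

# Timestamp layout as it appears in the log: date, time, comma, milliseconds.
FMT = "%Y-%m-%d %H:%M:%S,%f"


def elapsed_seconds(start: str, end: str) -> float:
    """Return the number of seconds between two log timestamps."""
    delta = datetime.strptime(end, FMT) - datetime.strptime(start, FMT)
    return delta.total_seconds()


wall = elapsed_seconds(received, succeeded)
print(f"{wall:.3f} s (~{wall / 60:.1f} min)")  # 486.991 s (~8.1 min)
```

The result agrees with the 486.99 s that the task itself reports, so almost no time is spent waiting between the task being received and a worker starting on it.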

sangee2004 changed the title from "Loading files as part of Assistant creation takes a very long time." to "knowledge retrieval embedding model is taking long to index." on Feb 14, 2024

@sanjay920
Collaborator

sanjay920 commented Feb 16, 2024

Commit ed4532a should address this.
It made indexing ~20% faster on my M2 Max, and it should be more than 50% faster on your machine.

@sangee2004
Author

I still see the same time taken for loading images:

From logs:

Task app.tasks.execute_asst_file_create[58078a77-84d1-4713-8c77-9735d347ec99] succeeded in 464.2842810860002s: None
2024-02-16 14:16:07 2024-02-16 22:16:07,949 - INFO - Encoding batch 1/3
Batches: 100%|██████████| 1/1 [02:42<00:00, 162.92s/it]
2024-02-16 14:18:50 2024-02-16 22:18:50,888 - INFO - Encoding batch 2/3
Batches: 100%|██████████| 1/1 [02:29<00:00, 149.85s/it]
2024-02-16 14:21:20 2024-02-16 22:21:20,757 - INFO - Encoding batch 3/3
Batches: 100%|██████████| 1/1 [02:26<00:00, 146.93s/it]
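
A quick piece of arithmetic shows where the time goes. The sketch below assumes the three progress-bar lines cover all of the encoding work for this file; because each bar reports a single iteration, its s/it figure is taken as that batch's total time. The variable names are illustrative.

```python
# Per-batch encoding times in seconds, copied from the progress-bar lines above.
batch_seconds = [162.92, 149.85, 146.93]

# Total task time reported for execute_asst_file_create in the same log.
task_seconds = 464.2842810860002

encode_total = sum(batch_seconds)
share = encode_total / task_seconds
print(f"encoding: {encode_total:.2f} s of {task_seconds:.2f} s ({share:.1%})")
# encoding: 459.70 s of 464.28 s (99.0%)
```

Under that assumption, embedding accounts for roughly 99% of the task, so parsing and storage are not the bottleneck here.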

@sangee2004
Author

The Docker Desktop memory limit was 8 GiB for the results in my earlier comments.
Increasing the limit to 16 GiB (and restarting rubra) does not help; I see the same loading time.

@tybalex
Contributor

tybalex commented Feb 24, 2024

improved by PR #101
