Indexing 1.7MB file causes CPU to crash in memory #15223
Comments
The host has 8 GB of memory, and the worker container is limited to 3 GB because otherwise indexing may crash the server. It has already crashed 4-5 times.
I found a similar closed issue where a solution was suggested to address service lag during document indexing. The solution involves deploying the worker service related to document indexing separately on other machines to distribute the load and prevent the main service from freezing when uploading large documents.

You can start the Worker service on a different machine using the following commands.

For Linux or macOS:
    celery -A app.celery worker -P gevent -c 1 --loglevel INFO -Q dataset,generation,mail,ops_trace

For Windows:
    celery -A app.celery worker -P solo --without-gossip --without-mingle -Q dataset,generation,mail,ops_trace --loglevel INFO

Additionally, ensure that your Docker setup meets the necessary requirements:
- On macOS, configure Docker Desktop to use at least 2 virtual CPUs and 8 GB of initial memory.
- On Linux, ensure Docker 19.03 or later and Docker Compose 1.25.1 or later are installed.
- On Windows with WSL 2 enabled, store source code and other data in the Linux filesystem rather than the Windows filesystem [1].

Regarding the segmentation rules, ensure they are properly configured. If the segmentation identifier does not match any part of the document, or if the document is empty or only contains elements that are filtered out during preprocessing, the resulting segments could be empty, leading to a segment length and paragraph count of zero [2].
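To sanity-check a segmentation setting before uploading, a small script can show how many segments a given separator produces. The sketch below is only an illustration under assumed behavior: `split_into_segments` and `summarize` are hypothetical helpers, not Dify's actual indexing code, and the real preprocessing rules may differ. In this simplified model, an empty or whitespace-only document yields zero segments, while a separator that never matches leaves the whole document as one large segment.

```python
def split_into_segments(text: str, separator: str) -> list[str]:
    """Split text on a separator and drop segments that are empty after stripping.

    Hypothetical helper for illustration; not taken from Dify.
    """
    if not separator:
        raise ValueError("separator must be non-empty")
    return [part.strip() for part in text.split(separator) if part.strip()]


def summarize(text: str, separator: str) -> tuple[int, int]:
    """Return (segment_count, total_characters) as a quick sanity check."""
    segments = split_into_segments(text, separator)
    return len(segments), sum(len(segment) for segment in segments)


if __name__ == "__main__":
    # An empty document produces no segments at all.
    print(summarize("", "\n\n"))
    # A separator that never occurs leaves a single, undivided segment.
    print(summarize("one long paragraph", "###"))
```

If the count is zero, or one segment holds nearly the whole file, the separator is worth revisiting before indexing.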
This might be related to a memory leak in our plugin daemon. langgenius/dify-plugin-daemon#55
Self Checks
Dify version
1.0.0
Cloud or Self Hosted
Self Hosted (Docker)
Steps to reproduce
CONTAINER ID   NAME              CPU %     MEM USAGE / LIMIT   MEM %    NET I/O           BLOCK I/O         PIDS
d97f561e5694   docker-worker-1   100.87%   2.752GiB / 3GiB     91.74%   90.4kB / 60.6kB   8.32MB / 1.78MB   5
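The stats line above shows the worker close to its memory limit. As a rough sketch, the `MEM USAGE / LIMIT` field can be turned into a usage percentage and remaining headroom. The helper names `parse_size` and `memory_headroom` are made up for this example, and the parser assumes the binary units (`B`, `KiB`, `MiB`, `GiB`) that appear in this column.

```python
import re

# Binary unit multipliers as they appear in the MEM USAGE / LIMIT column.
_UNITS = {"B": 1, "KiB": 1024, "MiB": 1024**2, "GiB": 1024**3}


def parse_size(value: str) -> float:
    """Convert a string such as '2.752GiB' into a number of bytes."""
    match = re.fullmatch(r"([0-9.]+)\s*([KMG]iB|B)", value.strip())
    if match is None:
        raise ValueError(f"unrecognized size: {value!r}")
    number, unit = match.groups()
    return float(number) * _UNITS[unit]


def memory_headroom(mem_field: str) -> tuple[float, float]:
    """Return (percent_used, free_MiB) for a 'USED / LIMIT' field."""
    used_text, limit_text = mem_field.split("/")
    used = parse_size(used_text)
    limit = parse_size(limit_text)
    return used / limit * 100, (limit - used) / 1024**2


if __name__ == "__main__":
    percent, free_mib = memory_headroom("2.752GiB / 3GiB")
    print(f"{percent:.2f}% used, {free_mib:.0f} MiB free")
```

For the reported values this gives roughly 91.7% used, in line with the MEM % column, and only about 254 MiB of headroom, so a small further increase during indexing would hit the 3 GiB limit.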
✔️ Expected Behavior
Indexing completes successfully.
❌ Actual Behavior
No response