[Support] Autotagging still not working #12523
Replies: 4 comments 3 replies
-
|
The The "model does not exist" message afterward is a side effect: the process was killed before it could write the model file to disk, leaving paperless with nothing to load on subsequent runs. To work around this, schedule the classifier task outside your maintenance window using |
Beta Was this translation helpful? Give feedback.
-
|
Thank you for your reply, that clarifies some doubts I had. However, I find the RAM consumption quite heavy. I started another run this morning, the first time it was killed just after two hours, the second time is still running, but RAM is at 80-90%, hence I expect it being killed anytime soon. And it will take longer than 24 hours, most likely (after 8 hours, it is still training the tags classifier). I would have to suspend all scheduled tasks, I need to check which ones I can suspend. |
Beta Was this translation helpful? Give feedback.
This comment has been hidden.
This comment has been hidden.
-
|
Ok, it's done, it took about ten hours and 20 minutes in total, and that only because I stopped all other software that could be stopped on the NAS. If I hadn't, it would have taken around 14 to 16 hours and probably not ended properly. How can I make sure that the classifier keeps the data? I hope additional training will be much faster end less RAM consuming, or will it always be more or less the same? |
Beta Was this translation helpful? Give feedback.
Uh oh!
There was an error while loading. Please reload this page.
-
What's your question or issue?
I am writing in continuation of this post: [https://github.com//discussions/9902#discussion-8295206]
At some point the classifier stopped again working. I tried to start it again yesterday. In the log, I get these messages:
[2026-04-06 18:25:43,022] [DEBUG] [paperless.classifier] Gathering data from database... [2026-04-06 18:26:55,429] [DEBUG] [paperless.classifier] 7745 documents, 3085 tag(s), 568 correspondent(s), 199 document type(s). 57 storage path(s) [2026-04-06 18:27:10,561] [DEBUG] [paperless.classifier] Vectorizing data... [2026-04-06 18:27:10,561] [DEBUG] [paperless.classifier] Vectorizing data... [2026-04-06 19:49:18,052] [DEBUG] [paperless.classifier] Training tags classifier... [2026-04-07 00:41:53,700] [DEBUG] [paperless.classifier] Training correspondent classifier... [2026-04-07 02:00:05,646] [INFO] [paperless.tasks] Emptying trash of all expired documents [2026-04-07 02:00:06,823] [INFO] [paperless.tasks] Deleted 0 documents from trashand at the NAS, where I initiated the process via SSH, I get this message:
/usr/local/bin/document_create_classifier: line 17: 6180 Killed s6-setuidgid paperless python3 manage.py document_create_classifier “$@”I wonder if this could have to do with some operations which are scheduled to take place around 2am at the NAS (automatic software updates, virus scan, etc.). I restarted the classifier again this morning, according to the last log entries there should be a chance that it will finish before 2am the next day, if the NAS activities caused the problem. I will keep an eye on RAM consumption.
I still wonder: why did it stop classifying (because it used to work after going through the process previously, but stopped after days or weeks of operation), and why doesn't it try to restart? Why does it say (after it had worked for a while):
[paperless.classifier] Document classification model does not exist (yet), not performing automatic matching.There should be sufficient RAM (16 GB, of which about 50% is being used during the process, however I couldn't check at 2 am and don't know if it had built up during the process).
And I sincerely apologize, if there is another post written by me about the same issue or question besides the one referred to above (which is locked and hence cannot be continued by me, or can it?), I looked for it and didn't find any, else I would have added this report there.
What have you tried?
No response
Paperless-ngx version
2.20.10
Host OS
Asustor NAS AS6104T
Installation method
Docker - official image
System status
No response
Relevant logs or output
No response
Beta Was this translation helpful? Give feedback.
All reactions