Replies: 1 comment
-
|
One thing to note, for Docker, custom init scripts are probably easier for users: https://docs.paperless-ngx.com/advanced_usage/#custom-container-initialization |
Beta Was this translation helpful? Give feedback.
0 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Uh oh!
There was an error while loading. Please reload this page.
-
With the new feature coming in v3 of paperless - developing and integrating third party parsers - I am happy to share a docling flavored parser to convert your favorite documents! 🐥
Pypi: https://pypi.org/project/pgx-docling-parser-serve/
GitHub: https://github.com/T-Eberle/paperless-docling-parser
This is just a taste of what can come in future - means this is still quite experimental and WIP.
Here are a few things to point out:
Things you can expect in the future:
vector_databasefeature, where the parsed data can directly be chunked and uploaded to a vector database. Docling has the hybrid chunking feature, which I could imagine to use here.To use the plugin, simply install it, e.g. by creating your own dockerfile:
Feel free to contribute and share your ideas!🚀
Beta Was this translation helpful? Give feedback.
All reactions