Custom metadata for documents #141
-
This was already discussed on the old issue board (jonaswinkler/paperless-ng#274) and it is still relevant for paperless-ngx. There were two models suggested by Jonas about a year ago:
Any preferences here? Jonas wrote back then that implementing the first option would be much more complicated than the second one, although proper custom fields would be more elegant (e.g. the use-case of invoice amount & invoice number). |
Beta Was this translation helpful? Give feedback.
Replies: 20 comments 17 replies
-
I like the concept of custom metadata, but I think if it were to be done, wouldn't the use case be additional filter/sorting criteria that appears in the front-end UI filter bar? That makes the simple text field rather useless, as the document is already consumed and OCR'ed. Filtering down the document type/correspondent would get you most of the way there. Then any additional searching by the metadata field would be essentially the same as searching the full-text (in the invoice # or amount example). I guess if the contents of the metadata were not in the body of the document already that could be useful. But if that were the approach, I'd rather that simple text field be an unlimited length text blob which could work as essentially a free-text note stapled to the document. The alternative approach of custom metadata fields would be far more useful for searching/filtering. Maybe both enhancements for separate use cases? |
Beta Was this translation helpful? Give feedback.
-
I'm looking for a field to be able to more easily link a document to a key in another application. One of my main Paperless uses is for Explanation of Benefits (EOB) Scanning. So far, I have been creating a matching TAG in Paperless for unique ID that is created as I enter items in my other app. This is resulting in lots of similar tags. Having another (custom) field to enter that data in would help me in having fewer tags. |
Beta Was this translation helpful? Give feedback.
-
I have seen some commercial DMS Systems, they mostly work with custom metadata-fields per document type. This would be a really mighty approach: One could fill them manually, or maybe use PAPERLESS_POST_CONSUME_SCRIPT to fill them up with data extracted from full text, with some advanced logic and/or regex, or even enrich them with data from external data sources. I have a thousand ideas what i could do with this, by connecting it to ERPNext instances. (BTW: I`m thinking about creating an app which connects erpnext to paperless) One technical approach would be to create a additional database-table per document type, in which every metadata field has its own column. So by creation of a metadata field you would also have to decide about the field-type: string, int, float, date for example. With this approach you could also do effective SQL queries to retrieve some more specific data, then you can with a full-text search. I`m thinking about something like the "edit Doctype" function from the Frappe Framework But yes, it`s obviously not trivial to implement. One easier idea It would be to add one or two text fields: One could be used to store human readable data AKA notes, the other for storing metadata (key,value pairs) in json, for example. |
Beta Was this translation helpful? Give feedback.
-
I belive this is the better design. |
Beta Was this translation helpful? Give feedback.
-
How could the search ui look like for filtering? |
Beta Was this translation helpful? Give feedback.
-
Hi Everyone, I recently started looking for a document management system for my private use and I stumbled upon paperless which looks great. In a previous job, I developed a small software used internally to classify important amounts of documents. Quite similar to paperless in the idea. This has a lot of side benefits like being able to store ids from other systems (customer id, bill id, etc). The OCR is great but it does not understand the content and it is not always perfectly interpreting the text. |
Beta Was this translation helpful? Give feedback.
-
Any news on this? Would really like to sponsor the easiest implementation (just a "notes" field for ... notes; not mandatory searchable) with a pizza and a beer! |
Beta Was this translation helpful? Give feedback.
-
Any news going into this direction? maybe we should raise a Bountysource or something :) |
Beta Was this translation helpful? Give feedback.
-
I wouldn’t be opposed to that, personally. On 6 Apr 2023, at 22:05, Arya Senna ***@***.***> wrote:
Any news going into this direction? maybe we should raise a Bountysource or something :)
—Reply to this email directly, view it on GitHub, or unsubscribe.You are receiving this because you are subscribed to this thread.Message ID: ***@***.***>
|
Beta Was this translation helpful? Give feedback.
-
+1 For this |
Beta Was this translation helpful? Give feedback.
-
Custom metadata would be a great addition. One of the few things I still miss in paperless |
Beta Was this translation helpful? Give feedback.
-
Hello community, Is ist not possibel to combine 3 of the top feature request merge + reference + metadata?
Make metadata fields which have different typs e.g, Date, referenz like Tags, text, text search with the integatet KI ? Or would it be possibel to expand the commend field with KI Search options to get special values from document? Finally use the existing and add some nice stuff like paperless-ngx it already do so much time to paperless-ng. Thanks to all the greate developers you are incredible. |
Beta Was this translation helpful? Give feedback.
-
I came across this feature request looking for a solution to my use case which hasn't been mentioned so far but seems related. Among other things I store a lot of receipts for HSA expense tracking. I wanted to store the total dollar amount in a custom field. I presently use a tag for the year and thought it would be cool if I could query answers to questions like 'how much did I spend on tag 'x' (i.e. Medical) in year 2023 (i.e. tag 2023)?' There are a ton of other possibilities for something like this. |
Beta Was this translation helpful? Give feedback.
-
I would love to have a custom meta field for account number, and batch number for our billing documents. +1 Is there a bounty program for something like this? |
Beta Was this translation helpful? Give feedback.
-
I would also appreciate having customs fields. For me, a sum (as a an invoice sum) would be really helpful for example. Having look to docspell, there are five types: string, integer (numeric), boolean, date, and currency. I think, these types would cover most of the use cases. |
Beta Was this translation helpful? Give feedback.
-
I'd really appreciate the ability to add a comment or note to something. In Teedy, this is handled with a comments field which show up like a chat about a document when multiple users add comments. They are then indexed and show up in search results. |
Beta Was this translation helpful? Give feedback.
-
My use-cases for custom fields would be:
|
Beta Was this translation helpful? Give feedback.
-
Just to add my two cents... I'm considering migrating from a custom solution to paperless. I have several custom metadata fields that I'd like to be able to preserve: GPS location, last modified date, postal address, source URI (where this came from) and "favorite". There are a few others, but they can be implemented as tags. Favorite can probably be a tag as well. GPS is the only one that looks like it would really benefit from being searchable. |
Beta Was this translation helpful? Give feedback.
-
Look what was just merged! #4502 |
Beta Was this translation helpful? Give feedback.
-
This discussion has been automatically locked since there has not been any recent activity after it was closed. Please open a new discussion for related concerns. |
Beta Was this translation helpful? Give feedback.
Look what was just merged! #4502