Training For Document Information Extraction #36

benugopal · 2022-08-27T15:54:55Z

Its a great project, and I want to try it out the approach without OCR.
I have 3 questions related training

We need to create ground truth for training test and validation, do we have any tool to perform the annotations to get the input as per training requirement.
For training I think you need to use OCR to create ground truth data, than how it is extracted during inference?
I see we need to provide dictionary hierarchy for classes in ground truth, can i use my own classes and custom hierarchy for ground truth example
{
"gt_parse": {
"Item": [
{
"Description": "SPGTHY BOLOGNASE",
"Quantity": "1",
"Price": "58,000"
},
{
"Description": "SPGTHY BOLOGNASE",
"Quantity": "1",
"Price": "58,000"
}],
```
 	"Total": {"value": "20"},
 	"Sub_Total": {"value": "50"},
 	"Number": {"value": "80"}}}
```

Could you please guide.

The text was updated successfully, but these errors were encountered:

Provide feedback