trOCR run example #451

vozdemir · 2021-09-29T06:29:45Z

Many thanks for TrOCR! I couldn't run TrOCR on text. How do I run it? Can you give an example, please?

Model I am using: TrOCR.

wolfshow · 2021-09-29T06:38:37Z

@vozdemir Can you provide more details about your question? You may just download the dataset provided and do the inference with the base/large models. Meanwhile, please refer to #448 for more details.

NielsRogge · 2021-10-01T12:40:54Z

@wolfshow would be great if you can fix my notebook to run inference with TrOCR on one particular image: https://colab.research.google.com/drive/1BHkOBUGHr1xlVQ9pLVLZPA0VvAxF4Zo_?usp=sharing

wolfshow · 2021-10-02T13:45:43Z

@Dod-o will help by providing an inference example for that.

Dod-o · 2021-10-02T20:02:04Z

@NielsRogge The inference example has been uploaded; please see pic_inference.py for details.

SeifeddineGharbi · 2021-10-04T08:46:18Z

@Dod-o When running pic_inference.py, I get the following error: urllib.error.HTTPError: HTTP Error 404: Not Found
It occurs when the script tries to download fairseq (log line: Downloading: "https://github.com/pytorch/fairseq/archive/master.zip").

Any help please, thank you!!

NielsRogge · 2021-10-04T08:53:00Z

@SeifeddineGharbi I figured it out (I had to create a custom fork of Fairseq in order to make it work).

Expect TrOCR to be added to Transformers soon ;)

SeifeddineGharbi · 2021-10-04T08:59:00Z

@NielsRogge Thank you so much for replying but can you explain a bit further, please?

wolfshow · 2021-10-04T11:15:40Z

We found that the fairseq model cannot be easily converted into the Hugging Face (hf) format, so we need more time to re-train the models with the hf library.

nithinreddyy · 2021-11-04T16:34:21Z

Hugging Face has added the TrOCR model to their models. You can look into it.

https://huggingface.co/transformers/model_doc/trocr.html

NielsRogge · 2021-11-04T17:03:42Z

@nithinreddyy haha I wrote that page 😅

nithinreddyy · 2021-11-04T17:09:26Z

Yaaa 😬😬. I've gone through the 3 notebooks, but you haven't written code for testing an image and extracting the text (for the model fine-tuned on the IAM dataset). In the 2nd notebook you trained the model on the IAM dataset, and in the 3rd notebook you just checked the test evaluation scores. But how do I take one image from the test data and extract the text?

NielsRogge · 2021-11-04T17:10:21Z

My inference notebook does exactly what you want.

nithinreddyy · 2021-11-04T17:38:14Z

But you are loading the model directly from Hugging Face. What if we have our own dataset and want to train the model on that data? In the 2nd notebook you trained a custom model on the IAM dataset, but you haven't written code showing how to extract text from one of the test images. I'm looking for that.
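
For a model you fine-tuned yourself, a minimal sketch of single-image inference with the Transformers API might look like the following. It is not taken from the notebooks discussed in this thread. The function names, the `model_path` directory (assumed to have been written by `save_pretrained` or a trainer checkpoint) and the image path are hypothetical, and `microsoft/trocr-base-handwritten` is only an assumed choice of processor.

```python
from typing import Any, Tuple


def load_trocr(model_path: str,
               processor_name: str = "microsoft/trocr-base-handwritten") -> Tuple[Any, Any]:
    """Load a processor and a TrOCR model.

    model_path may be a Hub id or a local directory with fine-tuned weights
    (hypothetical here). The processor is loaded separately because a training
    checkpoint does not necessarily contain its files.
    """
    # Imported lazily so recognize_line can be reused without loading the library.
    from transformers import TrOCRProcessor, VisionEncoderDecoderModel

    processor = TrOCRProcessor.from_pretrained(processor_name)
    model = VisionEncoderDecoderModel.from_pretrained(model_path)
    return processor, model


def recognize_line(image: Any, processor: Any, model: Any) -> str:
    """Return the text recognized in one single-line image (a PIL image in RGB mode)."""
    # Resize and normalize the image into the tensor the encoder expects.
    pixel_values = processor(images=image, return_tensors="pt").pixel_values
    # Autoregressively generate token ids for the text in the image.
    generated_ids = model.generate(pixel_values)
    # Turn the ids of the first (only) sequence back into a string.
    return processor.batch_decode(generated_ids, skip_special_tokens=True)[0]


# Usage (paths are placeholders, not real files):
#   from PIL import Image
#   processor, model = load_trocr("path/to/your/checkpoint-dir")
#   image = Image.open("path/to/test_line.png").convert("RGB")
#   print(recognize_line(image, processor, model))
```

Passing a Hub id such as `microsoft/trocr-base-handwritten` as `model_path` gives the stock model instead of a fine-tuned one; the recognition step is the same in both cases.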

NielsRogge · 2021-11-04T20:50:05Z

Sorry, I thought you were talking about recognizing text, but you mean extracting text.

The IAM dataset only contains single-line text images, hence one doesn't need to perform any text extraction anymore. However, if you want to apply TrOCR on an entire PDF document, then you first need a text extraction algorithm.

You can for example take a look at this one: https://github.com/qurator-spk/eynollah
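
Once a layout tool has produced line regions, the remaining step can be sketched roughly as below: crop each line out of the page image and pass it to a single-line recognizer. This is only an illustration under assumptions. The `Box` format, the function names and the naive top-to-bottom ordering are hypothetical, the boxes are assumed to come from whatever segmentation tool you use, and `recognize_line` stands for any callable that maps a line image to text (for example a wrapper around TrOCR).

```python
from typing import Any, Callable, Iterable, List, Tuple

# (left, upper, right, lower) in pixels, the order PIL's Image.crop expects.
Box = Tuple[int, int, int, int]


def sort_reading_order(boxes: Iterable[Box]) -> List[Box]:
    """Order boxes top-to-bottom, then left-to-right (naive; single-column pages only)."""
    return sorted(boxes, key=lambda b: (b[1], b[0]))


def recognize_page(page: Any,
                   boxes: Iterable[Box],
                   recognize_line: Callable[[Any], str]) -> List[str]:
    """Crop every line region from the page and recognize it separately."""
    texts = []
    for box in sort_reading_order(boxes):
        line_image = page.crop(box)  # page is expected to be a PIL image
        texts.append(recognize_line(line_image))
    return texts


# Usage (placeholders, not real files or real coordinates):
#   from PIL import Image
#   page = Image.open("path/to/page.png").convert("RGB")
#   boxes = [(40, 120, 900, 170), (40, 60, 900, 110)]
#   print("\n".join(recognize_page(page, boxes, my_line_recognizer)))
```

Multi-column or skewed pages need a proper reading-order step from the layout tool rather than this simple sort.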

Dod-o closed this as completed Oct 2, 2021

TheMightyRaider mentioned this issue Oct 5, 2021

Solving "HTTPError: HTTP Error 404: Not Found" on trocr inference #460

Closed

jarrod-dexter mentioned this issue Nov 28, 2021

How to use the generated PAGE-XML as input to TrOCR? qurator-spk/eynollah#58

Closed
