Fresh image processing #258

zamazan4ik · 2017-12-17T13:39:06Z

I have createdfresh branch for image processing stuff. Can you check it please?

manisandro · 2017-12-17T13:42:39Z

qt/src/Displayer.hh

@@ -65,6 +65,7 @@ public:

 public slots:
 	void setAngle(double angle);
+	void setScaledImage(const QImage& image, double scale = 1.0);


scaled image is just the image used in the UI, not the one used when recognizing. See Displayer::getImage.

It's just for testing. As far as i remember from our previous discuss about it, i should extend interface of Displayer.

This is actually non-trivial. The Displayer always queries the images directly from the source (i.e. an image file or pdf or djvu document) with the appropriate resolution, either for displaying in the gui, or for passing to tesseract for OCR. The getImage image method is only there for selection tools etc which need to operate on the currently displayed image. Conversely, a setImage method would make little sense because you would only be setting the image used for the UI at the current zoom level etc.
I see two options:
Parametrize the processing steps, so that they are applied to each rendered image, as is done for brightness, contrast etc, see
https://github.com/manisandro/gImageReader/blob/master/qt/src/DisplayRenderer.cc#L31
But I fear this might be too slow.
Otherwise you'd need to have the processing algorithms create a new temporary image, which is then passed to the displayer as a source. To make it elegant, you could actually extent the Source structure
https://github.com/manisandro/gImageReader/blob/master/qt/src/SourceManager.hh#L32
to allow specifying an alternative path to use instead of the path actually specified by the source, and if that path is not empty, then that image is used instead. This has the benefit that you don't need to pollute the source list in the UI with extra entries for the processing output. This alternative path entry would probably have to be a
QMap<int, QString>
since for multi-page documents you will need a temporary image for each page. For multi-page there is the additional challenge when to generate these temproary images, since generating them all at once will probably be to slow. Probably better to just generate them on demand.
I hope this makes sense to you.

You'll probably want to do something similar to DisplayRenderer::adjustImage.

Actually it depends on what operation you are performing, if they are operations which take a long time and should not be repeated everytime the image is redrawn, then you'll probably want to create a new temporary image with the operations applied, and then use that as source.

Hah, it's quite difficult to choose, because different image operations have different duration: from milliseconds to seconds. E.g. denoising is very expensive operation.

then you'll probably want to create a new temporary image with the operations applied, and then use that as source.

Create on disk and then reload from disk?

You can look into image_processing sources: i use for it Doxygen.

I'm familiar with doxygen, but again, I'm not convinced it helps much here besides blowing up the size of the source files.

About SourceManager::addSource method: but in this case as fasr as i understand my temporary item will be added to QListWidgetItem. Is it ok?

Another idea could be to extend the Source struct by adding a QMap<int,QString> preprocessedImages, where you store, for each page, the preprocessed image. If such a page exists, the Displayer will render that image instead of the one of the original source. You can then give the user the option to revert to the original image, in which case the entry in preprocessedImages is simply discarded. This approach would also make it easy to apply the algorithms to all pages of a multipage document, without polluting the sources list with tons of entries.

I like this idea.

zamazan4ik · 2017-12-17T14:02:16Z

Also please check UI. Now it's simple set of buttons for performing different algorithms.

manisandro · 2017-12-17T14:13:44Z

How do you envison the UI in the releasable version? If they remain simple button, I suppose a combobox with the respective entries in the advanced image controls toolbar would work just as well.

manisandro · 2017-12-17T14:14:55Z

Also, how likely is that an algorithm will "wreck" an image? If that can happen, you'll either need a preview or a undo/redo. Preview is easier I suppose ;)

zamazan4ik · 2017-12-17T15:14:25Z

Also, how likely is that an algorithm will "wreck" an image? If that can happen, you'll either need a preview or a undo/redo. Preview is easier I suppose ;)

Of course, always there is some chance to make error in image processing algorithm :-)

This question is very important and should be discussed. I see several ways to deal with it:

Every button apply algorithm on preview. And after some steps user can press "Apply" button and image from preview will replace source image.
You way with Ctrl+Z/Ctrl+Y. I think we also MUST implement it. It's very convinient way for program. I as user want to have this feature in gIR.
After every image processing algorithm ask to user about "Apply result from preview to source image?". I don't like this way :-)

zamazan4ik · 2017-12-17T15:17:04Z

How do you envison the UI in the releasable version? If they remain simple button, I suppose a combobox with the respective entries in the advanced image controls toolbar would work just as well.

I want to clone UI from Abbyy FineReader to gIR. And in FineReader they have separate dock widget for image processing.

Image proccesing UI will not be easy buttons. We will extend it with some customizable features: e.g. user must to be able to regulate denoise power, because we can't do it automatically.

manisandro · 2017-12-17T15:18:14Z

For a start I'd just add a notification bar asking the user whether he wants to apply the change.

zamazan4ik · 2017-12-17T15:25:02Z

Do you have any other warnings/suggestions?

manisandro · 2017-12-17T15:28:11Z

Nope, with correct source logic and polished UI should be good!

zamazan4ik · 2017-12-17T15:28:58Z

Ok, nice. Thank you for the review :-)

manisandro · 2018-09-27T21:57:53Z

Hi @zamazan4ik, I've finally ended up releasing 3.3.0, so we can start again with feature work, if you are still interested in pursuing this.

manisandro · 2019-07-24T22:30:50Z

Closing since this appears to be abbandoned.

zamazan4ik added 5 commits December 17, 2017 16:35

Add image processing algorithms

3f1ee4b

[Qt] Add new buttons for image processing

f2d931d

[Qt] Add new image processor

0c1b8f6

[Qt] Change visibility of setScaledIMage method

b38632e

Fix CMakeLists.txt

5ed531d

manisandro reviewed Dec 17, 2017

View reviewed changes

zamazan4ik added 15 commits December 17, 2017 19:22

[Qt] Remove items from UI side

1eebcae

Add more files to gitignore

823af76

[Qt] Add items to source side

ccf4788

[Qt] Add auto process button

cfd657e

[Qt] Add auto image processing and more binarizations

f6c6a0d

[Qt] Add messagebox

dd907b2

[Qt] Add invertion button

4ef0b22

[Qt] Add invertion algorithms

7570916

Add blur detector

7080d38

Fix blur detection

7e62013

Add auto invertion algorithm

a4105e9

[Qt] Add more algorithms to autoprocessing function

972c055

[Qt] Extend Source struct for preprocessed image

2127fc7

[Qt] Add logic for rendering preprocessed images

adb44e1

[Qt] Fixed Image Processor for proper rendering

ceb3c6b

zamazan4ik added 4 commits December 24, 2017 03:42

[Qt] Remove Closable property for Enhance widget

fe2fcf8

[Qt] Remove AutoInvert function

a116bc9

[Qt] Fixed other image processing algorithms

58a706a

[Qt] Temporary disable resolution for preprocessed images

c80c701

manisandro force-pushed the master branch from f7e0f0d to 837b0a9 Compare December 29, 2017 22:16

manisandro closed this Jul 24, 2019

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fresh image processing #258

Fresh image processing #258

zamazan4ik commented Dec 17, 2017

manisandro Dec 17, 2017

zamazan4ik Dec 17, 2017 •

edited

manisandro Dec 17, 2017

manisandro Dec 17, 2017

zamazan4ik Dec 17, 2017 •

edited

zamazan4ik Dec 22, 2017

manisandro Dec 22, 2017

zamazan4ik Dec 22, 2017 •

edited

manisandro Dec 22, 2017

zamazan4ik Dec 22, 2017

zamazan4ik commented Dec 17, 2017

manisandro commented Dec 17, 2017

manisandro commented Dec 17, 2017

zamazan4ik commented Dec 17, 2017

zamazan4ik commented Dec 17, 2017

manisandro commented Dec 17, 2017

zamazan4ik commented Dec 17, 2017

manisandro commented Dec 17, 2017

zamazan4ik commented Dec 17, 2017

manisandro commented Sep 27, 2018

manisandro commented Jul 24, 2019

Fresh image processing #258

Fresh image processing #258

Conversation

zamazan4ik commented Dec 17, 2017

manisandro Dec 17, 2017

Choose a reason for hiding this comment

zamazan4ik Dec 17, 2017 • edited

Choose a reason for hiding this comment

manisandro Dec 17, 2017

Choose a reason for hiding this comment

manisandro Dec 17, 2017

Choose a reason for hiding this comment

zamazan4ik Dec 17, 2017 • edited

Choose a reason for hiding this comment

zamazan4ik Dec 22, 2017

Choose a reason for hiding this comment

manisandro Dec 22, 2017

Choose a reason for hiding this comment

zamazan4ik Dec 22, 2017 • edited

Choose a reason for hiding this comment

manisandro Dec 22, 2017

Choose a reason for hiding this comment

zamazan4ik Dec 22, 2017

Choose a reason for hiding this comment

zamazan4ik commented Dec 17, 2017

manisandro commented Dec 17, 2017

manisandro commented Dec 17, 2017

zamazan4ik commented Dec 17, 2017

zamazan4ik commented Dec 17, 2017

manisandro commented Dec 17, 2017

zamazan4ik commented Dec 17, 2017

manisandro commented Dec 17, 2017

zamazan4ik commented Dec 17, 2017

manisandro commented Sep 27, 2018

manisandro commented Jul 24, 2019

zamazan4ik Dec 17, 2017 •

edited

zamazan4ik Dec 17, 2017 •

edited

zamazan4ik Dec 22, 2017 •

edited