Task for offline handwriting #18

TomJasuroae · 2015-06-14T00:49:24Z

@tmbdev I have a question about is this clstm suitable for offline English handwriting. I am newer to this area. I have some questions for discussion.

Feature extraction. Deep learning, such as CNN, which can extract feature map from raw images. I don't know what is the feature extraction method in clstm. Can I replace this feature input part by Deep learning? For example, sliding window from a handwritten text line, from left to right to get a sequence feature.
In clstm examples, there are seems only 1 layer lstm. Is there other examples to show more complex network structures? Such as more layers.
What's your opinion for solving offline English handwriting problem? Thanks.

It seems that clstm network will mapping this character to label. So it's better to do some norm operation of this handwriting words? So it has better segmentation result for easy recognition.

tmbdev · 2015-06-14T16:34:06Z

(1) There is no feature extraction. There has been a lot of working on using DNNs as preprocessing for LSTMs. For OCR, we haven't observed any improvements. For other applications, you have to try. Some people swear by it. There is text line normalization for OCR, and that turns out to be very important.

(2) The "bidi2" network gives you a two layer network. Look in clstm_prefab.cc to see how to construct more complex networks. You can essentially just write down the network structure as nested calls to "layer(...)". Again, for OCR, deeper layers don't help.

(3) LSTMs work fine for offline handwriting (that was one of the first applications). Most applications so far have used 2D LSTMs, which aren't implemented in CLSTM yet. Based on limited experience, I would guess that 1D LSTMs may work better than 2D LSTMs with proper preprocessing.

TomJasuroae · 2015-06-14T23:33:59Z

Thanks, @tmbdev , I have already finished some experiments, such as more complex network and normalization. The result is indeed as your said, simple and good norm dataset has better performance. Would you like to share your experience on text line normalization? For example, methods, papers? Thanks.

Epsirom mentioned this issue Jul 21, 2015

How to get confidence values? #24

Closed

TomJasuroae closed this as completed Aug 2, 2015

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Task for offline handwriting #18

Task for offline handwriting #18

TomJasuroae commented Jun 14, 2015

tmbdev commented Jun 14, 2015

TomJasuroae commented Jun 14, 2015

Task for offline handwriting #18

Task for offline handwriting #18

Comments

TomJasuroae commented Jun 14, 2015

tmbdev commented Jun 14, 2015

TomJasuroae commented Jun 14, 2015