Batched OCR training? #91

jbaiter · 2016-10-12T08:42:29Z

Currently the CLSTMOCR class defined in clstmhl.h can only train on one line image at a time. As a result, optimizations such as Eigen's multi-threaded tensor operations and GPU support have little effect, since the workload for a single sample is too small for them to make a difference.

From a cursory reading of the code, I gathered that batched training is supported by the lower-level API. So my question is: what would have to be done to get batched training in the high-level CLSTMOCR (and ideally CLSTMText as well) API?
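To make the question concrete, here is a rough sketch of the kind of preprocessing that batching line images would need: lines of different widths are zero-padded to the longest one and packed into a single time-major buffer, with the true lengths kept so that padded steps can be ignored later. This is only an illustration under assumptions. `Sequence`, `PaddedBatch` and `pad_batch` are hypothetical names and are not part of clstm, and the sketch says nothing about how the lower-level API actually lays out its batches.

```cpp
#include <algorithm>
#include <cstddef>
#include <stdexcept>
#include <vector>

// One text line as a sequence of feature columns: seq[t][d] (T steps, D features).
using Sequence = std::vector<std::vector<float>>;

// Hypothetical batch container; NOT a clstm type.
struct PaddedBatch {
  std::size_t batch_size = 0;        // B: number of lines in the batch
  std::size_t max_steps = 0;         // T_max: length of the longest line
  std::size_t depth = 0;             // D: features per time step
  std::vector<float> data;           // row-major [T_max][B][D], zero-padded
  std::vector<std::size_t> lengths;  // true length of each line

  float &at(std::size_t t, std::size_t b, std::size_t d) {
    return data[(t * batch_size + b) * depth + d];
  }
  // True when step t of sample b holds real data rather than padding.
  bool valid(std::size_t t, std::size_t b) const { return t < lengths[b]; }
};

// Pack variable-length lines into one zero-padded, time-major buffer.
PaddedBatch pad_batch(const std::vector<Sequence> &lines, std::size_t depth) {
  PaddedBatch batch;
  batch.batch_size = lines.size();
  batch.depth = depth;
  for (const Sequence &line : lines) {
    batch.max_steps = std::max(batch.max_steps, line.size());
    batch.lengths.push_back(line.size());
  }
  batch.data.assign(batch.max_steps * batch.batch_size * depth, 0.0f);
  for (std::size_t b = 0; b < lines.size(); ++b) {
    for (std::size_t t = 0; t < lines[b].size(); ++t) {
      if (lines[b][t].size() != depth)
        throw std::invalid_argument("feature column has wrong depth");
      for (std::size_t d = 0; d < depth; ++d)
        batch.at(t, b, d) = lines[b][t][d];
    }
  }
  return batch;
}
```

With a layout like this, each time step becomes one B x D slice instead of a single D-vector, which is what would give the multi-threaded or GPU code enough work per step. The `valid` mask would presumably be needed so that padded steps do not contribute to the loss.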


tmbdev · 2016-10-12T16:42:34Z

There are a bunch of different reasons. The code was ported from Python, where batching wouldn't have helped with speed, so it was easiest and safest to leave it as is. In addition, for other networks, batching tends to result in higher test set error rates, so there wasn't much motivation to add it (it seems to have no effect either way on error rates for LSTMs). Eigen Matrix also didn't support GPU, so there wasn't much motivation for that anyway.

Now that the code uses Eigen Tensor, batching would make more sense. But Eigen turns out not to be such a convenient framework for multicore or GPU computations anyway. In addition, LSTMs on GPUs are probably best implemented using fused kernels implementing multiple time steps at a time.

So, the upshot is that I've started working on a separate project for OCR similar to clstm, but with a focus on parallelization and GPU support, including support for batching.

wanghaisheng · 2016-10-14T11:49:02Z

I have been suffering from the slow speed of clstm training. How could the community contribute to the new project?

tmbdev · 2016-10-27T21:39:57Z

Give me a few weeks; I just moved from Google to NVIDIA. GPU support will be much better now :-)

amitdo · 2016-10-27T23:39:33Z

Tom,
Good luck with the new job!

tmbdev closed this as completed Oct 27, 2016

amitdo mentioned this issue May 4, 2017

Training kraken and RTL support? mittagessen/kraken#36

Closed
