- Hiragana (ETL 8): the main alphabet. Each Hiragana has a corresponding Katakana
- Katakana (ETL 1): Katakana are used for foreign words.
- Kanji (ETL 8): Kanji are Chinese characters that were adopted in Japanese
- Source for ETL: how to unpack using Medium article
- Kuzushiji: Old Japanese Hiragana, but much more complex
- Source: Kuzushiji-MNIST
- Some general rules for input image standarization
- Gray Scale (Black and White): only 1 channel, no RGB (3 channel)
- White background (255) & Character in Black (~0-180) (refer to the normalization)
- Normalization: images should be in the range
- 0 (black) to 1 (white):
image/255.
- -1 (black) to 1 (white): either
(image/ 127.5 -1)
or-(image/ 127.5 -1)
to convert to white background & black character (Depend on the dataset)
- 0 (black) to 1 (white):
- Build a Handwritten Text Recognition System using TensorFlow
- Handwritten Text Recognition with TensorFlow
- Pytorch Repo
- Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks (6 Jan 2016). Gives a state of the art at the time about RPNs. And explains how anchor based RPNs work
-
Blog page that suggests a model structure that could prove useful to us.
-
Courses for a deep learning bootcamp that ended up on a "Build and deploy and end-to-end deep learning system".
-
Tensorflow tutorial that implements and trains a text generator. Really clear for understanding what is going on with RNNs.
-
Image annotation tool implemented by Oxford.
- Build a Handwritten Text Recognition System using TensorFlow
- Handwritten Text Recognition with TensorFlow
- Pytorch Repo
- Code gathering for everything we try to do !
-
Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks (6 Jan 2016). Gives a state of the art at the time about RPNs. And explains how anchor based RPNs work
-
Region Proposal Network -- A detailed view. Article that better explains how RPNs work and what are anchors.