Chinese Calligraphy Character Recognition with Intra-class Variant Clustering

The goal of this project is to recognise the 100 most commonly used Chinese characters written in calligraphy in the semi-cursive script. Compared with handwritten Chinese characters, Chinese calligraphy characters are harder to recognise, even for trained experts, because they can be written in a less restrictive way to express the author’s personality and emotions... (read more here)

Sample data from character "总":

Requirements

lear-gist-python is used to extract GIST features. Installation instructions can be found in that project's repository.

The other dependencies can be installed by running:

$ pip install -r requirements.txt

Notes

notebooks/* - Jupyter notebooks (.ipynb) used to collect and preprocess the data.
train.ipynb - Demo notebook for training the model. The data directories and model hyperparameters must be specified in config.yaml.
data/cccr/* - Training, validation and test sets.
data/common.txt - The most common Chinese characters in descending order of frequency, i.e. the first 100 lines contain the 100 most commonly used characters.
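
As a rough illustration of how the class list could be derived from data/common.txt, the sketch below reads the first 100 entries of a frequency-ordered file. It assumes one character per line and UTF-8 encoding; the function name load_top_characters is hypothetical and is not part of this repository.

```python
from pathlib import Path


def load_top_characters(path, n=100):
    """Return the first `n` non-empty lines of a frequency-ordered character file.

    Assumption: the file holds one character per line, most common first.
    """
    characters = []
    with Path(path).open(encoding="utf-8") as f:
        for line in f:
            char = line.strip()
            if char:  # skip blank lines, which carry no character
                characters.append(char)
            if len(characters) == n:
                break
    return characters
```

Under those assumptions, calling load_top_characters("data/common.txt") would return the 100 target characters in frequency order, which could then serve as the class labels.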

kahxuan/chinese-calligraphy-recognition
