Consistency_Regularization_STR

    pip install -r requirements.txt

Please convert your own dataset to LMDB format by create_dataset.py. (Borrowed from https://github.com/bgshih/crnn/blob/master/tool/create_dataset.py, provided by Baoguang Shi)

For labeled dataset, there are converted Synth90K and SynthText LMDB dataset by luyang-NWPU: [Here], password: tw3x

For unlabeled dataset, you could download raw images from imagenet, places2 and openimages, then detect and crop word images from these images.

sh run_baseline.sh

sh run_semi.sh

sh run_test.sh

download pretrained model from here(https://pan.baidu.com/s/1JF97VY0oiPiK5GpDEsYioQ?pwd=abhr, password:abhr) and put them in saved_models.

Model	Labeled data	Unlabeled data	IC13 857	IC13 1015	SVT	IIIT	IC15 1811	IC15 2077	SVTP	CUTE
TRBA_pr	10% (Synth90K+SynthText)	-	96.3	94.3	91.5	94.3	81.5	77.7	84.2	87.5
TRBA_cr	10% (Synth90K+SynthText)	1.06M unlabeled real data	97.2	95.9	94.3	96.6	86.8	79.7	89.0	93.4
TRBA_pr	Synth90K + SynthText	-	97.2	95.9	91.8	95.5	83.6	79.7	87.3	88.5
TRBA_cr	Synth90K + SynthText	10.6M unlabeled real data	98.0	96.4	96.0	97.0	88.8	84.9	90.9	95.1

This code is based on STR-Fewer-Labels by Jeonghun Baek. Thanks for your contribution.

Name		Name	Last commit message	Last commit date
Latest commit History 28 Commits
evaluation_log		evaluation_log
modules		modules
result		result
saved_models		saved_models
tensorboard		tensorboard
test_and_val		test_and_val
tools		tools
LICENSE		LICENSE
README.md		README.md
config_parser.py		config_parser.py
model.py		model.py
requirements.txt		requirements.txt
run_baseline.sh		run_baseline.sh
run_semi.sh		run_semi.sh
run_test.sh		run_test.sh
semi_train.py		semi_train.py
test.py		test.py
utils.py		utils.py

Caiyuan-Zheng/Consistency_Regularization_STR