Real-time Scene Text Detection with Differentiable Binarization

note: some code is inherited from MhLiao/DB

Install

conda create --name DBNet.pytorch -y
conda activate DBNet.pytorch

conda install ipython pip

# python dependencies
pip install -r requirement.txt

# install PyTorch with cuda-10.1
conda install pytorch torchvision cudatoolkit=10.1 -c pytorch

# clone repo
git clone https://github.com/WenmuZhou/DBNet.pytorch.git
cd DBNet.pytorch/

# build deformable_conv from torchvision >=0.5
git clone https://github.com/pytorch/vision.git
cd vision
python3 setup.py install

Requirements

pytorch 1.2+
torchvision 0.5+
gcc 4.9+

Download

TBD

Data Preparation

train: prepare a text in the following format, use '\t' as a separator

/path/to/img.jpg path/to/label.txt
...

val: use a folder

img/ store img
gt/ store gt file

Train

config the dataset['train']['dataset'['data_path']',dataset['validate']['dataset'['data_path']in config/icdar2015_resnet18_fpn_DBhead_polyLR.yaml
single gpu train

bash singlel_gpu_train.sh

Multi-gpu training

bash multi_gpu_train.sh

Test

eval.py is used to test model on test dataset

config model_path in eval.sh
use following script to test

bash eval.sh

Predict

predict.py is used to inference on single image

config model_path, img_path, in predict.py
use following script to predict

python3 predict.py

The project is still under development.

Performance

ICDAR 2015

only train on ICDAR2015 dataset

Method	image size (short size)	learning rate	Precision (%)	Recall (%)	F-measure (%)	FPS
Defrom-ResNet-18(paper)	736	0.007	86.8	78.4	82.3	48
Resnet18-FPN-DBHead	736	1e-3	87.03	75.06	80.6	43
Resnet50-FPN-DBHead	736	1e-3	88.06	77.14	82.24	27

examples

TBD

todo

mutil gpu training

reference

If this repository helps you，please star it. Thanks.

Name		Name	Last commit message	Last commit date
Latest commit History 56 Commits
base		base
config		config
data_loader		data_loader
imgs/paper		imgs/paper
models		models
post_processing		post_processing
tools		tools
trainer		trainer
utils		utils
.gitattributes		.gitattributes
.gitignore		.gitignore
LICENSE.md		LICENSE.md
README.MD		README.MD
eval.sh		eval.sh
multi_gpu_train.sh		multi_gpu_train.sh
requirement.txt		requirement.txt
singlel_gpu_train.sh		singlel_gpu_train.sh

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Real-time Scene Text Detection with Differentiable Binarization

Install

Requirements

Download

Data Preparation

Train

Test

Predict

Performance

ICDAR 2015

examples

todo

reference

About

Releases

Packages

Languages

License

dun933/DBNet.pytorch

Folders and files

Latest commit

History

Repository files navigation

Real-time Scene Text Detection with Differentiable Binarization

Install

Requirements

Download

Data Preparation

Train

Test

Predict

Performance

ICDAR 2015

examples

todo

reference

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages