Pytorch-yolo-phoc

PyTorch implementation of the ECCV 2018 paper "Single Shot Scene Text Retrieval". Paper: https://arxiv.org/abs/1808.09044

This code is based on the YOLOv2 implementation from https://github.com/marvis/pytorch-yolo2, modified accordingly.

All paths are hardcoded and need to be edited to match your environment.

Edit the cfg/XXXX.data file to match your training objective:

train  = path_to_file_with_list_of_files_to_train.txt
names = data/recognition.names
backup = backup
gpus  = 0
num_workers = 10
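
To make the key = value layout of the .data file concrete, here is a minimal sketch of how such a file could be read into a dictionary. The function name `parse_data_cfg` is hypothetical and is not taken from this repository; the repository's own loader may behave differently (for example in how it treats comments or types). All values are kept as strings.

```python
def parse_data_cfg(text):
    """Parse 'key = value' lines into a dict of strings.

    Hypothetical helper for illustration only; not the repository's loader.
    Blank lines, lines starting with '#', and lines without '=' are skipped.
    """
    options = {}
    for raw_line in text.splitlines():
        line = raw_line.strip()
        if not line or line.startswith("#") or "=" not in line:
            continue
        # Split on the first '=' only, so values may themselves contain '='.
        key, value = line.split("=", 1)
        options[key.strip()] = value.strip()
    return options


# The example settings shown above, used here as sample input.
EXAMPLE_DATA_CFG = """
train  = path_to_file_with_list_of_files_to_train.txt
names = data/recognition.names
backup = backup
gpus  = 0
num_workers = 10
"""

if __name__ == "__main__":
    for key, value in parse_data_cfg(EXAMPLE_DATA_CFG).items():
        print(key, "->", value)
```

Numeric settings such as num_workers would need an explicit `int(...)` conversion before use, since this sketch does not guess types.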

The file cfg/XXXX.cfg contains the config parameters for training.

The training images must be specified through a folder or a file listing them.
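
One way to produce a list file for the `train` entry is sketched below. It assumes the list file holds one image path per line, which is the usual Darknet-style convention but is not confirmed by this README; the function name `write_image_list` and the set of accepted extensions are likewise assumptions made for illustration.

```python
import os

# Assumed set of image extensions; adjust to match your dataset.
IMAGE_EXTENSIONS = (".jpg", ".jpeg", ".png")


def write_image_list(image_dir, list_path):
    """Write the absolute path of every image under image_dir to list_path.

    Hypothetical helper: assumes one path per line is the expected format.
    Returns the sorted list of paths that were written.
    """
    paths = []
    for root, _dirs, files in os.walk(image_dir):
        for name in files:
            if name.lower().endswith(IMAGE_EXTENSIONS):
                paths.append(os.path.abspath(os.path.join(root, name)))
    paths.sort()
    with open(list_path, "w") as handle:
        for path in paths:
            handle.write(path + "\n")
    return paths
```

For example, `write_image_list("images/train", "train_list.txt")` would create train_list.txt, whose path can then be set as the `train` value in the .data file. Both paths in this call are placeholders.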

Download the ImageNet pre-trained weights for the convolutional layers:

wget http://pjreddie.com/media/files/darknet19_448.conv.23

Modify the options in the train.py file, then start training:

python train.py

The model has been trained, achieving the following results:
