Using Generative Adversarial Network techniques to upscale images for Automatic License Plate Recognition

Assignment for the discipline of Deep Learning from PPGI/UFES 2021/01.

Requirements

Google Collab or JupyterNotebook (Collab prefered for GPU training)
Google Drive

Modifications to the Original Code

SRGAN: BaseImage and code were modified in order to observe the training (by a image of a plate). Tensorboard were included in the code.
DLSS: identified and reported bugs/issues. Code improvement for clearer results.
Contribuitions and bug fixes (see here)
ESRGAN: Training were modified to train with same conditions of SRGAN (without image augmentation). Result were modified to try the resolution enhancement with original image.
Other Algorithms: Algorithm and requirements were modified to run with newest versions of tensorflow.

Initializing the project

Put the folder 'FOLDER' in your Google Drive

1)from google.colab import drive
drive.mount('/content/drive/', force_remount=True) # Mount Google Drive folders.

2)cd /content/drive/MyDrive/FOLDER # Install requirements
  !pip install -r requirements.txt
# Run every time you enter Colab.

SRGAN

!python3 /content/drive/MyDrive/FOLDER/train.py -a srgan --gpu 0 --gan-epochs 200 --psnr-epoch 200 /content/drive/MyDrive/FOLDER/DatasetFinalPb: Training with 200 epochs
In train.py base_image = transforms.ToTensor()(Image.open(os.path.join("assets", "baseimage.png"))): Change this line to the image that you want to upscale
!python3 /content/drive/MyDrive/FOLDER/test_image.py -a srgan --gpu 0 --lr /content/drive/MyDrive/FOLDER/PLACADETESTE.png --model-path/content/drive/MyDrive/PASTA/weights/GAN-best.pth: Testing using the last/best GAN values.

Dependencies

requirements.txt

opencv-python>=4.5.2.52
torchvision>=0.9.1+cu111
Pillow>=8.2.0
numpy>=1.19.5
torch>=1.8.1+cu111
tqdm>=4.60.0
scipy>=1.6.3
prettytable>=2.1.0
thop>=0.0.31.post2005241907
setuptools>=56.2.0
tensorboardX>=2.2
lpips>=0.1.3
albumentations
easyocr
pytesseract
imutils

Dependencies for ALPN and OCR

tensorboardX
albumentations
easyocr
pytesseract
imutils
torch==1.8.1+cpu torchvision==0.9.1+cpu torchaudio===0.8.1 -f https://download.pytorch.org/whl/torch_stable.html

DLSS

In Load Model and Analyze Results.ipynb, change model_path (Path to saved .h5 model), dataset_path (Path to folder containing images to super sample), save_path (Folder where you want to save to model as well as generated samples)
Run Load Model and Analyze Results.ipynb (Collab prefered for GPU training)

ESRGAN

Training: In Training.ipynb, change the main folder of ESRGAN algorithm (cd /content/drive/MyDrive/License-super-resolution/) and the PATHTRAIN / PATHTEST (Path to folder containing images to train algorithm). Use only images of 192 x 96 for training. Change the number of epochs (epochs=1).
Generating High Resolution Image: change the main folder of ESRGAN algorithm (cd /content/drive/MyDrive/License-super-resolution/) and the DATA_PATH (Path to folder containing images to super sample). Change the model.load_weights Choose to the desired weights. Choose between original (original image) ordownSample (downsampled image) to run the plate enhancement.

Other Algorithms (SRFEAT, EDSR, ERCA, ...)

Collab with GPU is required in these algorithms. Use KerasImageSuperResolution.ipynb to run the algorithm.

Generating High Resolution images: change the main folder of algorithms (cd /content/drive/MyDrive/SuperResolution/Keras-Image-Super-Resolution/)
- !python demo.py --arc=esrgan --lr_dir=/content/drive/MyDrive/UFPRCROPPEDPB/ --ext=.png --save_dir=/content/drive/MyDrive/SuperResolution/dataset/UFPROUTPUT/ESRGAN --model_path=/content/drive/MyDrive/SuperResolution/Keras-Image-Super-Resolution/exp/esrgan-gan-06-11-14:35/gan-cp-01.h5 --cuda=0
- Change arc with the choosen algorithm (SRFEAT, EDSR, ERCA, ESRGAN, SRGAN).
- Change lr_dir with folder containing images to super sample.
- Change ext with the extension of images (png, jpg, ...).
- Change save_dir with the output directory.
- Change model_path with the .h5 model (inside checkpoints)

Utilities

Labelbox_Processing: a tool to receive data from labelbox and crop images of datasets.
Processing_Resize: a tool written in Processing to resize and pre-process images.

REFERENCES

Name		Name	Last commit message	Last commit date
Latest commit History 50 Commits
Comparative		Comparative
DLSS		DLSS
DataSet_192x96_Pb		DataSet_192x96_Pb
DataSet_GrayScale		DataSet_GrayScale
Dataset_384x384		Dataset_384x384
Dataset_IARA_Cropped		Dataset_IARA_Cropped
Dataset_IARA_Cropped_Pb		Dataset_IARA_Cropped_Pb
Dataset_Original		Dataset_Original
ESRGAN		ESRGAN
Labelbox_Processing		Labelbox_Processing
OCR		OCR
OCR_NewVersion		OCR_NewVersion
Other_Algorithms		Other_Algorithms
Processing_Resize		Processing_Resize
Report		Report
SRGAN		SRGAN
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Using Generative Adversarial Network techniques to upscale images for Automatic License Plate Recognition

Requirements

Modifications to the Original Code

Initializing the project

SRGAN

Dependencies

Dependencies for ALPN and OCR

DLSS

ESRGAN

Other Algorithms (SRFEAT, EDSR, ERCA, ...)

Utilities

REFERENCES

About

Releases

Packages

Contributors 2

Languages

PhelaPoscam/DeepLearning-2021-01

Folders and files

Latest commit

History

Repository files navigation

Using Generative Adversarial Network techniques to upscale images for Automatic License Plate Recognition

Requirements

Modifications to the Original Code

Initializing the project

SRGAN

Dependencies

Dependencies for ALPN and OCR

DLSS

ESRGAN

Other Algorithms (SRFEAT, EDSR, ERCA, ...)

Utilities

REFERENCES

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Languages

Packages