GitHub - pstwh/platerec-model: platerec-model is a model for recognizing text from images, specifically designed for license plate recognition

platerec-model

platerec-model is a model for recognizing text from images, specifically designed for license plate recognition. The project utilizes a neural network architecture with an encoder-decoder setup and uses SAM (Sharpness-Aware Minimization) for optimizing the model training process. It's really lightweight using only a mobilenet v2 for encoder and a decoder transformer (gpt) for decoder. It is used in the platerec project.

Installation

Clone the Repository:

git clone https://github.com/your-username/platerec-model.git
cd platerec-model

Install Dependencies:
```
pip install -r requirements.txt
```

Usage

Training

To train the model, use the following command:

python train.py --dataset_paths data

Parameters:

--dataset_paths: A list of directories containing the input data files. The directory should contain the images and txt files, for example: 1.jpg and a 1.txt with the plate text, like: AWD1E33 in plain text.

Example:

├── 1.jpg
├── 1.txt
├── 2.jpg
├── 2.txt
├── 3.jpg
├── 3.txt
├── 4.jpg
└── 4.txt

--model_checkpoint: Path to a pretrained model (.pth file) if you have.
--device: The device to use for training (cuda or cpu). Defaults to cuda if available.
--num_epochs: Number of epochs for training. Default is 10.

Inference

To perform inference with the trained model, use the following command:

python inference.py --model_path artifacts/trained_model.pth --image_path lp_cropped.jpg

Parameters:

--model_path: Path to the trained model checkpoint (.pth file).
--image_path: Path to the image file for which text recognition is to be performed.

Model Architecture

The platerec-model employs an encoder-decoder architecture with cross-attention mechanisms. The key components are:

Encoder: Based on mobilenet_v2 for feature extraction from images.
Decoder: Utilizes an embedding layer, position encoding, and multiple decoder blocks with self-attention and cross-attention layers.
Loss Function: Uses cross_entropy loss, with special handling for a specific index (ignore_index=39).

Name		Name	Last commit message	Last commit date
Latest commit History 1 Commit
.gitignore		.gitignore
README.md		README.md
config.py		config.py
dataset.py		dataset.py
inference.py		inference.py
model.py		model.py
requirements.txt		requirements.txt
sam.py		sam.py
train.py		train.py
transforms.py		transforms.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

platerec-model

Table of Contents

Installation

Usage

Training

Inference

Model Architecture

About

Releases

Packages

Languages

pstwh/platerec-model

Folders and files

Latest commit

History

Repository files navigation

platerec-model

Table of Contents

Installation

Usage

Training

Inference

Model Architecture

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages