As we know that an AI system involves two major pillars, one is code(model + algorithm) and the other is data. In a recent interview Andrew Ng mentioned the importance of high quality data.
Through this project we would like to focus on the data aspect of an AI system.
The project aims to demonstrate the improvement in model performance using Data Quality Metrics and Data Flywheel concept.
Visit this repository to know more about our latest work on Data Qulaity Metric Hypothesis. https://github.com/changsin/FSDL
Training code can be found in training directory. Notebooks used can be found in notebooks directory. License Plate Recognition app in license_plate_recogniser directory.
To use LPR application install docker app. Clone the repository and go inside the license_plate_recogniser directory, build the docker image using command:
docker build --network host -t lpr:latest .
To start the container run.
docker run -it --network host -p 8080:8080 lpr
Now visit your browser and access the url:
localhost:8080
The dashboard is built using streamlit.
To get this app running on a cloud, we use google ecosystem.
Be aware if you are executing this step, you will be billed by Google.
TO run the app on cloud, visit license_plate_recogniser directory and execute:
make Makefile
If everything is fine on your end, you should have a service running in your Google console dashboard. We are using Google's serverless option , Google cloud Run for running this app on cloud.
We have used Yolov5 as our baseline model.
To replicate the traing process, you can visit this colab notebook:
We used Weights and Bias for experiment tracking.
Here are our validation loss curves, mean mAP and Predicition Images.
Datasets used in the project are:
- Pytorch
- Streamlit
- Docker
- Weights and Bias
- Google Cloud Run
- Completion of data flywheel loop to integrate active learning
- Replicate results of SOTA papers for UFPR dataset
- Integrate Data Qulaity Metric in the pipeline
- Include results on other License Plate Datasets