Skip to content

Random Forest in Python to classify characters in Google street view images.

Notifications You must be signed in to change notification settings

deepak-karkala/char-classifer-rf-python

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

4 Commits
 
 
 
 

Repository files navigation

Character classification using Random Forest Classifier in Python

This works aims to classify characters from Google Street View images. It involves extracting the data to desired format from the dataset, create training and test data sets, use supervised learning to build Random Forest Classifier, predicting the test data using the model.

The libraries numpy, pandas were used for processing data efficiently in Python. Scikit-learn is used to build a Random Forest Classifier. The working environment included Jupyter Notebook and Anaconda.

The model achieved a prediction accuracy of 44.5% on the test images. For reference, the benchmarks (using Julia) for Random Forest and kNN were 42.93% and 40.57% respectively. The maximum score at the time of this writing was 85.43%. So evidently there is a lot of scope of improvement. But this was expected because of the following reasons. The parameters for the Random Forest Classifier was not tuned and optimised. Instead a default set of parameters were used. Further it is well known that for tasks such as image classification, methods such as Neural network usually outperforms Random Forest Classifiers.

The future work should therefore try to optimise the Random Forest Classifier and consider using deep learning to classify the images.

About

Random Forest in Python to classify characters in Google street view images.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published