I have learnt how various machine learning methods or algorithms work as well as their strengths and weaknesses. The coursework involves selecting a given topic/task on Kaggle and then implementing the best possible machine learning model for the selected task.
- Implemented 8 different machine learning models to perform leaf classification -
Naive Bayes Classifier
,Support Vector Machine
,K-Nearest Neighbours
,Logistic Regression
,Linear Discriminant Analysis
,Decision Tree
,Random Forest
andMultilayer Perceptron
- Analyzed the performance of the above machine learnings algorithms for leaf classification
- MLP produces the best prediction results (i.e. the highest accuracy and the lowest log loss value)
- Submitted the prediction results to Kaggle
- Placed among top 15% on the public leaderboard according to the final evaluation scores on Kaggle.
For the project demonstration video :
https://www.youtube.com/watch?v=pQM_NAij_qY&feature=youtu.be
Disclaimer : The python code used to implement the coursework project is no longer maintained. There may be errors or bugs that did not exist at the time of creation.
Follow the instructions below to run the source code.
Need to install using pip the following libraries:
- numpy
- pandas
- seaborn
- matplotlib
- opencv-python
- scikit-learn
- Type the command "python main.py" to run the program.
- Once the program runs, follow the instructions to run a specific Machine Learning Method
- To replicate the results submitted, select MLP to reproduce the submission file.
-
Overview of machine learning and its applications
-
Decision Theory and Bayes Models
-
Classifier Evaluation
-
Classification: Decision trees, Artificial neural networks, linear and kernelized Support vector machines, K-nearest neighbour classifiers
-
Linear regression and its kernelized extension
-
Ensemble Learning
-
Clustering
-
Dimension Reduction
-
Density Estimation