Task: Build a model that classifies scientific papers into categories such as statistics, computer science, and physics, based on their abstracts.
This repository contains standard implementations of three popular machine learning algorithms:
- Logistic Regression
- Gaussian Naive Bayes (Gaussian NB)
- Linear SVC
The main idea of this project is to test whether a collaborative decision-making approach (a voting algorithm), which combines the votes of the three standard algorithms, can outperform each of those algorithms on its own.
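As a rough illustration of the voting idea, here is a minimal sketch of a majority vote over per-model predictions. It is not the repository's actual code: the function names, the sample predictions, and the tie-breaking rule (fall back to the first model's vote) are all assumptions made for the example.

```python
from collections import Counter


def majority_vote(votes):
    """Return the label that most models predicted for one abstract."""
    # Counter.most_common keeps first-seen order among equal counts, so a
    # three-way tie falls back to the first model's vote (an assumed rule).
    return Counter(votes).most_common(1)[0][0]


def vote_predictions(*model_predictions):
    """Combine per-model prediction lists into one list of voted labels."""
    # zip(*...) groups the predictions made for the same abstract together.
    return [majority_vote(votes) for votes in zip(*model_predictions)]


# Hypothetical predictions from the three models for four abstracts.
logreg_preds = ["physics", "statistics", "computer science", "physics"]
nb_preds = ["physics", "computer science", "computer science", "statistics"]
svc_preds = ["statistics", "statistics", "physics", "computer science"]

combined = vote_predictions(logreg_preds, nb_preds, svc_preds)
print(combined)
```

The last abstract gets three different votes, so the assumed tie-break picks the first model's label. scikit-learn offers the same kind of combination through `VotingClassifier` with `voting="hard"`, which may be more convenient if the three models are already scikit-learn estimators.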
Result: Feel free to clone the repository and check it yourself. :P
Update: I did this as an academic project during my bachelor's degree, and I now realize that the idea is very similar to one of the simplest ensemble learning techniques.
Data source: Kaggle