Document Classification ML

The BBC news dataset is used to train and evaluate the classification models
- Consists of 2225 documents from the BBC news website corresponding to stories in five topical areas from 2004-2005.
- Natural Classes: 5 (business, entertainment, politics, sport, tech)
- http://mlg.ucd.ie/datasets/bbc.html
- D. Greene and P. Cunningham. "Practical Solutions to the Problem of Diagonal Dominance in Kernel Document Clustering", Proc. ICML 2006.
Models trained: Naive Bayes, KNN, SVM and Random Forest
Hyperparameters tuned by grid search with 10-fold cross-validation
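
The tuning setup described above can be sketched roughly as follows. This is a minimal illustration, not the code in `main.py` or `model_config.py`: it assumes scikit-learn, TF-IDF features, and a linear SVM, and the `CANDIDATES` grids and the `tune_models` helper are hypothetical names and values chosen only for the example.

```python
from sklearn.ensemble import RandomForestClassifier
from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.model_selection import GridSearchCV
from sklearn.naive_bayes import MultinomialNB
from sklearn.neighbors import KNeighborsClassifier
from sklearn.pipeline import Pipeline
from sklearn.svm import LinearSVC

# Hypothetical candidates and parameter grids (assumed for illustration,
# not taken from this repository's model_config.py).
CANDIDATES = {
    "naive_bayes": (MultinomialNB(), {"clf__alpha": [0.1, 1.0]}),
    "knn": (KNeighborsClassifier(), {"clf__n_neighbors": [3, 5]}),
    "svm": (LinearSVC(), {"clf__C": [0.1, 1.0]}),
    "random_forest": (
        RandomForestClassifier(random_state=0),
        {"clf__n_estimators": [50, 100]},
    ),
}


def tune_models(texts, labels, cv=10):
    """Grid-search every candidate with k-fold cross-validation.

    Returns a dict mapping model name to its fitted GridSearchCV object.
    """
    results = {}
    for name, (estimator, grid) in CANDIDATES.items():
        # Vectorizer and classifier share one pipeline so that TF-IDF
        # statistics are re-fitted on the training part of each fold.
        pipeline = Pipeline([
            ("tfidf", TfidfVectorizer(stop_words="english")),
            ("clf", estimator),
        ])
        search = GridSearchCV(pipeline, grid, cv=cv, scoring="accuracy")
        search.fit(texts, labels)
        results[name] = search
    return results
```

Given a list of document strings and a parallel list of labels, `tune_models(texts, labels)` returns one fitted search per model; `best_params_` and `best_score_` on each search give the selected setting and its mean cross-validated accuracy.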

Training the model
