- In this project, I implemented a Naive Bayes text classifier from scratch to categorize Persian news titles. The categories are:
  - International
  - Sport
  - Political
  - Cultural-artistic
  - Social
  - Scientific-medical
  - Economic
  - Social media
  - Web browsing
  - Video & audio
- All characters except `آ-ی` and `\s` have been removed; numbers have been replaced by `N`.
- Zero probabilities for words unseen in a class have been handled by Laplace smoothing.
- The `DataPreprocessor` and `NaiveBayesClassifier` classes have been designed for preprocessing the samples and for training/evaluating a Naive Bayes classifier, respectively.
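
The preprocessing and smoothing steps described above can be sketched roughly as follows. This is a minimal illustration, not the repository's actual code: the names `preprocess` and `NaiveBayesSketch`, the `alpha` parameter, the choice to skip out-of-vocabulary words at prediction time, and the toy titles are all assumptions made for the example. The sketch also assumes digits are mapped to `N` after other characters are stripped, so that Latin letters in the input cannot be mistaken for the placeholder.

```python
import math
import re
from collections import Counter, defaultdict


def preprocess(title: str) -> list[str]:
    """Hypothetical stand-in for the README's DataPreprocessor."""
    # Drop everything except the range آ-ی (U+0622..U+06CC), whitespace, and digits.
    title = re.sub(r"[^\u0622-\u06CC\s0-9\u06F0-\u06F9]", " ", title)
    # Replace each run of ASCII, Persian, or Arabic-Indic digits with the token N.
    title = re.sub(r"[0-9\u06F0-\u06F9\u0660-\u0669]+", " N ", title)
    return title.split()


class NaiveBayesSketch:
    """Multinomial Naive Bayes with Laplace smoothing (illustrative only)."""

    def __init__(self, alpha: float = 1.0):
        self.alpha = alpha  # smoothing constant; 1.0 is classic add-one smoothing

    def fit(self, titles, labels):
        self.class_counts = Counter(labels)
        self.word_counts = defaultdict(Counter)
        for title, label in zip(titles, labels):
            self.word_counts[label].update(preprocess(title))
        self.vocab = {w for counts in self.word_counts.values() for w in counts}
        self.total_words = {
            c: sum(self.word_counts[c].values()) for c in self.class_counts
        }
        return self

    def predict(self, title: str) -> str:
        n_docs = sum(self.class_counts.values())
        vocab_size = len(self.vocab)
        best_label, best_score = None, -math.inf
        for label, n_label in self.class_counts.items():
            # Work in log space to avoid underflow when multiplying probabilities.
            score = math.log(n_label / n_docs)
            denom = self.total_words[label] + self.alpha * vocab_size
            for word in preprocess(title):
                if word not in self.vocab:
                    continue  # assumption: ignore words never seen in training
                # Smoothed likelihood: a zero count still yields a nonzero probability.
                score += math.log((self.word_counts[label][word] + self.alpha) / denom)
            if score > best_score:
                best_label, best_score = label, score
        return best_label


# Toy data, made up for illustration only.
toy_titles = ["تیم ملی فوتبال برد", "قیمت دلار بالا رفت", "فوتبال لیگ برتر", "بازار بورس و دلار"]
toy_labels = ["Sport", "Economic", "Sport", "Economic"]
model = NaiveBayesSketch().fit(toy_titles, toy_labels)
print(model.predict("نتیجه فوتبال"))
```

Scoring in log space keeps the product of many small word probabilities from underflowing, and the `alpha * vocab_size` term in the denominator keeps the smoothed probabilities for each class summing to one.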
Mohammad8921/NewsTitlesCategorization-NaiveBayesClassifier
A Naive Bayes classifier to categorize titles of Persian news