Skip to content

A repository containing different text classification models, applied to Lebanese news articles.

Notifications You must be signed in to change notification settings

moussaKam/arabic_news_classification

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

21 Commits
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Lebanese News Classification

Implementing different text classification models, strating from Logisitic regression, ending with bert based language models.

Documents used are news articles borrowed from Lebanon files (www.lebanonfiles.com).

These documents are classed in 5 categories:

  • أخبار محلية
  • أخبار اقليمية ودولية
  • أخبار اقتصادية
  • أخبار رياضة
  • أخبار فنبة

The idea is to report at the end the accuracy of each model and to examine the performance of the SOTA in text classification task over the time.

Models implemented (yet):

  • Logistic Regression + Tf-Idf features
  • Simple Neural Network model from fastText

About

A repository containing different text classification models, applied to Lebanese news articles.

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published