A machine learning project that focuses on categorizing Chinese news articles.

andyrochi/Categorizing-News-Articles

Categorizing Chinese News Articles from the Web

Abstract

In this project, we explore the machine learning pipeline and apply three methods (Naive Bayes, Logistic Regression, and Neural Networks) to a single task: given the title and content of a Chinese news article from the web, assign it to one of the following categories:

  1. 科技 (Technology)
  2. 產經 (Business and Economy)
  3. 娛樂 (Entertainment)
  4. 運動 (Sports)
  5. 社會 (Society)
  6. 政治 (Politics)

The news articles are obtained from CNA. We show that, after proper data preprocessing, each of the three models achieves an accuracy of at least 93%. See the report for more details.
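
To make the task concrete, here is a minimal sketch of the Naive Bayes variant in plain Python. It is not the code from this repository: the class name `NaiveBayes`, the `char_bigrams` helper, and the toy headlines are all invented for illustration, and character bigrams stand in for whatever tokenization the report actually uses.

```python
import math
from collections import Counter, defaultdict


def char_bigrams(text):
    """Split text into overlapping character bigrams.

    Assumption: this is a simple stand-in for real Chinese word segmentation.
    """
    chars = [c for c in text if not c.isspace()]
    return ["".join(chars[i:i + 2]) for i in range(len(chars) - 1)]


class NaiveBayes:
    """Hypothetical multinomial Naive Bayes with add-alpha smoothing."""

    def __init__(self, alpha=1.0):
        self.alpha = alpha

    def fit(self, docs, labels):
        self.class_counts = Counter(labels)          # documents per category
        self.feature_counts = defaultdict(Counter)   # bigram counts per category
        self.vocab = set()
        for doc, label in zip(docs, labels):
            feats = char_bigrams(doc)
            self.feature_counts[label].update(feats)
            self.vocab.update(feats)
        self.totals = {c: sum(self.feature_counts[c].values())
                       for c in self.class_counts}
        return self

    def predict(self, doc):
        n_docs = sum(self.class_counts.values())
        vocab_size = len(self.vocab)
        best, best_score = None, -math.inf
        for c in self.class_counts:
            # log prior + sum of smoothed log likelihoods
            score = math.log(self.class_counts[c] / n_docs)
            denom = self.totals[c] + self.alpha * vocab_size
            for f in char_bigrams(doc):
                if f in self.vocab:  # ignore bigrams never seen in training
                    score += math.log(
                        (self.feature_counts[c][f] + self.alpha) / denom)
            if score > best_score:
                best, best_score = c, score
        return best


# Toy training data, invented for this sketch (two headlines per category).
train = [
    ("新款手機晶片發表", "科技"), ("人工智慧晶片技術突破", "科技"),
    ("股市上漲 企業營收成長", "產經"), ("央行升息 影響股市", "產經"),
    ("歌手發行新專輯", "娛樂"), ("電影票房創新高", "娛樂"),
    ("棒球隊贏得冠軍", "運動"), ("籃球比賽今晚開打", "運動"),
    ("警方破獲詐騙集團", "社會"), ("車禍造成兩人受傷", "社會"),
    ("立法院通過預算案", "政治"), ("總統發表國慶演說", "政治"),
]
model = NaiveBayes().fit([t for t, _ in train], [y for _, y in train])
print(model.predict("棒球比賽冠軍"))
```

A real pipeline would train on full article text rather than a dozen headlines, and would evaluate on held-out articles; the sketch only shows how title text maps to one of the six categories.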
