Skip to content

b0w1d/DH_Budapest_2018

 
 

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

5 Commits
 
 
 
 

Repository files navigation

Natural Language Processing Real-Time System for Central European News

Authors: Yun-Cheng Tsai, Pu Yu, Jia-Ching Wu, Ji-Yu Huang

Internet as a medium for information transmission by the media has become one of the main sources of people's knowledge absorption. People always get a preliminary understanding of a topic from the online media. In Central Europe, an incident or a certain issue affects more than one country, possibly multiple countries. Due to the different political and national conditions in various countries, the coverage of the terms and expressions described in the articles of various countries and regions in the news media will present a description and reflection of a certain issue. However, current online information become volume, velocity and variety to require specific technology and analytical methods for its transformation into value. In order to expand to other complementary characteristics of big data in social sciences and humanities (SSH) area.

We propose a natural language processing real-time system to analyze text information from online news websites automatically. The system uses the existing text analytics resources that are defined for collections. Our purpose is to perform analytics on any text documents. Users can immediately check the analysis visualization results. Our system has the following main functions for SSH researchers.

  1. Collect information from online news websites or offline corpus from various countries and analyze the opinions on specific topics.

  2. Researchers can use Named Entity Recognition (NER) classification of groups manually to define the information retrieval features, and use text patterns for sentence interpretation emotional markers.

  3. The terms and phrases described in their articles will present descriptions and reflections on a topic.

  4. Provide a platform that can instantly find and analyze a large number of texts.

The system will help SSH researchers have over view from their documents. Then they can get important information from text quicker than without using any technical supporting.

NER, NLP, Real-Time System, Sentiment Analysis, Text Mining, Data Visualization, NLTK Tree, Information Retrieval.

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • C++ 78.0%
  • Python 22.0%