Skip to content

A framework to scrape, analyze and visualize trends and insights from news sources

Notifications You must be signed in to change notification settings

Ashinoko/NewsStatsticalAnalysis

Repository files navigation

NewsStatsticalAnalysis

A framework to scrape, analyze and visualize trends and insights from news sources

This project initialized as course project for university.

Install

You need to install python version +3.8.5

You can download the latest version from here

You need install the requirements

pip install -r requirements.txt

Scrapping

Running the code scraper.py will start scraping armenpress.com website for news, and stores the results in data/armenpress/

Processing

Running the code processor.py will start processing scrapped data from data/armenpress/, and creates a csv file in the root folder

Running

The main.py will access the updated-data.csv to generate graphs and insights.

python main.py

The generate_wordclouds.py will generate wordclouds for each month of a given year.

Data

You can access the scraped data here.

You can download the final processed version until (2021/04/23) here.

Screenshots

Sentiment: alt text Mentions: alt text Titles: alt text WordCloud: alt text

Future features

  • More news sources
  • More historical data
  • Interactive visualization
  • More object oriented structure
  • Better sentiment analysis
  • Other Natural Language Processing models
  • Deploying as a separate website

About

A framework to scrape, analyze and visualize trends and insights from news sources

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages