news-popularity-model

Contents of the Project:

1) Integrated scraped data and ML model to predict virality (Main Project):

This part uses a regression model that is trained on the dataset OnlineNewsPopularityClassification.csv and then checks the virality of the scraped data. Virality is assessed from various pieces of information scraped from the Times of India website.

The file includes these data for evaluation:

A model is then built using sentiment analysis and by weighing the relevant words against those found in popular news.

For this, data such as the number of tokens and the number of shares are taken from the respective website.
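As a rough illustration of how token counts could be derived from a scraped article, here is a minimal sketch. The function name `extract_features`, the feature names, and the regex-based tokenizer are all assumptions made for this example; the project's own feature extraction may differ.

```python
import re


def extract_features(title: str, body: str) -> dict:
    """Compute a few simple text features for one article (illustrative only)."""
    # Split on runs of word characters; a crude stand-in for a real tokenizer.
    title_tokens = re.findall(r"\w+", title.lower())
    body_tokens = re.findall(r"\w+", body.lower())
    n_body = len(body_tokens)
    return {
        "n_tokens_title": len(title_tokens),
        "n_tokens_content": n_body,
        # Share of distinct words in the body; 0.0 for an empty body.
        "n_unique_tokens": len(set(body_tokens)) / n_body if n_body else 0.0,
    }


# Made-up article text, used only to exercise the function.
features = extract_features(
    "Monsoon arrives early",
    "The monsoon arrived early this year. The rain was heavy.",
)
print(features)
```

The number of shares cannot be computed from the text in this way; it would have to be read from the page itself.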

Finally, a virality (popularity) score is given:

The score lies between 0 and 1 (0 corresponds to news that is not popular and 1 to popular news).
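One way to read such a score is sketched below. It assumes that a raw regression output may fall slightly outside the range and is clipped into it, and that 0.5 is the cut-off between the two classes. Both the clipping and the threshold are assumptions for illustration, not confirmed details of the project.

```python
def to_popularity_score(raw_prediction: float) -> float:
    """Clip a raw model output into the [0, 1] range (assumed post-processing)."""
    return max(0.0, min(1.0, raw_prediction))


def describe(score: float, threshold: float = 0.5) -> str:
    """Turn a score into a label; the 0.5 threshold is a hypothetical choice."""
    return "popular" if score >= threshold else "not popular"


# Made-up raw outputs, including values outside [0, 1].
for raw in (-0.2, 0.35, 0.8, 1.4):
    score = to_popularity_score(raw)
    print(raw, score, describe(score))
```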

2) Classification Model:

This model includes several algorithms, such as Logistic Regression, Random Forest Classifier, and SVM, but the one actively used is RandomForestClassifier because it gave the best results.

The output shows the essentials of the RandomForestClassifier model after it has been trained and tested.

The output also shows the labels before and after standardization, along with the accuracy of this model.
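For readers unfamiliar with standardization, the sketch below shows the usual definition: subtract the mean and divide by the standard deviation, so the column ends up with zero mean and unit variance. It assumes this is what the notebook means by standardization, and it uses the population standard deviation, which is also what scikit-learn's `StandardScaler` uses. The sample values are made up.

```python
from statistics import fmean, pstdev


def standardize(values):
    """Rescale a column to zero mean and unit (population) standard deviation."""
    mean = fmean(values)
    std = pstdev(values)
    if std == 0:
        # A constant column carries no information; map it to zeros.
        return [0.0 for _ in values]
    return [(v - mean) / std for v in values]


shares = [100, 200, 300, 400, 500]  # made-up values for illustration
scaled = standardize(shares)
print("before:", shares)
print("after: ", [round(v, 3) for v in scaled])
```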

3) Regression Model:

This model uses Bayesian Linear Regression to predict popularity and reports the accuracy of the model.
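The idea behind Bayesian linear regression can be shown with a deliberately simplified case: one feature, no intercept, a Gaussian prior on the weight with precision `alpha`, and Gaussian noise with known precision `beta`. This is a sketch of the technique under those assumptions, not the notebook's implementation, and the data are made up.

```python
def posterior_weight(xs, ys, alpha=1.0, beta=1.0):
    """Posterior mean and variance of w in y = w * x + noise.

    Prior: w ~ N(0, 1/alpha). Noise: N(0, 1/beta), with beta assumed known.
    """
    # Posterior precision adds the prior precision to the data precision.
    precision = alpha + beta * sum(x * x for x in xs)
    mean = beta * sum(x * y for x, y in zip(xs, ys)) / precision
    return mean, 1.0 / precision


xs = [1.0, 2.0, 3.0, 4.0]
ys = [2.0, 4.0, 6.0, 8.0]  # made-up data with slope 2
mean, variance = posterior_weight(xs, ys)
print(mean, variance)
```

The prior pulls the estimate slightly towards zero, so the posterior mean is a little below the least-squares slope of 2; a weaker prior (smaller `alpha`) brings it closer.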

4) News Aggregator (made using Django by scraping popular websites such as Times of India):

I also built a website using Django that can later be used to show popular news from these sources on a single site.
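The scraping step behind such an aggregator could look roughly like the sketch below, which pulls headline links out of an HTML page using only the standard library. The `HeadlineParser` class and the assumption that headlines are `<a>` tags inside `<h2>` tags are hypothetical; real news sites use their own markup, and the project may use a different scraping library. The sample HTML is made up so the example runs without network access.

```python
from html.parser import HTMLParser


class HeadlineParser(HTMLParser):
    """Collect (text, href) pairs for links found inside <h2> tags."""

    def __init__(self):
        super().__init__()
        self.in_h2 = False
        self.current_href = None
        self.text_parts = []
        self.headlines = []

    def handle_starttag(self, tag, attrs):
        if tag == "h2":
            self.in_h2 = True
        elif tag == "a" and self.in_h2:
            # Links without an href are ignored (current_href stays None).
            self.current_href = dict(attrs).get("href")
            self.text_parts = []

    def handle_data(self, data):
        if self.current_href is not None:
            self.text_parts.append(data)

    def handle_endtag(self, tag):
        if tag == "a" and self.current_href is not None:
            text = "".join(self.text_parts).strip()
            if text:
                self.headlines.append((text, self.current_href))
            self.current_href = None
        elif tag == "h2":
            self.in_h2 = False


# Made-up page fragment standing in for a downloaded news page.
SAMPLE_HTML = """
<h2><a href="/city/story-1">Monsoon arrives early</a></h2>
<p><a href="/about">About us</a></p>
<h2><a href="/sports/story-2">Team wins final</a></h2>
"""

parser = HeadlineParser()
parser.feed(SAMPLE_HTML)
print(parser.headlines)
```

The link in the `<p>` tag is skipped because it is not inside an `<h2>`, which is how the sketch separates headlines from navigation links.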

Web Scraping of Times of India:

Web Scraping of Hindustan Times:

Web Scraping of The Economist:

Future of the Project:

A more complete integration of the model and NewsAggregator, which could predict the virality of news and display the link to the source on my website as soon as possible.

License

The project is available as open source under the terms of the MIT License.

