Blog comment prediction

Introduction

Perform a regression task to predict the number of comments on a post after a certain period, applying it to Reddit

Demo: https://comment-reddits.streamlit.app/

Data

The BlogFeedback data can be downloaded from here

Setup

1. Clone this repository

git clone https://github.com/TranMinhDuc190103/Data_mining_finals.git

or download directly instead.

2. Install Dependencies

Create a virtual environment and install the required packages:

pip install -r requirements.txt

Run Jupyter notebooks

Training the Model: Use the following notebook to train your model.

In Models folder we provide 3 pre-trained models saved as .joblib and 3 Jupyter notebooks used to train model. You can run each notebook to get the pre-trained model or use it instead.

Crawl data from Reddit

You can self crawl some data from Reddit by running reddit-crawler.ipynb in folder crawl to crawl data from Reddit. However you need some key from Reddit app to continue.

The credentials.py contain some infomation to interact with Reddit API. Due to security concerns, we are unable to provide complete information. Please contact us for further details.

Run the app

After installing important libraries and storing your infomation about Reddit app in credentials.py, you can run the app with following command

streamlit run app_T.py

Contributing

Contributions are welcome! Please open an issue or submit a pull request for any improvements.

Contact

If you have any question, please contact us via phone or email below:

Trần Minh Đức, 0344794259, tranminhduc5_t66@hus.edu.vn

Nguyễn Mạnh Tuấn, 0349292753, nguyenmanhtuan_t66@hus.edu.vn

Lê Quốc Lâm, 0337213192, lequoclam_t66@hus.edu.vn

Lê Gia Huy, 0984588603, legiahuy_t66@hus.edu.vn

Name		Name	Last commit message	Last commit date
Latest commit History 56 Commits
BlogFeedBack_Dataset		BlogFeedBack_Dataset
Models		Models
__pycache__		__pycache__
crawl		crawl
.gitignore		.gitignore
CODE_OF_CONDUCT.md		CODE_OF_CONDUCT.md
LICENSE		LICENSE
README.md		README.md
app.py		app.py
app_T.py		app_T.py
credentials.py		credentials.py
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Blog comment prediction

Introduction

Data

Setup

1. Clone this repository

2. Install Dependencies

Run Jupyter notebooks

Run the app

Contributing

Contact

About

Releases

Packages

Contributors 3

Languages

License

TranMinhDuc190103/Data_mining_finals

Folders and files

Latest commit

History

Repository files navigation

Blog comment prediction

Introduction

Data

Setup

1. Clone this repository

2. Install Dependencies

Run Jupyter notebooks

Run the app

Contributing

Contact

About

Resources

License

Code of conduct

Stars

Watchers

Forks

Releases

Packages 0

Contributors 3

Languages

Packages