Skip to content

ppkgtmm/fake-news-detection

Repository files navigation

📰 fake news detection

automate news authenticity prediction with machine learning model

initialization

  1. install python 3.8

  2. install java 8

  3. run below to grant helper script execution

chmod +x run.sh
  1. run the following to initialize project
./run.sh init

each of the following steps uses a YAML configuration file stored in config folder

data cleaning

processed version of dataset is saved to data directory with prep suffix by default

./run.sh clean

data visualization

all visualizations are saved to visualization/outputs directory by default

./run.sh viz

model training

requires 8 GB of memory by default which is configurable at driver_memory in config/modeling.yaml file

./run.sh model

parameter tuning

  • best output model is saved to modeling/outputs directory by default
  • hyper parameter performance summary is also stored to modeling/outputs directory as a CSV file
  • tuning part requires 8 GB of memory which is configurable at driver_memory in config/modeling.yaml file
./run.sh tune

inference

  1. run below to launch model prediction api server
./run.sh api
  • api server will be running on http://localhost:8000
  • /predict endpoint processes texts and performs prediction
  • first request to /predict might be slow due to spark model deserialization
  1. open app/frontend.html file in browser

  2. type or paste text in the web page to get model prediction

About

automate fake news detection with machine learning

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published