Web application containing dashboards for Kaggle activities
Web app to view dashboards demonstrating Kaggle activities. Data for dashboards is collected automatically using Kaggle API.
The repository has the following structure:
- app
| - templates
| |- master.html # main page of web app
| - static
| |- githublogo.png # github logo used in the main page
|- run.py # Flask file that runs app
- data
|- process_data.py # script containing ETL for the initial dataset processing
- screenshots
|- main_page.png # screenshot of the main page
- README.md
More details on the most important repository files and scripts:
- process_data.py - script automatically downloads Kaggle meta dataset using Kaggle API and uploads preprocessed data to SQLite database. Script takes path to Kaggle credentials token and path to resulting database file as parameters.
Follow the instructions to run the web-app locally:
-
Install required external libraries (see the External Libraries section below).
-
Clone the repository.
-
Navigate to the repository's root directory.
-
Run the following commands in the repository's root directory to set up database and model.
- Run ETL pipeline that cleans data and stores in database
$ python data/process_data.py /Users/username/kaggle dataset.db
- Run ETL pipeline that cleans data and stores in database
-
Navigate to
app
folder. -
Run the following command in the app's directory to run the web app:
$ python run.py
-
Open http://0.0.0.0:3001/ in web browser.
The following external libraries should be installed in order to run the app:
- SQLAlchemy to store the preprocessed data,
- Plot.ly library for visualization,
- Flask to run the web-app locally.
- Kaggle API Kaggle API to download Kaggle meta dataset.