Skip to content

WilliamStaudenmeier/Mobile_Trends_Using_Spark_and_Dash

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

29 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Mobile Trends Using Spark and Dash

This is a light-weight Dash app using Python and Spark on the backend to asyncronously process large files of PlayStore and AppStore data.

You can get the data files for Google here: https://www.kaggle.com/datasets/gauthamp10/google-playstore-apps (Google-Playstore.csv), and for Apple here: https://www.kaggle.com/datasets/gauthamp10/apple-appstore-apps (appleAppData.csv).

First, copy these files into the directory, Mobile_Trends_Using_Spark_and_Dash, then navigate to the same directory in the command line or editor of your choice.

The app consists of three main files:

  • app.py (entrypoint)
  • processor.py (processes data into figures)
  • controller.py (loads and cleans data files)

The application runs on Python 3.8. It will not run on Python 3.10 without refactoring the Collections library which no longer includes Iterable as of 3.10.

Once in the directory, run these commands in order:

  • python3 -m pip install --user virtualenv (unless you already have venv or virtualenv installed); you can also run this without --user
  • python3.8 -m venv env
  • source env/bin/activate
  • pip install -r requirements.txt
  • python app.py

The app will take five minutes to extract and process the data, and will run locally on http://127.0.0.1:8050/.

To run tests:

  • pytest tests.py

About

Asynchronous, classic OOP on the Spark engine with a light front-end

Topics

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Languages