
Clean-up file structure and setup app to deploy Heroku #39

Merged · 11 commits into main from clean-files-deploy · Mar 8, 2021

Conversation

Owner

@WraySmith WraySmith commented Mar 8, 2021

Cleaned up the file structure:

  • moved data into data folder and created data folder structure
  • moved scripts and app into folders within src
  • minor updates to code based on new data locations

NOTE: @mqharris will need to go into the data folder and update the README and anything else required. The scripts will also likely need their file paths updated, and the script folder should include a README giving an overview of each script. This can either be done directly on this branch as part of this PR or completed after it is pulled in.

Updated the app so that the csv isn't read every time a function is called. Performance has improved a bit but still needs more work; someone should probably sit down and review the entire data workflow of the functions and the app.

Added Heroku deployment files; the app is deployed here: https://boardgame-dashboard-data551.herokuapp.com/

It functions well except for the top two figures on the first tab, which will need to be optimized to make the app more responsive.

Also, definitely take a look at the reflection document and make sure everything is correct in it.

Collaborator

@mqharris mqharris left a comment


lgtm

@WraySmith WraySmith merged commit 0c020e4 into main Mar 8, 2021
@WraySmith WraySmith deleted the clean-files-deploy branch March 8, 2021 01:38
@WraySmith WraySmith added this to Done in Python Dashboard Mar 8, 2021
"id": "irish-tuning",
"id": "clinical-louis",
Contributor


The changes to these ids look like some automated naming structure?

Owner Author


this is just typical ipynb diff stuff

from app_wrangling import call_boardgame_data, subset_data

# load board game data
boardgame_data = call_boardgame_data()
Contributor


Is this so that we are only calling the .csv once?

Owner Author


yup, there will still need to be more work on the data logic workflow, quick update was just to ensure the csv was only being read in once
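The PR achieves this by loading the data once at module level; an equivalent pattern (a hypothetical sketch, not the project's actual loader) is to memoize the loader so repeated calls reuse the already-parsed DataFrame. The inline csv text here is a stand-in for the real data file:

```python
from functools import lru_cache
import io

import pandas as pd

# Stand-in for the real csv file on disk.
_CSV_TEXT = "name,rating\nCatan,7.1\nAzul,7.8\n"


@lru_cache(maxsize=1)
def call_boardgame_data():
    """Parse the csv on the first call; later calls return the cached DataFrame."""
    return pd.read_csv(io.StringIO(_CSV_TEXT))


first = call_boardgame_data()
second = call_boardgame_data()
print(first is second)  # True: the csv was parsed only once
```

Either way, callbacks then receive the shared DataFrame as an argument instead of re-reading the file.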

color="primary",
),
dbc.Collapse(
dbc.Card(dbc.CardBody(data_set_descirption())),
Contributor


Seems that this function has a persistent spelling mistake. Small fix

Owner Author


added to #38

cat: list
mech: list
pub: list
n: int

return: pandas dataframe
"""
- boardgame_data = call_boardgame_data()
+ boardgame_data = data.copy(deep=True)
Contributor


What is the advantage of having this as a .copy() versus just making boardgame_data = data?

Owner Author

@WraySmith WraySmith Mar 8, 2021


I just went through this quickly, but any time a function potentially modifies data in place, the data should be copied to avoid the side effect of mutating the original. This wouldn't have been a problem before, since you were resetting with read_csv every time, but now we're using a single loaded dataset that lives in memory. deep=True is required as the dataframe has list columns.

There may be some performance gains from not copying the dataframe where it is definitely not modified in place. That should be done as part of the general data performance review/optimization. For now I went with being safe and used copy in all instances where modifications could potentially be happening to the dataframe.
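A minimal sketch of the copy semantics discussed above, using toy data in place of the app's DataFrame. One caveat worth noting: per the pandas documentation, `copy(deep=True)` copies the data buffers but does not recursively copy Python objects stored in object columns, so list entries are still shared by reference:

```python
import pandas as pd

df = pd.DataFrame({
    "category": [["Strategy"], ["Party"]],  # list-valued column, as in the app
    "rating": [7.5, 6.2],
})

deep = df.copy(deep=True)

# Scalar assignment on the deep copy leaves the original untouched.
deep.loc[0, "rating"] = 9.9
print(df.loc[0, "rating"])  # 7.5

# But the nested list objects themselves are shared, not cloned:
# mutating a list in place would still be visible in both frames.
print(deep.loc[0, "category"] is df.loc[0, "category"])  # True
```

So the deep copy protects against column-level assignment, but truly defensive copying of list columns would need something like `df["category"].apply(list)`.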
