Test adding ARI to a New Model Run #245
base: master
Conversation
dvc.yaml
Outdated
nitpick (blocking): Most of the changes here just move stuff around. Let's revert the changes to this file, since they don't actually impact the outcome.
params.yaml
Outdated
```diff
- run_id: "2024-03-17-stupefied-maya"
+ run_id: "2024-06-02-test-damon"
```
issue (blocking): The `export.run_id` key is really only used for changing the outputs of the `export` pipeline stage. If you want to specify a `run_id`, use a parameter in the YAML header of your Quarto doc.
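As a sketch of what that could look like (the `run_id` parameter name and value here are illustrative, not taken from the repo), a Quarto YAML header can declare the run as a document parameter and chunks can read it via `params`:

````qmd
---
title: "Testing the Impact of the Affordability Risk Index on the Model"
format: html
params:
  run_id: "2024-06-02-test-damon"
---

```{r}
# Read the parameter inside any chunk instead of hardcoding the run
run_id <- params$run_id
```
````

Rendering with `quarto render doc.qmd -P run_id:some-other-run` would then override the default without touching `params.yaml`.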
pipeline/00-ingest.R
Outdated
issue (blocking): It's fine to edit this when messing around, but we shouldn't include changes to this file with the merge request. See my earlier structure note for a workaround.
ARI-model-test.qmd
Outdated
issue (blocking): Before I review the content of this report, let's get the structure worked out. Let's make a new root-level directory (`analyses/`) where one-off analyses related to the model can live. Then we'd have:

- `reports/` for things that are generated with every run
- `analyses/` for things that are created one time
In order to make the analysis reproducible, you need to load data from sources accessible to others. That would mean:

- For files in `input/`, push to the S3 DVC cache, then use `read_parquet()` to load the file you pushed directly from S3
- For files in `output/`, finish a run and push it to S3, then load the files directly from S3, again using `read_parquet()`
Note that you can re-use the `reports/_setup.qmd` file here, as it will load the input data from DVC (assuming you pushed) and most relevant output data for a given run.
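A minimal sketch of the second pattern, assuming `arrow` is attached and using a placeholder bucket and key (the real S3 paths from the run output would go here):

```r
library(arrow)

# Hypothetical S3 URI; substitute the actual bucket/prefix for the pushed run
assessment_data <- read_parquet(
  "s3://example-bucket/output/2024-06-02-test-damon/assessment_data.parquet"
)
```

Because the path points at an immutable, completed run rather than the working tree, anyone re-rendering the report pulls exactly the same snapshot.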
These should all be fixed now. I pushed to DVC, let me know if there are any other steps that should be taken.
```r
library(sf)
library(dplyr)
library(ggplot2)
library(noctua)
library(arrow)
library(kableExtra)
library(spdep)
library(leaflet)
```
nitpick: Alphabetically order your dependencies.
````qmd
title: "Testing the Impact of the Affordability Risk Index on the Model"
format: html
---
```{r, include = FALSE}
````
nitpick: Run `styler::style_file()` and `lintr::lint()` on this. There are some minor formatting issues in here.
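For reference, both can be run from an R console against the doc (filename taken from this PR):

```r
# Restyle the Quarto doc in place, then list any remaining lints
styler::style_file("ARI-model-test.qmd")
lintr::lint("ARI-model-test.qmd")
```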
```r
con <- dbConnect(noctua::athena())
noctua::noctua_options(cache = 10)
```

```r
input_data <- read_parquet("input/assessment_data.parquet") %>%
```
issue (blocking): This line is the main point of difference between this and the other reports. The other reports look at the current data, while this report should look at a snapshot of data that includes the variable of interest. In other words, this should ingest input data from DVC and output data from S3 so that we have a fixed, deterministic report for each feature.
````qmd
x <- round(x, 2)
print(paste("Correlation =", x, "when removing 0s"))
```

## Creating a map to see if there are spatial disparities in the inter-tract SHAP values
````
nitpick: Use consistent title case.
Co-authored-by: Dan Snow <31494343+dfsnow@users.noreply.github.com>
…vm into Test_New_Model_Run
Attached is a markdown file, along with changes to the model pipeline, to allow a test of the Affordability Risk Index. This is mostly for Dan, but let me know where it should be housed. It shouldn't be merged, but it seemed easiest to review in this folder.