Simplify linear baseline model and bump all R dependencies #190

dfsnow · 2024-01-26T22:25:22Z

This PR simplifies the linear baseline model by removing the effect encoder step used in its preprocessing pipeline and replacing it with standard one-hot features. This is to reduce the dependency weight of the pipeline, as the embed recipes package has enormous sub-dependencies.

This PR also bumps all the R/renv dependencies used by each of the pipeline profiles to their latest version available on CRAN.

dfsnow · 2024-01-27T05:35:45Z

DESCRIPTION

@@ -9,7 +9,6 @@ Depends:
    ccao,
    conflicted,
    dplyr,
-    embed,


The embed package has super heavy dependencies (keras, tensorflow), so getting rid of it lightens the dependencies of the main lockfile quite a bit.

dfsnow · 2024-01-27T05:37:48Z

R/recipes.R

-    embed::step_lencode_glm(
-      all_of(cat_vars), -has_role("ID"),
-      outcome = vars(meta_sale_price)
-    ) %>%


Dropping this step lets us remove the embed dependency, but it also changes the performance of the linear model. Tagging @ccao-jardine here just as an FYI.

dfsnow · 2024-01-27T05:40:20Z

@jeancochrane @wrridgeway I'm merging this for expediency, but take a look Monday really quick.

Bump all renv dependencies

5a4b627

dfsnow requested review from wrridgeway and jeancochrane as code owners January 26, 2024 22:25

dfsnow added 3 commits January 26, 2024 22:56

Drop embed dependency

a08469b

Drop cat embedding step

d22570f

Cleanup linear model recipe

bab40c0

dfsnow temporarily deployed to deploy January 27, 2024 02:30 — with GitHub Actions Inactive

Add callr dependency

ee5eb2f

dfsnow temporarily deployed to deploy January 27, 2024 04:39 — with GitHub Actions Inactive

dfsnow commented Jan 27, 2024

View reviewed changes

dfsnow changed the title ~~Bump all R dependencies~~ Simplify linear baseline model and bump all R dependencies Jan 27, 2024

dfsnow merged commit fb212f4 into master Jan 27, 2024
7 checks passed

dfsnow deleted the dfsnow/bump-dependencies branch January 27, 2024 05:40

dfsnow mentioned this pull request Jan 27, 2024

Try categorical embeddings using the embed package #49

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Simplify linear baseline model and bump all R dependencies #190

Simplify linear baseline model and bump all R dependencies #190

dfsnow commented Jan 26, 2024 •

edited

Loading

dfsnow Jan 27, 2024

dfsnow Jan 27, 2024

dfsnow commented Jan 27, 2024

Simplify linear baseline model and bump all R dependencies #190

Simplify linear baseline model and bump all R dependencies #190

Conversation

dfsnow commented Jan 26, 2024 • edited Loading

dfsnow Jan 27, 2024

Choose a reason for hiding this comment

dfsnow Jan 27, 2024

Choose a reason for hiding this comment

dfsnow commented Jan 27, 2024

dfsnow commented Jan 26, 2024 •

edited

Loading