Simplify linear baseline model and bump all R dependencies #190

Merged
dfsnow merged 5 commits into master from dfsnow/bump-dependencies
Jan 27, 2024

Conversation


@dfsnow dfsnow commented Jan 26, 2024

This PR simplifies the linear baseline model by removing the effect encoder step from its preprocessing pipeline and replacing it with standard one-hot features. The goal is to reduce the dependency weight of the pipeline, since the embed package (an extension to recipes) pulls in enormous sub-dependencies.

This PR also bumps all the R/renv dependencies used by each of the pipeline profiles to the latest versions available on CRAN.

@@ -9,7 +9,6 @@ Depends:
     ccao,
     conflicted,
     dplyr,
-    embed,
The embed package has super heavy dependencies (keras, tensorflow), so getting rid of it lightens the dependencies of the main lockfile quite a bit.

Comment on lines -80 to -83
-  embed::step_lencode_glm(
-    all_of(cat_vars), -has_role("ID"),
-    outcome = vars(meta_sale_price)
-  ) %>%
Dropping this step lets us remove the embed dependency, but it also changes the performance of the linear model. Tagging @ccao-jardine here just as an FYI.
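
For illustration only, here is a minimal sketch of what a one-hot replacement for the removed step could look like. It is not taken from the diff: it assumes `recipes::step_dummy()` with `one_hot = TRUE` as the replacement, and the toy data and the column names `meta_pin`, `char_ext_wall`, and `char_bldg_sf` are hypothetical stand-ins for the real pipeline inputs.

```r
library(recipes)

# Toy data. Only meta_sale_price comes from the PR; the other
# column names are hypothetical placeholders.
toy <- data.frame(
  meta_pin = c("a", "b", "c", "d"),
  meta_sale_price = c(100000, 150000, 120000, 200000),
  char_ext_wall = factor(c("brick", "frame", "brick", "stucco")),
  char_bldg_sf = c(1000, 1500, 1200, 2000)
)

# Categorical predictors to encode (hypothetical contents)
cat_vars <- c("char_ext_wall")

rec <- recipe(meta_sale_price ~ ., data = toy) %>%
  # Mark the identifier so it is not treated as a predictor
  update_role(meta_pin, new_role = "ID") %>%
  # One indicator column per factor level, instead of the
  # outcome-based effect encoding from embed::step_lencode_glm()
  step_dummy(all_of(cat_vars), -has_role("ID"), one_hot = TRUE)

# Estimate the recipe and apply it to the training data
baked <- bake(prep(rec), new_data = NULL)
print(names(baked))
```

Unlike the effect encoder, one-hot encoding never looks at `meta_sale_price`, so it adds one column per category level rather than a single numeric column per variable. That difference is one reason the linear model's performance can shift after the change.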

@dfsnow dfsnow changed the title from "Bump all R dependencies" to "Simplify linear baseline model and bump all R dependencies" Jan 27, 2024

dfsnow commented Jan 27, 2024

@jeancochrane @wrridgeway I'm merging this for expediency, but please take a quick look on Monday.

@dfsnow dfsnow merged commit fb212f4 into master Jan 27, 2024
7 checks passed
@dfsnow dfsnow deleted the dfsnow/bump-dependencies branch January 27, 2024 05:40