Try categorical embeddings using the embed package #49

Closed
dfsnow opened this issue Nov 2, 2023 · 1 comment
Labels
method ML technique or method change

Comments

@dfsnow
Member

dfsnow commented Nov 2, 2023

Currently, the CCAO relies on LightGBM's native categorical handling for categorical features. However, embeddings may offer a more efficient or more accurate way to handle categoricals. See https://embed.tidymodels.org/
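For context, one family of encodings in the embed package is likelihood (effect) encoding, e.g. `step_lencode_glm()`, which replaces each category level with a single number estimated from the outcome. Below is a minimal sketch of that general idea in plain Python. It is not the embed implementation and not the CCAO pipeline: the function names, the `smoothing` parameter, and the shrinkage formula are all assumptions made for illustration.

```python
from collections import defaultdict


def fit_target_encoding(categories, targets, smoothing=10.0):
    """Learn a smoothed mean of the target for each category level.

    Hypothetical helper: levels with few observations are pulled toward
    the global mean, with `smoothing` acting as a pseudo-count.
    """
    if len(categories) != len(targets):
        raise ValueError("categories and targets must have the same length")
    if not targets:
        raise ValueError("at least one observation is required")

    global_mean = sum(targets) / len(targets)
    sums = defaultdict(float)
    counts = defaultdict(int)
    for level, y in zip(categories, targets):
        sums[level] += y
        counts[level] += 1

    encoding = {
        level: (sums[level] + smoothing * global_mean) / (counts[level] + smoothing)
        for level in counts
    }
    return encoding, global_mean


def apply_target_encoding(categories, encoding, global_mean):
    """Replace each level with its learned value; unseen levels get the global mean."""
    return [encoding.get(level, global_mean) for level in categories]


# Made-up example data, for illustration only.
levels = ["a", "a", "b"]
prices = [1.0, 3.0, 5.0]
enc, overall = fit_target_encoding(levels, prices, smoothing=1.0)
encoded = apply_target_encoding(["a", "b", "c"], enc, overall)
```

The encoded column is numeric, so it can be fed to any model, whereas LightGBM's native handling works on the category levels directly.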

@dfsnow dfsnow added this to the 2024 model changes milestone Dec 5, 2023
@dfsnow dfsnow added the method ML technique or method change label Dec 5, 2023
@dfsnow
Member Author

dfsnow commented Jan 27, 2024

We tested embeddings of categoricals using the same process as our linear baseline model. However, the embeddings performed worse than simply letting LightGBM handle the categoricals natively. Additionally, the embed package's dependencies are quite heavy, so as of #190 we've removed embeddings entirely (including from the linear model).

@dfsnow dfsnow closed this as completed Jan 27, 2024