change default sparse_threshold to 0 in TableVectorizer #646

Merged (2 commits) on Jul 18, 2023


@LeoGrin (Contributor) commented Jul 18, 2023

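This changes the default `sparse_threshold` of `TableVectorizer` to 0, which means that the output is always dense. I don't know which is better between this and a small value (like 0.01 or 0.05). Right now the default is 0.3, which means that a simple pipeline `TableVectorizer` --> `HistGradientBoostingClassifier` often breaks when experimenting, since `HistGradientBoostingClassifier` does not accept sparse input.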
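To make the effect of the threshold concrete, here is a minimal pure-Python sketch of the density rule as documented for scikit-learn's `ColumnTransformer.sparse_threshold`. It assumes `TableVectorizer` inherits that behaviour. The helper name `stacks_as_sparse` and the example table shape are hypothetical, chosen only for illustration.

```python
def stacks_as_sparse(nnz, total, sparse_threshold, any_sparse=True):
    """Sketch of the documented rule: keep the stacked output sparse only if
    at least one transformer produced a sparse block AND the overall density
    (non-zero entries / all entries) is strictly below sparse_threshold."""
    if not any_sparse or sparse_threshold == 0:
        # sparse_threshold=0 means "always return a dense array"
        return False
    density = nnz / total
    return density < sparse_threshold


# Hypothetical table: 1000 rows, one categorical column one-hot encoded into
# 10 columns (one non-zero per row) plus 2 dense numeric columns, whose
# entries all count toward the non-zero total.
n_rows = 1000
nnz = n_rows * 1 + n_rows * 2
total = n_rows * (10 + 2)
density = nnz / total  # 0.25

old_default = stacks_as_sparse(nnz, total, sparse_threshold=0.3)
new_default = stacks_as_sparse(nnz, total, sparse_threshold=0.0)
small_value = stacks_as_sparse(nnz, total, sparse_threshold=0.05)
```

Under these assumptions, the old default of 0.3 keeps this output sparse, which an estimator that rejects sparse input cannot consume. A threshold of 0 always yields dense output. A small value such as 0.05 would yield dense output for this table but could still produce sparse output for very high-cardinality columns.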

@GaelVaroquaux (Member) left a comment

LGTM, merging

@GaelVaroquaux GaelVaroquaux merged commit 77ea500 into skrub-data:main Jul 18, 2023
24 checks passed