Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

feat: Target Encoder implementation #34

Closed
jitingxu1 opened this issue Mar 20, 2024 · 1 comment
Closed

feat: Target Encoder implementation #34

jitingxu1 opened this issue Mar 20, 2024 · 1 comment
Assignees

Comments

@jitingxu1
Copy link
Collaborator

Definition

Target encoding involves replacing categorical feature values with a numeric representation derived from the target variable. This method aims to capture the relationship between categorical features and the target variable by encoding categories with their respective impact on the target.

use case

  • High Cardinality Features

requirements

  • Regression, binary and multiclass classification.
  • Handle overfitting
  • Handle unknown category, the new category not present in the training dataset.
  • Handle missing value

Implementation

Fit

  • Treat missing Value
  • Use the mean value of target variable for that category for regression task
  • Use the conditional probability given that category
  • Handle Over-fittings
    • [recommend] Smoothing
    • KFold Target Encoder
    • Leave-one-out
    • Adding Gaussian Noise

Transform

  • Unknown category

Issues

  • over-fitting
  • target leakage

Reference

@deepyaman deepyaman self-assigned this Mar 20, 2024
@jitingxu1 jitingxu1 mentioned this issue Mar 29, 2024
1 task
@jitingxu1
Copy link
Collaborator Author

done

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
Archived in project
Development

No branches or pull requests

2 participants