Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Applying log transform to skewed outcome variable #387

Closed
bhu-strata opened this issue Dec 5, 2022 · 3 comments
Closed

Applying log transform to skewed outcome variable #387

bhu-strata opened this issue Dec 5, 2022 · 3 comments

Comments

@bhu-strata
Copy link

My numerical outcome variable is highly skewed so applying a log transform seems to bring it closer to a normal distribution and improves the EBM training accuracy. It is also seems preferable so that the model training isn't biased by the outliers. However, I am having a hard time reconciling the scores and intercept from the model trained on the log-transformed outcome as they differ greatly (even after exponentiating to un-transform them) from a model trained on the original (i.e. not log-transformed) outcome.

  1. Any intuition on whether EBMs should benefit from transforming high skewed data?
  2. Any suggestions on how to un-transform the intercept and scores so they can be interpretable in the original outcome space?
@paulbkoch
Copy link
Collaborator

Hi @bhu-strata -- EBMs should benefit from this if it fits your outcome. This is known as a link function in other GAM packages. It's something that's in our backlog for implementation, but isn't part of our package yet. See #137.

Unfortunately, to maintain additivity you can't really un-transform the scores in the model itself. This is something shared with other GAM packages. I think practitioners eventually get a feel for how to interpret the scores from various link functions. I only have personal experience with this for logits though, so I'm sort of guessing in that regard.

@paulbkoch
Copy link
Collaborator

Closing this issue since we already have an issue tracking alternative link functions.

@paulbkoch
Copy link
Collaborator

Hi @bhu-strata -- We recently shipped v0.4.0, which includes alternative objectives with a log link. More details are available in our documentation: https://interpret.ml/docs/ebm.html#explainableboostingregressor

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Development

No branches or pull requests

2 participants