Applying log transform to skewed outcome variable #387

bhu-strata · 2022-12-05T16:26:46Z

My numerical outcome variable is highly skewed so applying a log transform seems to bring it closer to a normal distribution and improves the EBM training accuracy. It is also seems preferable so that the model training isn't biased by the outliers. However, I am having a hard time reconciling the scores and intercept from the model trained on the log-transformed outcome as they differ greatly (even after exponentiating to un-transform them) from a model trained on the original (i.e. not log-transformed) outcome.

Any intuition on whether EBMs should benefit from transforming high skewed data?
Any suggestions on how to un-transform the intercept and scores so they can be interpretable in the original outcome space?

paulbkoch · 2023-01-31T13:27:27Z

Hi @bhu-strata -- EBMs should benefit from this if it fits your outcome. This is known as a link function in other GAM packages. It's something that's in our backlog for implementation, but isn't part of our package yet. See #137.

Unfortunately, to maintain additivity you can't really un-transform the scores in the model itself. This is something shared with other GAM packages. I think practitioners eventually get a feel for how to interpret the scores from various link functions. I only have personal experience with this for logits though, so I'm sort of guessing in that regard.

paulbkoch · 2023-02-16T22:38:34Z

Closing this issue since we already have an issue tracking alternative link functions.

paulbkoch · 2023-05-16T08:17:43Z

Hi @bhu-strata -- We recently shipped v0.4.0, which includes alternative objectives with a log link. More details are available in our documentation: https://interpret.ml/docs/ebm.html#explainableboostingregressor

paulbkoch closed this as completed Feb 16, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Applying log transform to skewed outcome variable #387

Applying log transform to skewed outcome variable #387

bhu-strata commented Dec 5, 2022

paulbkoch commented Jan 31, 2023

paulbkoch commented Feb 16, 2023

paulbkoch commented May 16, 2023

Applying log transform to skewed outcome variable #387

Applying log transform to skewed outcome variable #387

Comments

bhu-strata commented Dec 5, 2022

paulbkoch commented Jan 31, 2023

paulbkoch commented Feb 16, 2023

paulbkoch commented May 16, 2023