Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Gower distance tends to fit global model #158

Closed
pkopper opened this issue Jun 9, 2019 · 3 comments
Closed

Gower distance tends to fit global model #158

pkopper opened this issue Jun 9, 2019 · 3 comments

Comments

@pkopper
Copy link
Contributor

pkopper commented Jun 9, 2019

I experienced a practical problem when using Gower's distance as a dissimilarity measure. In many different settings, the resulting model was very global. This is - to my mind - because when using Gower's distance we do not work with a kernel function. This can be argued to be meaningful as Gower's distance 'scales' the resulting dissimilarities already. However, practically I have observed in some settings that the resulting dissimilarities are not very discriminating so that we end up with an explainer which is not that different from a global explainer.
Did someone else make similar observations and does someone have a good solution for working with mixed data?
I found the distance measure (eq. 9) from the article below very helpful in different contexts. However, the use of it would require to change to package code.
http://www.cs.ust.hk/~qyang/Teaching/537/Papers/huang98extensions.pdf

@thomasp85
Copy link
Owner

It might make sense to allow a kernel on top of the Gower distance as well...

@pkopper
Copy link
Contributor Author

pkopper commented Jun 11, 2019

If you want to I can suggest a fix via pull request.

@thomasp85
Copy link
Owner

Thank you — that would be welcome

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants