Hello,

I'm trying to understand the Jensen–Shannon divergence. I still don't understand the math behind it, but someone asked me to look into it and AugMix because of this paragraph:
Alternatively, we can view each set as an empirical distribution and measure the distance between
them using Kullback-Leibler (KL) or Jensen-Shannon (JS) divergence. The challenge for learning
with KL or JS divergence is that no useful gradient is provided when the two empirical distributions
have disjoint supports or have a non-empty intersection contained in a set of measure zero.
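For context on what the paragraph describes: the JS divergence is defined as JS(P, Q) = ½ KL(P ‖ M) + ½ KL(Q ‖ M) with the mixture M = (P + Q)/2. A minimal NumPy sketch (my own illustration, not from the paper or the AugMix code) of why disjoint supports give no useful gradient:

```python
import numpy as np

def kl(p, q):
    # KL(P || Q) = sum_i p_i * log(p_i / q_i); terms with p_i == 0 contribute 0
    mask = p > 0
    return float(np.sum(p[mask] * np.log(p[mask] / q[mask])))

def js(p, q):
    # JS(P, Q) = 0.5 * KL(P || M) + 0.5 * KL(Q || M), with M = (P + Q) / 2
    m = 0.5 * (p + q)
    return 0.5 * kl(p, m) + 0.5 * kl(q, m)

# Overlapping supports: JS varies smoothly with the distributions,
# so it can provide an informative gradient.
p = np.array([0.5, 0.5, 0.0, 0.0])
q = np.array([0.0, 0.5, 0.5, 0.0])
print(js(p, q))  # ≈ 0.3466 (= 0.5 * log 2)

# Disjoint supports: JS is pinned at its maximum, log(2), regardless of
# how the two distributions are arranged, so the gradient is useless.
p = np.array([1.0, 0.0, 0.0, 0.0])
q = np.array([0.0, 0.0, 0.0, 1.0])
print(js(p, q))  # ≈ 0.6931 (= log 2)
```

In the disjoint case, shifting either distribution's support leaves the value stuck at log 2, which is the "no useful gradient" problem the paper is referring to.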
from here: https://arxiv.org/pdf/1907.10764.pdf
Is this problem present in AugMix?