# Shannon Entropy, Joint Entropy, Conditional Entropy, Mutual Information, Relative Entropy (KL Divergence)

This page works through a simple example of the five concepts named above. It will be helpful in the next tutorial, which discusses von Neumann entropy, the Holevo bound, and POVMs.

Consider the following table, which gives the joint probability distribution of two random variables $X$ and $Y$:

|     | Y=0  | Y=1 | Y=2  |
|-----|------|-----|------|
| X=0 | 0    | 1/8 | 1/4  |
| X=1 | 1/16 | 0   | 1/16 |
| X=2 | 3/8  | 1/8 | 0    |


The **joint Shannon entropy** of the pair $(X, Y)$ is given by the following equation:
>$H(X, Y) = -\sum_{x \in X, y \in Y}{p(x, y)\log_2 p(x, y)}$

The value in this case is: 
>$H(X, Y) = -0\log_2 0 -\frac{1}{8}\log_2\frac{1}{8}-\frac{1}{4}\log_2\frac{1}{4}-\frac{1}{16}\log_2\frac{1}{16}-0\log_2 0-\frac{1}{16}\log_2\frac{1}{16}-\frac{3}{8}\log_2\frac{3}{8}-\frac{1}{8}\log_2\frac{1}{8}-0\log_2 0 = 2.2806$

Note: by convention $0\log_2 0 = 0$, since $p\log_2 p \to 0$ as $p \to 0^{+}$.
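
The joint entropy calculation above can be checked numerically. What follows is a minimal Python sketch, not part of the original example; the names `joint` and `entropy` are chosen here purely for illustration. Zero-probability cells are skipped, which is how the $0\log 0 = 0$ convention is applied in code.

```python
from math import log2

# Joint distribution p(x, y) from the table:
# rows are X = 0, 1, 2 and columns are Y = 0, 1, 2.
joint = [
    [0,    1/8, 1/4],
    [1/16, 0,   1/16],
    [3/8,  1/8, 0],
]

def entropy(probs):
    """Shannon entropy in bits; zero-probability terms are skipped (0 log 0 = 0)."""
    return -sum(p * log2(p) for p in probs if p > 0)

# Flatten the table so that every cell is one outcome of the pair (X, Y).
h_xy = entropy(p for row in joint for p in row)
print(round(h_xy, 4))  # 2.2806
```

Filtering with `p > 0` avoids calling `log2(0)`, which would raise a `ValueError`.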

The individual entropies are found from the marginal distributions, which are obtained by summing the joint distribution over all values of the other variable.

>$p(X=0) = \sum_{y}p(X=0, Y=y) = 0 + \frac{1}{8} + \frac{1}{4} = \frac{3}{8}$

Similarly, $p(X=1) = \frac{1}{8}$, $p(X=2)=\frac{1}{2}$, $p(Y=0)=\frac{7}{16}$, $p(Y=1)=\frac{1}{4}$, $p(Y=2)=\frac{5}{16}$.

Now,
>$H(X) = -\frac{3}{8}\log_2\frac{3}{8} -\frac{1}{8}\log_2\frac{1}{8} -\frac{1}{2}\log_2\frac{1}{2} = 1.4056$

And,
>$H(Y) = -\frac{7}{16}\log_2\frac{7}{16} -\frac{1}{4}\log_2\frac{1}{4} -\frac{5}{16}\log_2\frac{5}{16} = 1.5462$
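
The marginalization step can be sketched in Python as row sums and column sums of the joint table. This is an illustrative sketch only: the variable names are assumptions made here, and `fractions.Fraction` is used so that the marginals come out as exact fractions that can be compared with the values above.

```python
from fractions import Fraction as F
from math import log2

# Joint distribution p(x, y); rows are X = 0, 1, 2 and columns are Y = 0, 1, 2.
joint = [
    [F(0),     F(1, 8), F(1, 4)],
    [F(1, 16), F(0),    F(1, 16)],
    [F(3, 8),  F(1, 8), F(0)],
]

# Marginalize: sum each row over y to get p(x), each column over x to get p(y).
p_x = [sum(row) for row in joint]
p_y = [sum(col) for col in zip(*joint)]

def entropy(probs):
    """Shannon entropy in bits, skipping zero-probability outcomes."""
    return -sum(float(p) * log2(float(p)) for p in probs if p > 0)

print([str(p) for p in p_x])   # ['3/8', '1/8', '1/2']
print([str(p) for p in p_y])   # ['7/16', '1/4', '5/16']
print(round(entropy(p_x), 4))  # 1.4056
print(round(entropy(p_y), 4))  # 1.5462
```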


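Now, the **conditional entropies** follow from the chain rule $H(X, Y) = H(Y) + H(X|Y)$. One extra decimal place is carried here so that rounding does not distort the differences:
>$H(X|Y) = H(X, Y) - H(Y) = 2.28064 - 1.54618 = 0.73446$

And,
>$H(Y|X) = H(X, Y) - H(X) = 2.28064 - 1.40564 = 0.87500$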
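
As a cross-check, the conditional entropy can also be computed directly from its definition, $H(X|Y) = -\sum_{x, y} p(x, y)\log_2 p(x|y)$ with $p(x|y) = p(x, y)/p(y)$, and compared with the subtraction used above. The definition is standard but is not written out in the text, and the names in this Python sketch are illustrative assumptions.

```python
from math import log2

# Joint distribution p(x, y); rows are X = 0, 1, 2 and columns are Y = 0, 1, 2.
joint = [
    [0,    1/8, 1/4],
    [1/16, 0,   1/16],
    [3/8,  1/8, 0],
]
p_x = [sum(row) for row in joint]        # marginal of X (row sums)
p_y = [sum(col) for col in zip(*joint)]  # marginal of Y (column sums)

def entropy(probs):
    """Shannon entropy in bits, skipping zero-probability outcomes."""
    return -sum(p * log2(p) for p in probs if p > 0)

h_xy = entropy(p for row in joint for p in row)

# Chain rule: H(X|Y) = H(X,Y) - H(Y) and H(Y|X) = H(X,Y) - H(X).
h_x_given_y = h_xy - entropy(p_y)
h_y_given_x = h_xy - entropy(p_x)

# Direct definition: H(X|Y) = -sum p(x,y) * log2( p(x,y) / p(y) ).
h_x_given_y_direct = -sum(
    joint[x][y] * log2(joint[x][y] / p_y[y])
    for x in range(3)
    for y in range(3)
    if joint[x][y] > 0
)

print(round(h_x_given_y, 5))         # 0.73446
print(round(h_x_given_y_direct, 5))  # 0.73446
print(round(h_y_given_x, 5))         # 0.875
```

Both routes agree, which is a quick way to catch rounding or bookkeeping mistakes in a hand calculation.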

Now, the **mutual information**, computed to five decimal places, is:
>$I(X;Y) = H(X) - H(X|Y) = 1.40564 - 0.73446 = 0.67118$

The same value is obtained from $H(Y)$ and $H(Y|X)$: $H(Y) - H(Y|X) = 1.54618 - 0.87500 = 0.67118$.
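
The symmetry can be verified numerically. Substituting the chain rule gives $I(X;Y) = H(X) + H(Y) - H(X, Y)$, which treats $X$ and $Y$ identically. The Python sketch below compares that with the standard double-sum definition $I(X;Y) = \sum_{x, y} p(x, y)\log_2\frac{p(x, y)}{p(x)\,p(y)}$, which the text does not state explicitly; all names are illustrative.

```python
from math import log2

# Joint distribution p(x, y); rows are X = 0, 1, 2 and columns are Y = 0, 1, 2.
joint = [
    [0,    1/8, 1/4],
    [1/16, 0,   1/16],
    [3/8,  1/8, 0],
]
p_x = [sum(row) for row in joint]        # marginal of X (row sums)
p_y = [sum(col) for col in zip(*joint)]  # marginal of Y (column sums)

def entropy(probs):
    """Shannon entropy in bits, skipping zero-probability outcomes."""
    return -sum(p * log2(p) for p in probs if p > 0)

# Via entropies: I(X;Y) = H(X) + H(Y) - H(X,Y).
h_xy = entropy(p for row in joint for p in row)
i_from_entropies = entropy(p_x) + entropy(p_y) - h_xy

# Via the definition: sum of p(x,y) * log2( p(x,y) / (p(x) p(y)) ).
i_direct = sum(
    joint[x][y] * log2(joint[x][y] / (p_x[x] * p_y[y]))
    for x in range(3)
    for y in range(3)
    if joint[x][y] > 0
)

print(round(i_from_entropies, 5))  # 0.67118
print(round(i_direct, 5))          # 0.67118
```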

To calculate the **relative entropy**, we first need the marginal probability distributions of the two variables:
$$
\begin{aligned}
P(X) &= \left\{\frac{3}{8}, \frac{1}{8}, \frac{1}{2}\right\} \\
P(Y) &= \left\{\frac{7}{16}, \frac{1}{4}, \frac{5}{16}\right\}
\end{aligned}
$$
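The relative entropy of a distribution $P$ with respect to a distribution $Q$ over the same set of outcomes is defined as:
$$
D(P \| Q) = \sum_{x}P(x) \log_2{\frac{P(x)}{Q(x)}}
$$
In our case, taking $P = P(X)$ and $Q = P(Y)$ and matching outcomes by their value ($0$, $1$, $2$), this is: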

$$
D(X \| Y) = P(X=0) \cdot \log_2 \frac{P(X=0)}{P(Y=0)} + P(X=1) \cdot \log_2 \frac{P(X=1)}{P(Y=1)} + P(X=2) \cdot \log_2 \frac{P(X=2)}{P(Y=2)}
$$
Calculating, we get:
$$
D(X \| Y) = 0.375 \cdot \log_2 \frac{0.375}{0.4375} + 0.125 \cdot \log_2 \frac{0.125}{0.25} + 0.5 \cdot \log_2 \frac{0.5}{0.3125} = 0.1306
$$
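
The relative entropy computation can be sketched in Python as follows. The function name `kl_divergence` and its handling of zero probabilities are assumptions made for this illustration. The second call swaps the arguments to show that relative entropy is not symmetric in general.

```python
from math import inf, log2

# Marginal distributions from the example, indexed by outcome 0, 1, 2.
p_x = [3/8, 1/8, 1/2]
p_y = [7/16, 1/4, 5/16]

def kl_divergence(p, q):
    """D(p || q) in bits for two distributions over the same ordered outcomes."""
    total = 0.0
    for pi, qi in zip(p, q):
        if pi == 0:
            continue      # a term with p = 0 contributes nothing
        if qi == 0:
            return inf    # p puts mass where q has none
        total += pi * log2(pi / qi)
    return total

print(round(kl_divergence(p_x, p_y), 4))  # 0.1306
print(round(kl_divergence(p_y, p_x), 4))  # 0.1354
```

Because the two directions give different values, the order of the arguments in $D(P \| Q)$ matters.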