# Tutorial 4: Plot clusters

This tutorial shows how to plot the clusters created from the data in tutorial 1.

**NOTE FOR CONTRIBUTORS: Always clear all output before committing (``Cell`` > ``All Output`` > ``Clear``)!**

In [None]:
# Magic
%matplotlib inline
# Reload modules whenever they change
%load_ext autoreload
%autoreload 2

# Make clusterking package available even without installation
import sys
sys.path = ["../../"] + sys.path

In [None]:
import matplotlib.pyplot as plt
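# Importing Axes3D registers the '3d' projection (needed for older matplotlib versions)
from mpl_toolkits.mplot3d import Axes3D  # noqa: F401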
from pathlib import Path

from clusterking.plots import ClusterPlot
from clusterking.data.data import Data

As in tutorial 3 we load the data created in tutorial 1:

In [None]:
d = Data("output/cluster/", "tutorial_basics")

## Manual Plotting

We create a data frame out of the data and choose colours and markers to make things ready for plotting:

In [None]:
df = d.df
clusters = list(df['cluster'].unique())
colors = ["red", "green", "blue", "pink"]
markers = ["o", "v", "^"]

### Manual 3d Plot

Let's start with 3d plots:

In [None]:
# defining the axes
# (Figure.gca no longer accepts a projection argument in recent matplotlib)
ax = plt.figure().add_subplot(111, projection='3d')
ax.set_xlabel('CVL_bctaunutau')
ax.set_ylabel('CSL_bctaunutau')
ax.set_zlabel('CT_bctaunutau')

# looping over all clusters and plotting the points of each one
for cluster in clusters:
    df_cluster = df[df['cluster'] == cluster]
    ax.scatter(
        df_cluster['CVL_bctaunutau'], 
        df_cluster['CSL_bctaunutau'], 
        df_cluster['CT_bctaunutau'], 
        color=colors[cluster % len(colors)], 
        marker=markers[cluster % len(markers)],
        label=cluster
    )

plt.legend(loc='upper left');
plt.show()
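
The modulo indexing of ``colors`` and ``markers`` used above deserves a short illustration. Because the two lists have different lengths (4 and 3), the combination of color and marker only repeats after 12 clusters. The following is a minimal sketch, assuming that cluster labels are non-negative integers; the helper name ``style_for`` is made up for this example.

```python
colors = ["red", "green", "blue", "pink"]
markers = ["o", "v", "^"]


def style_for(cluster):
    """Return the (color, marker) pair for an integer cluster label."""
    return colors[cluster % len(colors)], markers[cluster % len(markers)]


# Collect the styles of the first 24 cluster labels
styles = [style_for(cluster) for cluster in range(24)]

# Number of distinct (color, marker) combinations among them
n_distinct = len(set(styles))
print(n_distinct)
```

With more than 12 clusters, two clusters will share the same style, so adding more colors or markers (with coprime list lengths) may be worthwhile.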

### Manual 2d Plot

In [None]:
# defining the axes
fig, ax = plt.subplots()
ax.set_xlabel('CVL_bctaunutau')
ax.set_ylabel('CSL_bctaunutau')

# defining a cut at a fixed value of CT
CT_value_index = 2
CT_value = df['CT_bctaunutau'].unique()[CT_value_index]

# looping over all clusters, keeping only the points in the chosen CT plane
for cluster in clusters:
    df_cluster = df[df['cluster'] == cluster]
    df_cluster = df_cluster[df_cluster['CT_bctaunutau'] == CT_value]
    ax.scatter(
        df_cluster['CVL_bctaunutau'], 
        df_cluster['CSL_bctaunutau'], 
        color=colors[cluster % len(colors)], 
        marker=markers[cluster % len(markers)],
        label=cluster
    )

plt.legend(bbox_to_anchor=(1.2, 1.0));
plt.show()
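
The way the cut is selected above can be tried out on a small made-up data frame. The sketch below uses a toy 3 x 3 x 3 grid with invented cluster labels (not the tutorial data) to show what ``unique()[index]`` picks and how many points end up in the resulting slice.

```python
import itertools

import pandas as pd

# Toy grid: 3 x 3 x 3 points with made-up cluster labels
values = [-1.0, 0.0, 1.0]
toy = pd.DataFrame(
    list(itertools.product(values, repeat=3)),
    columns=["CVL_bctaunutau", "CSL_bctaunutau", "CT_bctaunutau"],
)
toy["cluster"] = (toy["CVL_bctaunutau"] > 0).astype(int)

# unique() keeps the order of first appearance, so index 2 is the third
# distinct CT value encountered in the frame
ct_value = toy["CT_bctaunutau"].unique()[2]
ct_slice = toy[toy["CT_bctaunutau"] == ct_value]

print(ct_value, len(ct_slice))
```

Note that ``unique()`` does not sort its result. If the rows are not ordered by the CT column, sorting the unique values first makes the choice of index easier to reason about.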

## Using ``ClusterPlot``

Set up the plotter:

In [None]:
cp = ClusterPlot(d)

In [None]:
cp.draw_legend = True

### 3D plots

Scatter plot: The list specifies which columns are shown on the axes. 
Changing the order of the columns rotates the cube. 

In [None]:
cp.scatter(['CVL_bctaunutau', 'CSL_bctaunutau', 'CT_bctaunutau'])

If it is still hard to get an overview, we can use the ``clusters`` argument to show only certain clusters.

In [None]:
cp.scatter(['CVL_bctaunutau', 'CSL_bctaunutau', 'CT_bctaunutau'], clusters=[0, 2])

In [None]:
cp.savefig("output/cluster/scatter_3d_02.png")
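
The folder ``output/cluster/`` already exists here because the data was loaded from it. When saving to a different location, it may be safer to create the folder first. The sketch below assumes that saving fails if the target folder is missing. It uses a plain matplotlib figure and a temporary directory as stand-ins for the ``ClusterPlot`` object and the tutorial's output folder.

```python
import tempfile
from pathlib import Path

import matplotlib

matplotlib.use("Agg")  # non-interactive backend, so the sketch runs anywhere
import matplotlib.pyplot as plt

# A temporary directory stands in for the tutorial's output folder
base_dir = Path(tempfile.mkdtemp())
target = base_dir / "output" / "cluster" / "scatter_3d_02.png"

# Create all missing parent directories; no error if they already exist
target.parent.mkdir(parents=True, exist_ok=True)

# Stand-in figure (not a ClusterPlot) saved to the prepared location
fig, ax = plt.subplots()
ax.scatter([0, 1, 2], [2, 0, 1])
fig.savefig(target)
plt.close(fig)
```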

### 2D cuts

If only two columns are given, several cuts will be presented (up to 16 by default):

In [None]:
cp.scatter(['CVL_bctaunutau', 'CSL_bctaunutau'])
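
How ``ClusterPlot`` chooses the cuts is not described in this tutorial. Purely as an illustration of the idea, the following sketch shows one possible way to pick a limited number of roughly evenly spaced values from the remaining column. The function ``pick_cuts`` and its ``max_cuts`` parameter are hypothetical and not part of clusterking.

```python
import numpy as np


def pick_cuts(values, max_cuts=16):
    """Pick at most ``max_cuts`` roughly evenly spaced distinct values."""
    distinct = np.unique(values)  # sorted distinct values
    if len(distinct) <= max_cuts:
        return distinct
    # Evenly spaced positions, including the first and the last value
    positions = np.linspace(0, len(distinct) - 1, max_cuts).round().astype(int)
    return distinct[positions]


# Made-up CT values on a regular grid
ct_values = np.linspace(-1.0, 1.0, 21)
cuts = pick_cuts(ct_values, max_cuts=3)
print(cuts)
```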

Again, we can limit ourselves to the clusters that we want to display:

In [None]:
cp.scatter(['CVL_bctaunutau', 'CSL_bctaunutau'], clusters=[1, 2])

If many Wilson coefficient points are available, it is better to switch to a 'fill' plot:

In [None]:
cp.fill(['CVL_bctaunutau', 'CSL_bctaunutau'])

### More configuration

Several options to configure the ClusterPlot object can be changed after the object has been initialized.

The maximum number of subplots for the 'slices' is given by

In [None]:
cp.max_subplots

Let's change that (note that no warning is issued when setting a non-existent property, so be careful with your typing):

In [None]:
cp.max_subplots = 3

And try it out:

In [None]:
cp.scatter(['CVL_bctaunutau', 'CSL_bctaunutau'])

In [None]:
cp.fill(['CVL_bctaunutau', 'CSL_bctaunutau'])

In [None]:
cp.savefig("output/cluster/fill_2d.png")

For a full list of options, consult the ``Attribute`` section of the help:

In [None]:
help(cp)
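
As noted earlier, setting a mistyped attribute name raises no warning. The following sketch shows why this happens in plain Python and one way to guard against it. The class ``PlotOptions`` and the helper ``set_existing`` are hypothetical stand-ins written for this example; they are not part of clusterking.

```python
class PlotOptions:
    """Hypothetical stand-in for an object with configurable attributes."""

    def __init__(self):
        self.max_subplots = 16
        self.draw_legend = True


def set_existing(obj, name, value):
    """Set ``obj.name`` only if the attribute already exists."""
    if not hasattr(obj, name):
        raise AttributeError(
            "{} has no attribute {!r}".format(type(obj).__name__, name)
        )
    setattr(obj, name, value)


options = PlotOptions()

# The guarded setter rejects the mistyped name
try:
    set_existing(options, "max_subplot", 3)
    typo_caught = False
except AttributeError:
    typo_caught = True

# Plain assignment with the same typo silently creates a new attribute
options.max_subplot = 3
value_after_typo = options.max_subplots  # the real option is unchanged

# With the correct name the guarded setter works as expected
set_existing(options, "max_subplots", 3)
```

The same ``hasattr`` check can be applied to ``cp`` before assigning, using the attribute names listed in the help output.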