# Plot clusters

In [1]:
import pandas as pd
import matplotlib.pyplot as plt
from mpl_toolkits.mplot3d import Axes3D

%matplotlib inline

import sys
sys.path.append('..')

from modules.plots import ClusterPlot

In [2]:
df = pd.read_csv("../output/cluster/long_g10_n30_d05_data.csv")

## Manual Plotting

In [3]:
clusters = list(df['cluster'].unique())
colors = ["red", "green", "blue", "pink"]
markers = ["o", "v", "^"]
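
The plotting cells in this section index both lists with `cluster % len(...)`. With four colors and three markers, that gives 12 distinct color/marker pairs before a combination repeats (the least common multiple of 4 and 3). The sketch below only enumerates the pairs; it assumes integer cluster labels starting at 0, and `style_for` is a hypothetical helper, not part of the notebook's modules.

```python
colors = ["red", "green", "blue", "pink"]
markers = ["o", "v", "^"]


def style_for(cluster):
    """Return the (color, marker) pair for an integer cluster label."""
    return colors[cluster % len(colors)], markers[cluster % len(markers)]


# Styles for the (assumed) cluster labels 0..12
styles = [style_for(c) for c in range(13)]

# Number of distinct styles among clusters 0..11
n_distinct = len(set(styles[:12]))
print(n_distinct)

# Cluster 12 wraps around in both lists and reuses the style of cluster 0
print(styles[12] == styles[0])
```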

### Manual 3d Plot

In [4]:
# Figure.gca() no longer accepts a projection in recent matplotlib versions
ax = plt.figure().add_subplot(projection='3d')
ax.set_xlabel('l')
ax.set_ylabel('sl')
ax.set_zlabel('t')

# One scatter call per cluster, so that each cluster gets its own legend entry
for cluster in clusters:
    df_cluster = df[df['cluster'] == cluster]
    ax.scatter(
        df_cluster['l'],
        df_cluster['sl'],
        df_cluster['t'],
        color=colors[cluster % len(colors)],
        marker=markers[cluster % len(markers)],
        label=cluster
    )

plt.legend(loc='upper left')
plt.show()

### Manual 2d Plot

In [5]:
fig, ax = plt.subplots()
ax.set_xlabel('l')
ax.set_ylabel('sl')

# Fix the remaining Wilson coefficient (t) at one of its sampled values
t_value_index = 5
t_value = df['t'].unique()[t_value_index]

# Plot only the points of each cluster that lie in the chosen t slice
for cluster in clusters:
    df_cluster = df[df['cluster'] == cluster]
    df_cluster = df_cluster[df_cluster['t'] == t_value]
    ax.scatter(
        df_cluster['l'],
        df_cluster['sl'],
        color=colors[cluster % len(colors)],
        marker=markers[cluster % len(markers)],
        label=cluster
    )

plt.legend(bbox_to_anchor=(1.2, 1.0))
plt.show()

## Using ``ClusterPlot``

Set up the plotter:

In [6]:
cp = ClusterPlot(df)

### 3D plots

Scatter plot: the list passed to ``scatter`` names the columns to put on the axes.
Changing the order of the columns changes which column goes on which axis, which turns the cube around.

In [7]:
cp.scatter(['l', 'sl', 't'])

In [8]:
cp.scatter(['sl', 'l', 't'])

If it is still hard to get an overview, use the ``clusters`` argument to show only certain clusters.

In [9]:
cp.scatter(['sl', 'l', 't'], clusters=[1, 2, 6])

### 2D cuts

If only two columns are given, several cuts are shown (up to 16 by default):

In [10]:
cp.scatter(['sl', 'l'])
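
How `ClusterPlot` chooses which cuts to show is not described in this notebook. One plausible approach, given purely as an illustration, is to take evenly spaced entries from the sorted unique values of the remaining column. The helper `pick_slices` and the sample values below are assumptions, not part of `modules.plots`.

```python
def pick_slices(values, max_subplots=16):
    """Pick at most max_subplots evenly spaced entries from the sorted unique values.

    Hypothetical helper for illustration only; not part of ClusterPlot.
    """
    unique = sorted(set(values))
    if len(unique) <= max_subplots:
        # Few enough values: show a cut for every one of them
        return unique
    if max_subplots == 1:
        return [unique[0]]
    # Spread the selected indices evenly from the first to the last value
    step = (len(unique) - 1) / (max_subplots - 1)
    return [unique[round(i * step)] for i in range(max_subplots)]


# Made-up values for the column that is held fixed in each cut
t_values = [i / 10 for i in range(30)]

print(pick_slices(t_values, max_subplots=3))
print(len(pick_slices(t_values)))
```

Always including the first and last value keeps the edges of the sampled range visible, which is usually what one wants from an overview.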

Again, we can restrict the plot to the clusters that we want to display:

In [11]:
cp.scatter(['sl', 'l'], clusters=[1, 2])

If many benchmark points are available, it is better to switch to a 'fill' plot:

In [12]:
cp.fill(['sl', 'l'])

### More configuration

Several options of the ``ClusterPlot`` object can be changed after it has been initialized. For example, the default list of colors is given by

In [13]:
cp.colors

And the maximum number of subplots used for the 'slices' by

In [14]:
cp.max_subplots

Let's change that (note that no warning is issued when trying to set a non-existing property, so do be careful with your typing):

In [15]:
cp.max_subplots = 3
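
Plain Python objects accept assignment to any attribute name, which is why a typo goes unnoticed. A minimal sketch of a guard follows, using a stand-in class; `DummyPlot` and `set_option` are hypothetical and not part of `modules.plots`, and the value 16 only mirrors the default mentioned above.

```python
class DummyPlot:
    """Stand-in for a plotter object with one configurable attribute."""

    def __init__(self):
        self.max_subplots = 16


def set_option(obj, name, value):
    """Set an existing attribute; raise instead of silently creating a new one."""
    if not hasattr(obj, name):
        raise AttributeError(f"{type(obj).__name__} has no option {name!r}")
    setattr(obj, name, value)


plot = DummyPlot()

# Typo: Python silently creates a new attribute, the real option is unchanged
plot.max_subplot = 3
print(plot.max_subplots)

# Correct name: the option is updated
set_option(plot, "max_subplots", 3)
print(plot.max_subplots)

# Misspelled name: the guard raises instead of creating a new attribute
try:
    set_option(plot, "max_subplot_count", 3)
except AttributeError as err:
    print(err)
```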

And try it out:

In [16]:
cp.scatter(['sl', 'l'])

All options are listed in the ``Attribute`` section of the help:

In [17]:
help(cp)