# Some context on what's being developed
### Remarks:
 - A minimal setup should be implemented first
 - The design should leave room for a more extensive mini-language approach in the future

### Possible magics specification:
 - First, load the extension with an existing line magic: `%reload_ext datalintermagics`
 - **Data setup** (line magic): only one dataset is supported for now; the magic registers it together with the arguments needed to pass it to the linter, e.g. `header` and `delimiter`
   ```
   %add_linter_data --tracked-variable my_data_var --data-delim=',' --data-header=True
   ```
 - **Lint command** (cell magic): sends the cell's code and the tracked variable to the linter, then displays the reply
   ```
   %%lint_cell --ip 0.0.0.0 --port 10000 --show-na=True --show-stats=True --show-passing=True
   ```
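
The argument handling for the data-setup magic can be sketched with the standard library alone. This is a minimal sketch, not the extension's actual code: the function name `parse_add_linter_data` and the default values are assumptions, and the real extension may rely on IPython's own argument helpers instead. Only the three flag names come from the specification above.

```python
import argparse
import shlex


def parse_add_linter_data(line):
    """Parse the argument line of a hypothetical %add_linter_data magic."""
    parser = argparse.ArgumentParser(prog="%add_linter_data")
    parser.add_argument("--tracked-variable", required=True)
    # Defaults below are assumptions made for this sketch.
    parser.add_argument("--data-delim", default=",")
    parser.add_argument("--data-header", default="True")

    # shlex.split strips the shell-style quotes around 'True' and ','.
    args = parser.parse_args(shlex.split(line))

    # Flags arrive as text, so convert 'True'/'False' explicitly.
    args.data_header = args.data_header.strip().lower() == "true"
    return args


args = parse_add_linter_data(
    "--tracked-variable iris_df --data-header 'True' --data-delim ','"
)
print(args.tracked_variable, args.data_header, repr(args.data_delim))
```

Both spellings used in this notebook, `--data-delim ','` and `--data-delim=','`, parse to the same result with this approach.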
   
### Shortlist of TODOs concerning magics:
 - track more than one variable (and define what that means for the linter)
 - add support for linting individual lines

In [1]:
%reload_ext datalintermagics
b = 2
%add_linter_data --tracked-variable b --data-header 'True' --data-delim ','

In [2]:
import matplotlib.pyplot as plt

# Though the following import is not directly being used, it is required
# for 3D projection to work with matplotlib < 3.2
import mpl_toolkits.mplot3d  # noqa: F401
import numpy as np
import pandas as pd
from sklearn import datasets
from sklearn.cluster import KMeans
np.random.seed(5)

In [3]:
iris = datasets.load_iris()
iris_df = pd.DataFrame(iris.data, columns=iris.feature_names)
X = iris.data
y = iris.target

%add_linter_data --tracked-variable iris_df --data-header 'True' --data-delim ','
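
The request dump at the end of this notebook shows the tracked data travelling as CSV text under `linter_input` → `context` → `data`, with the row index in an unnamed first column. The following is a rough sketch of how such a payload could be produced, using only the standard library. The helper names `rows_to_csv` and `build_payload` are hypothetical, and any other fields the real request may carry are left out because the dump is truncated.

```python
import csv
import io
import json


def rows_to_csv(columns, rows, delimiter=",", header=True):
    """Serialize rows to CSV text with a leading index column."""
    buffer = io.StringIO()
    writer = csv.writer(buffer, delimiter=delimiter, lineterminator="\n")
    if header:
        # The index column has an empty name, as in the dump.
        writer.writerow([""] + list(columns))
    for index, row in enumerate(rows):
        writer.writerow([index] + list(row))
    return buffer.getvalue()


def build_payload(csv_text):
    """Wrap CSV text in the nesting visible in the dump (assumed incomplete)."""
    return json.dumps({"linter_input": {"context": {"data": csv_text}}})


columns = [
    "sepal length (cm)",
    "sepal width (cm)",
    "petal length (cm)",
    "petal width (cm)",
]
# First two rows of the iris data, as they appear in the dump.
rows = [(5.1, 3.5, 1.4, 0.2), (4.9, 3.0, 1.4, 0.2)]

csv_text = rows_to_csv(columns, rows)
payload = build_payload(csv_text)
print(payload)
```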

# Now Lint!

In [4]:
%%lint --ip 0.0.0.0 --port 10000 --show-stats True --show-na False --show-passing False
#some comment

Response: 200, OK
Linter output
-------------
 • experimental 	(imbalanced_target_variable)	dataset              Imbalanced target column in 'dataset'
• experimental 	(R_glmmTMB_target_variable)	dataset              Imbalanced dependent variable (glmmTMB)
2 issues found from 18 linters applied (8 OK, 10 N/A) .



In [8]:
%%lint --ip 0.0.0.0 --port 10000 --show-stats True --show-na False --show-passing False
estimators = [
    ("k_means_iris_8", KMeans(n_clusters=8)),
    ("k_means_iris_3", KMeans(n_clusters=3)),
    ("k_means_iris_bad_init", KMeans(n_clusters=3, n_init=1, init="random")),
]

fig = plt.figure(figsize=(10, 8))
titles = ["8 clusters", "3 clusters", "3 clusters, bad initialization"]
for idx, ((name, est), title) in enumerate(zip(estimators, titles)):
    ax = fig.add_subplot(2, 2, idx + 1, projection="3d", elev=48, azim=134)
    est.fit(X)
    labels = est.labels_

    ax.scatter(X[:, 3], X[:, 0], X[:, 2], c=labels.astype(float), edgecolor="k")

    ax.xaxis.set_ticklabels([])
    ax.yaxis.set_ticklabels([])
    ax.zaxis.set_ticklabels([])
    ax.set_xlabel("Petal width")
    ax.set_ylabel("Sepal length")
    ax.set_zlabel("Petal length")
    ax.set_title(title)

# Plot the ground truth
ax = fig.add_subplot(2, 2, 4, projection="3d", elev=48, azim=134)

for name, label in [("Setosa", 0), ("Versicolour", 1), ("Virginica", 2)]:
    ax.text3D(
        X[y == label, 3].mean(),
        X[y == label, 0].mean(),
        X[y == label, 2].mean() + 2,
        name,
        horizontalalignment="center",
        bbox=dict(alpha=0.2, edgecolor="w", facecolor="w"),
    )

ax.scatter(X[:, 3], X[:, 0], X[:, 2], c=y, edgecolor="k")

ax.xaxis.set_ticklabels([])
ax.yaxis.set_ticklabels([])
ax.zaxis.set_ticklabels([])
ax.set_xlabel("Petal width")
ax.set_ylabel("Sepal length")
ax.set_zlabel("Petal length")
ax.set_title("Ground Truth")

plt.subplots_adjust(wspace=0.25, hspace=0.25)
plt.show()

{"linter_input": {"context": {"data": ",sepal length (cm),sepal width (cm),petal length (cm),petal width (cm)\n0,5.1,3.5,1.4,0.2\n1,4.9,3.0,1.4,0.2\n2,4.7,3.2,1.3,0.2\n3,4.6,3.1,1.5,0.2\n4,5.0,3.6,1.4,0.2\n5,5.4,3.9,1.7,0.4\n6,4.6,3.4,1.4,0.3\n7,5.0,3.4,1.5,0.2\n8,4.4,2.9,1.4,0.2\n9,4.9,3.1,1.5,0.1\n10,5.4,3.7,1.5,0.2\n11,4.8,3.4,1.6,0.2\n12,4.8,3.0,1.4,0.1\n13,4.3,3.0,1.1,0.1\n14,5.8,4.0,1.2,0.2\n15,5.7,4.4,1.5,0.4\n16,5.4,3.9,1.3,0.4\n17,5.1,3.5,1.4,0.3\n18,5.7,3.8,1.7,0.3\n19,5.1,3.8,1.5,0.3\n20,5.4,3.4,1.7,0.2\n21,5.1,3.7,1.5,0.4\n22,4.6,3.6,1.0,0.2\n23,5.1,3.3,1.7,0.5\n24,4.8,3.4,1.9,0.2\n25,5.0,3.0,1.6,0.2\n26,5.0,3.4,1.6,0.4\n27,5.2,3.5,1.5,0.2\n28,5.2,3.4,1.4,0.2\n29,4.7,3.2,1.6,0.2\n30,4.8,3.1,1.6,0.2\n31,5.4,3.4,1.5,0.4\n32,5.2,4.1,1.5,0.1\n33,5.5,4.2,1.4,0.2\n34,4.9,3.1,1.5,0.2\n35,5.0,3.2,1.2,0.2\n36,5.5,3.5,1.3,0.2\n37,4.9,3.6,1.4,0.1\n38,4.4,3.0,1.3,0.2\n39,5.1,3.4,1.5,0.2\n40,5.0,3.5,1.3,0.3\n41,4.5,2.3,1.3,0.3\n42,4.4,3.2,1.3,0.2\n43,5.0,3.5,1.6,0.6\n44,5.1,3.8,1.9,0.4\