# Ward hierarchical clustering for H&E image segmentation

We applied this algorithm to one image of the PDAC dataset (B2) in order to segment nuclei.

As a reminder, we ultimately chose to perform the segmentation with Visiopharm.

https://scikit-learn.org/0.15/auto_examples/cluster/plot_lena_ward_segmentation.html

https://www.thepythoncode.com/article/kmeans-for-image-segmentation-opencv-python

# 0. Preliminary steps

## 0.1 Environment

conda env create -f env_cellpose.yml --name cellpose

conda activate cellpose

pip install ipykernel

python -m ipykernel install --user --name=cellpose --display-name=Cellpose

pip install imagecodecs

pip install opencv-python

## 0.1.bis My env

In [None]:
conda env create -f env_ST.yml --name STenv

## 0.2. Load packages

In [1]:
import os
import cv2
import time
import numpy as np
import scipy as sp
import matplotlib.pyplot as plt
from sklearn.feature_extraction.image import grid_to_graph
from sklearn.cluster import AgglomerativeClustering

## 0.3. Output files

In [2]:
# Result folder
output_files = "/sbgenomics/output-files/data/segmentation_ward"
os.makedirs(output_files, exist_ok=True)

# Workspace folder
output_workspace = "./ward/segmented_images"
os.makedirs(output_workspace, exist_ok=True)

## 0.4. Function definition

# 1. Image preprocessing

## 1.1. Import image

In [2]:
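# Load as a single-channel grayscale image: the clustering and the contour plot expect a 2-D array
image = cv2.imread("hne_normalized_wo_background/PDAC_B2_normalized_wo_background/PDAC_B2_result_normalized_wo_background.jpg", cv2.IMREAD_GRAYSCALE)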

In [None]:
plt.imshow(image, cmap="gray")
plt.show()
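
Structured Ward clustering processes every pixel, so run time and memory grow quickly with image size. The block below is a minimal sketch of one way to shrink a 2-D grayscale image before clustering, by averaging non-overlapping blocks. The helper name `block_mean_downsample` and the `factor` parameter are hypothetical and are not part of the original notebook.

```python
import numpy as np


def block_mean_downsample(gray, factor):
    """Shrink a 2-D image by averaging non-overlapping factor x factor blocks.

    Hypothetical helper, for illustration only. Rows and columns that do not
    fill a complete block are cropped.
    """
    h, w = gray.shape
    h_crop, w_crop = h - h % factor, w - w % factor
    cropped = gray[:h_crop, :w_crop].astype(float)
    # Split each axis into (number of blocks, block size), then average
    # over the two block-size axes.
    blocks = cropped.reshape(h_crop // factor, factor, w_crop // factor, factor)
    return blocks.mean(axis=(1, 3))


# Toy check on a 6 x 6 ramp image
toy = np.arange(36, dtype=float).reshape(6, 6)
small = block_mean_downsample(toy, 2)
print(small.shape)
```

On the real image, this would be applied as `image = block_mean_downsample(image, factor)` before building `X`, assuming `image` is 2-D. `cv2.resize` with `interpolation=cv2.INTER_AREA` is an alternative.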

In [3]:
# Flatten the image into a column vector: one sample per value, one feature (intensity)
X = np.reshape(image, (-1, 1))

In [None]:
# Grid connectivity graph: restricts merges to spatially adjacent samples
connectivity = grid_to_graph(*image.shape)
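
To make the connectivity constraint concrete, here is a small sketch that builds the same kind of graph for a toy 2 x 3 grid and inspects its adjacency matrix. The names `toy_connectivity` and `adjacency` are illustrative only.

```python
import numpy as np
from sklearn.feature_extraction.image import grid_to_graph

# Toy 2 x 3 grid; pixels are numbered row by row:
#   0 1 2
#   3 4 5
toy_connectivity = grid_to_graph(2, 3)

# Dense view of the sparse graph: entry (i, j) is non-zero when
# pixels i and j are horizontal or vertical neighbours.
adjacency = toy_connectivity.toarray()
print(toy_connectivity.shape)
print(adjacency)
```

Pixel 0 is linked to pixels 1 and 3 but not to the diagonal pixel 4, so Ward can only merge clusters that touch along a row or a column.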

In [None]:
# Compute clustering
print("Compute structured hierarchical clustering...")
st = time.time()
n_clusters = 3  # number of regions
ward = AgglomerativeClustering(n_clusters=n_clusters,
        linkage='ward', connectivity=connectivity).fit(X)
label = np.reshape(ward.labels_, image.shape)
print("Elapsed time: ", time.time() - st)
print("Number of pixels: ", label.size)
print("Number of clusters: ", np.unique(label).size)
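
The clustering returns one label per pixel but does not say which cluster contains the nuclei. The sketch below shows one possible way to get a nucleus candidate mask, under the assumption (not stated in the original notebook) that nuclei form the darkest cluster in a grayscale H&E image. The helper `darkest_cluster_mask` is hypothetical, and it is run here on a synthetic image rather than on PDAC B2.

```python
import numpy as np
from sklearn.cluster import AgglomerativeClustering
from sklearn.feature_extraction.image import grid_to_graph


def darkest_cluster_mask(gray, n_clusters=3):
    """Run structured Ward clustering on a 2-D grayscale image.

    Returns (labels, mask), where mask is True for the cluster with the
    lowest mean intensity. Hypothetical helper, for illustration only.
    """
    X = gray.reshape(-1, 1).astype(float)
    connectivity = grid_to_graph(*gray.shape)
    ward = AgglomerativeClustering(
        n_clusters=n_clusters, linkage="ward", connectivity=connectivity
    ).fit(X)
    labels = ward.labels_.reshape(gray.shape)
    # Assumption: the darkest cluster corresponds to hematoxylin-stained nuclei.
    means = [gray[labels == k].mean() for k in range(n_clusters)]
    darkest = int(np.argmin(means))
    return labels, labels == darkest


# Synthetic 20 x 20 image: bright background, one dark square, one mid-grey square
rng = np.random.default_rng(0)
toy = np.full((20, 20), 200.0)
toy[5:10, 5:10] = 40.0    # dark "nucleus"
toy[12:18, 2:8] = 120.0   # mid-intensity region
toy += rng.normal(0, 2, toy.shape)

labels, nucleus_mask = darkest_cluster_mask(toy, n_clusters=3)
print("Pixels in darkest cluster:", nucleus_mask.sum())
```

On real data, the mask could be saved to `output_workspace`, for instance with `cv2.imwrite` after converting it to `uint8`. Whether the darkest cluster really matches nuclei would need to be checked visually.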

In [None]:
# Plot the results on an image
plt.figure(figsize=(5, 5))
plt.imshow(image, cmap=plt.cm.gray)
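for cluster_id in range(n_clusters):
    # Outline each cluster in its own colour (plt.cm.spectral was removed from Matplotlib)
    plt.contour(label == cluster_id,
                colors=[plt.cm.nipy_spectral(cluster_id / float(n_clusters)), ])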
plt.xticks(())
plt.yticks(())
plt.show()