Reproduction and Analysis of rbc_wbc_unet.ipynb
Dataset and Source

The notebook rbc_wbc_unet.ipynb reproduces the blood cell segmentation approach from a publicly available GitHub repository:

Source Repository: https://github.com/Dominosam/blood-cells-segmentation

The dataset consists of blood smear images with corresponding segmentation masks that assign each pixel to a class such as red blood cells (RBCs), white blood cells (WBCs), or background. The segmentation task is designed to support downstream analysis, including cell localization and morphological assessment.

Preprocessing and Data Preparation

To accommodate local execution constraints, image assets were batch-converted to JPEG format to reduce storage requirements. All images were then resized to a fixed resolution of 256 × 256 pixels and normalized to the [0, 1] range before being passed to the model.

These preprocessing steps align with standard practices for convolutional segmentation models and ensure consistent spatial dimensions across training and inference.
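
The resize and normalization steps can be sketched as follows. This is a minimal, dependency-light illustration and not the notebook's actual code: the function names resize_nearest and preprocess_image are hypothetical, the choice of nearest-neighbor interpolation is an assumption, and a real pipeline would more likely call an image library's resize routine.

```python
import numpy as np

TARGET_SIZE = (256, 256)  # (height, width), as stated in the text


def resize_nearest(image: np.ndarray, size=TARGET_SIZE) -> np.ndarray:
    """Nearest-neighbor resize of an (H, W) or (H, W, C) array.

    Assumption: nearest-neighbor is used here only to keep the sketch
    short; the interpolation used in the notebook is not stated.
    """
    out_h, out_w = size
    in_h, in_w = image.shape[:2]
    # Map each output row/column back to a source row/column index.
    rows = np.arange(out_h) * in_h // out_h
    cols = np.arange(out_w) * in_w // out_w
    return image[rows[:, None], cols]


def preprocess_image(image_u8: np.ndarray) -> np.ndarray:
    """Resize an 8-bit image to 256 x 256 and scale it to float32 in [0, 1]."""
    resized = resize_nearest(image_u8)
    return resized.astype(np.float32) / 255.0


# Usage with a synthetic image standing in for a blood smear photo.
rng = np.random.default_rng(0)
fake_image = rng.integers(0, 256, size=(480, 640, 3), dtype=np.uint8)
model_input = preprocess_image(fake_image)
print(model_input.shape, model_input.dtype)
```

Nearest-neighbor resizing has the side benefit of never inventing new values, which matters if the same helper is applied to label masks.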

Model Architecture

The model implemented in this notebook follows a classic UNet encoder–decoder architecture. The network consists of:

A downsampling path composed of stacked convolutional layers with max pooling to progressively reduce spatial resolution while increasing feature depth

A bottleneck layer with increased channel capacity and dropout for regularization

An upsampling path using nearest-neighbor upsampling combined with skip connections to corresponding encoder layers

A final 1×1 convolution with softmax activation to produce pixel-wise class probabilities

The skip connections pass high-resolution encoder features directly to the decoder, preserving spatial detail that pooling would otherwise discard and enabling fine-grained segmentation of cellular structures.
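
To make the data flow concrete, the sketch below traces tensor shapes through a two-level version of the path described above, using plain NumPy. It is an illustration under assumptions, not the notebook's model: the channel widths (16, 32, 64), the depth, and the three output classes are hypothetical; learned 3×3 convolutions are replaced by random 1×1 projections; and dropout is omitted because it is inactive at inference time.

```python
import numpy as np


def max_pool_2x2(x: np.ndarray) -> np.ndarray:
    """2x2 max pooling on an (H, W, C) feature map with even H and W."""
    h, w, c = x.shape
    return x.reshape(h // 2, 2, w // 2, 2, c).max(axis=(1, 3))


def upsample_nearest_2x(x: np.ndarray) -> np.ndarray:
    """Nearest-neighbor upsampling: repeat each pixel twice along H and W."""
    return x.repeat(2, axis=0).repeat(2, axis=1)


def pointwise_conv(x: np.ndarray, out_channels: int, rng) -> np.ndarray:
    """1x1 convolution with random weights (stand-in for learned layers)."""
    in_channels = x.shape[-1]
    weights = rng.normal(size=(in_channels, out_channels)) / np.sqrt(in_channels)
    return x @ weights


def relu(x: np.ndarray) -> np.ndarray:
    return np.maximum(x, 0.0)


def softmax(logits: np.ndarray) -> np.ndarray:
    """Softmax over the channel axis: per-pixel class probabilities."""
    shifted = logits - logits.max(axis=-1, keepdims=True)
    exp = np.exp(shifted)
    return exp / exp.sum(axis=-1, keepdims=True)


def unet_shape_walkthrough(image: np.ndarray, num_classes: int = 3, seed: int = 0):
    """Trace a 256x256 input through a toy encoder, bottleneck, and decoder."""
    rng = np.random.default_rng(seed)

    # Downsampling path: resolution halves, feature depth grows.
    enc1 = relu(pointwise_conv(image, 16, rng))                     # 256x256x16
    enc2 = relu(pointwise_conv(max_pool_2x2(enc1), 32, rng))        # 128x128x32
    bottleneck = relu(pointwise_conv(max_pool_2x2(enc2), 64, rng))  # 64x64x64

    # Upsampling path: upsample, then concatenate the matching encoder output.
    up2 = np.concatenate([upsample_nearest_2x(bottleneck), enc2], axis=-1)  # 128x128x96
    dec2 = relu(pointwise_conv(up2, 32, rng))                               # 128x128x32
    up1 = np.concatenate([upsample_nearest_2x(dec2), enc1], axis=-1)        # 256x256x48
    dec1 = relu(pointwise_conv(up1, 16, rng))                               # 256x256x16

    # Final 1x1 convolution followed by softmax over classes.
    return softmax(pointwise_conv(dec1, num_classes, rng))


probabilities = unet_shape_walkthrough(np.random.default_rng(1).random((256, 256, 3)))
print(probabilities.shape)
```

The concatenation steps are where the skip connections enter: each decoder stage sees both the upsampled coarse features and the encoder features at the same resolution.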

Execution and Qualitative Evaluation

The model executed successfully end-to-end, producing segmentation outputs without runtime errors. The original implementation does not include quantitative evaluation metrics such as pixel accuracy, Dice coefficient, or Intersection over Union (IoU).
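
For reference, the three metrics named above could be computed as in the following sketch. It assumes that ground-truth and predicted masks are integer label maps of equal shape (for example, the argmax of the softmax output); the function names and the class ids in the usage example are hypothetical.

```python
import numpy as np


def pixel_accuracy(y_true: np.ndarray, y_pred: np.ndarray) -> float:
    """Fraction of pixels whose predicted label matches the ground truth."""
    return float(np.mean(y_true == y_pred))


def dice_and_iou(y_true: np.ndarray, y_pred: np.ndarray, class_id: int):
    """Dice coefficient and IoU for one class, treated as a binary mask.

    Returns (nan, nan) when the class is absent from both masks, since
    neither metric is defined in that case.
    """
    true_c = y_true == class_id
    pred_c = y_pred == class_id
    intersection = int(np.logical_and(true_c, pred_c).sum())
    total = int(true_c.sum()) + int(pred_c.sum())
    if total == 0:
        return float("nan"), float("nan")
    union = total - intersection
    return 2.0 * intersection / total, intersection / union


# Tiny example: label 0 is assumed to be background, label 1 a cell class.
truth = np.array([[0, 0], [1, 1]])
prediction = np.array([[0, 1], [1, 1]])
print(pixel_accuracy(truth, prediction))   # 0.75
print(dice_and_iou(truth, prediction, 1))  # Dice 0.8, IoU 2/3
```

Because the original implementation reports none of these metrics, no such numbers are available for this reproduction.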

As a result, performance assessment was conducted qualitatively by overlaying predicted segmentation masks onto the original images using external visualization tools. This visual inspection provided insight into the strengths and failure modes of the model.
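
The overlays for this reproduction were produced with external tools, but the same kind of view can be generated programmatically. The sketch below alpha-blends one color per class onto the image; the helper name overlay_mask, the class ids, the colors, and the blending weight are all assumptions chosen for illustration.

```python
import numpy as np

# Hypothetical mapping from class id to RGB color; the real label ids
# used by the dataset are not stated in the text.
CLASS_COLORS = {
    1: (1.0, 0.0, 0.0),  # assumed: first cell class drawn in red
    2: (0.0, 0.0, 1.0),  # assumed: second cell class drawn in blue
}


def overlay_mask(image: np.ndarray, labels: np.ndarray,
                 alpha: float = 0.4, colors=CLASS_COLORS) -> np.ndarray:
    """Alpha-blend class colors onto an (H, W, 3) image with values in [0, 1].

    Pixels whose label has no entry in `colors` (e.g. background) are
    left unchanged. The input image is not modified.
    """
    blended = image.astype(np.float32)  # astype returns a new array
    for class_id, color in colors.items():
        region = labels == class_id
        tint = np.asarray(color, dtype=np.float32)
        blended[region] = (1.0 - alpha) * blended[region] + alpha * tint
    return blended


# Usage with a uniform gray image and a small synthetic prediction.
gray = np.full((4, 4, 3), 0.5, dtype=np.float32)
predicted = np.zeros((4, 4), dtype=np.int64)
predicted[1:3, 1:3] = 1
print(overlay_mask(gray, predicted)[1, 1])
```

A moderate alpha keeps the underlying cell texture visible, which makes gaps and boundary errors in the prediction easier to spot.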

Observations and Limitations

On visual inspection, the model performs well on cells with relatively round and well-defined morphology. Nuclear regions, in particular, are segmented consistently and with sharp boundaries.

However, segmentation quality degrades for irregular or amoeboid cell shapes. In these cases, the model often produces incomplete or fragmented cytoplasmic regions. In a recurring failure pattern, the predicted mask carves gaps through the cytoplasm, which suggests difficulty maintaining contiguous segmentation in areas with low contrast or complex morphology.
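
One rough way to put a number on this fragmentation, offered here as a suggestion and not something done in the notebook, is to count the connected regions in the predicted mask for a single cell: an intact cell should yield one region, while a mask split by gaps yields several. The sketch below assumes a 2D boolean mask and 4-connectivity; the function name count_components is hypothetical.

```python
from collections import deque

import numpy as np


def count_components(binary_mask: np.ndarray) -> int:
    """Count 4-connected foreground regions in a 2D boolean mask."""
    mask = np.asarray(binary_mask, dtype=bool)
    visited = np.zeros_like(mask, dtype=bool)
    height, width = mask.shape
    components = 0
    for r in range(height):
        for c in range(width):
            if not mask[r, c] or visited[r, c]:
                continue
            # Unvisited foreground pixel: start a new region and flood it.
            components += 1
            visited[r, c] = True
            queue = deque([(r, c)])
            while queue:
                y, x = queue.popleft()
                for ny, nx in ((y - 1, x), (y + 1, x), (y, x - 1), (y, x + 1)):
                    inside = 0 <= ny < height and 0 <= nx < width
                    if inside and mask[ny, nx] and not visited[ny, nx]:
                        visited[ny, nx] = True
                        queue.append((ny, nx))
    return components


# An intact region versus the same region with a gap carved through it.
intact = np.ones((3, 5), dtype=bool)
split = intact.copy()
split[:, 2] = False
print(count_components(intact), count_components(split))  # 1 2
```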

These behaviors suggest that the model captures nuclear features more robustly than cytoplasmic structure, likely because of both dataset composition and an architectural bias toward high-contrast boundaries.

Relevance to the Capstone Project

This notebook provides a valuable reference for understanding the practical strengths and limitations of UNet-based segmentation in hematologic imaging. While the approach is effective for well-defined cells, its sensitivity to irregular morphology highlights challenges that must be addressed in clinically realistic pipelines.

For the Capstone project, these findings reinforce the importance of integrating segmentation models that are resilient to morphological variability and capable of handling complex cellular contexts. The exercise also underscores the necessity of combining qualitative and quantitative evaluation when assessing segmentation models intended for downstream diagnostic use.

Summary

Under favorable conditions, the reproduced UNet model produces visually convincing segmentations and serves as a useful baseline for blood cell segmentation tasks. At the same time, its limitations on irregular cell shapes provide meaningful guidance for future architectural and dataset design choices in the Capstone project.