# Cell Segmentation & Counting with Deep U-Net
A robust, industry-standard Deep Learning pipeline for segmenting and counting cell nuclei in biomedical images. Trained on the **2018 Data Science Bowl (BBBC038)** dataset, this project leverages a custom Deep U-Net architecture to handle complex biological textures and overlapping cells.
## Features
* **Real Data Pipeline:** Automatically downloads and parses the ~85 MB BBBC038 dataset (2018 Data Science Bowl).
* **Deep U-Net Architecture:** A 5-level U-Net with Batch Normalization and He initialization for stable training on textured biomedical images.
* **Advanced Post-Processing:** A watershed algorithm with distance transform separates touching cells (crucial for accurate counting).
* **Industry-Standard Evaluation:** Bland-Altman plots and IoU distribution analysis to validate scientific accuracy.
* **Data Augmentation:** Real-time rotation, flipping, and zooming to prevent overfitting.
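The watershed post-processing mentioned above can be sketched roughly as follows. This is a minimal illustration using SciPy and scikit-image, not the project's actual code; the function name and the `min_distance` value are illustrative choices.

```python
import numpy as np
from scipy import ndimage
from skimage.feature import peak_local_max
from skimage.segmentation import watershed

def separate_and_count(binary_mask, min_distance=7):
    """Split touching cells with a distance-transform watershed; return labels and count."""
    binary_mask = binary_mask.astype(bool)
    # Distance to the nearest background pixel peaks at cell centers
    distance = ndimage.distance_transform_edt(binary_mask)
    # Local maxima of the distance map serve as watershed markers (one per cell)
    coords = peak_local_max(distance, min_distance=min_distance,
                            labels=binary_mask.astype(int))
    markers = np.zeros(binary_mask.shape, dtype=int)
    markers[tuple(coords.T)] = np.arange(1, len(coords) + 1)
    # Flood the inverted distance map from the markers, constrained to the mask
    labels = watershed(-distance, markers, mask=binary_mask)
    return labels, int(labels.max())
```

Seeding the watershed from distance-map maxima is what lets two overlapping nuclei, which form a single connected component in the binary mask, be counted as two objects.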
## Getting Started
Follow these steps to set up the project and train your own model.

First, clone the repository and navigate into the project directory:

```bash
git clone https://github.com/yourusername/cell-segmentation-unet.git
cd cell-segmentation-unet
```
Next, create a virtual environment (recommended) and install the required dependencies:

```bash
# Create a virtual environment
python -m venv venv
source venv/bin/activate  # On Windows: venv\Scripts\activate

# Install dependencies
pip install -r requirements.txt
```
## Training
To start the training pipeline, run the `train.py` script. It handles the entire workflow:
- Downloads the BBBC038 dataset.
- Preprocesses the images and merges the mask files.
- Augments the data in real time.
- Trains the Deep U-Net model.

```bash
python train.py
```

- Output: The script saves the best-performing model to `best_model.keras` and the test dataset to `.npy` files for evaluation.
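The real-time augmentation step can be illustrated with a minimal NumPy sketch (the `augment` function below is illustrative, not the project's actual implementation; zooming is omitted for brevity). The key point is that the image and its mask must receive identical transforms:

```python
import numpy as np

rng = np.random.default_rng(0)

def augment(image, mask):
    """Apply the same random flips and 90-degree rotation to an image and its mask."""
    if rng.random() < 0.5:                       # horizontal flip
        image, mask = image[:, ::-1], mask[:, ::-1]
    if rng.random() < 0.5:                       # vertical flip
        image, mask = image[::-1, :], mask[::-1, :]
    k = rng.integers(0, 4)                       # rotate by a random multiple of 90 degrees
    image, mask = np.rot90(image, k), np.rot90(mask, k)
    return image, mask
```

If the image and mask were transformed independently, the pixel-wise labels would no longer line up with the cells, which is why augmentation pipelines for segmentation always share randomness between the two.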
## Evaluation
Once training is complete, generate a comprehensive performance report with `evaluate.py`. The script loads the trained model and the test data to produce industry-standard metrics.

```bash
python evaluate.py
```

- Output: This generates `evaluation_report.png`, containing:
  - IoU Histogram: distribution of segmentation quality.
  - Bland-Altman Plot: analysis of counting bias and agreement.
  - Visual Overlays: qualitative comparison of predictions vs. ground truth.
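The statistics behind a Bland-Altman plot reduce to a few lines. This sketch (illustrative names, not the project's code) computes the counting bias and the 95% limits of agreement from per-image cell counts:

```python
import numpy as np

def bland_altman_stats(pred_counts, true_counts):
    """Counting bias (mean difference) and 95% limits of agreement."""
    diffs = np.asarray(pred_counts, dtype=float) - np.asarray(true_counts, dtype=float)
    bias = diffs.mean()
    spread = 1.96 * diffs.std(ddof=1)  # 1.96 sigma covers ~95% of differences
    return bias, bias - spread, bias + spread
```

The bias tells you whether the model systematically over- or under-counts; the limits of agreement tell you how far an individual image's count can be expected to stray from the ground truth.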
## Dataset
The project uses the 2018 Data Science Bowl (BBBC038) dataset, hosted by the Broad Institute.
- Content: Diverse microscopy images (fluorescence, histology, brightfield).
- Ground Truth: High-quality masks in which each nucleus is annotated individually.
- Preprocessing: The `RealBiologicalLoader` class merges the individual mask files into a single binary map for semantic segmentation.
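The merging step can be sketched as follows. This is a simplified stand-in for `RealBiologicalLoader`: in the dataset each nucleus ships as its own mask file, represented here as already-loaded arrays.

```python
import numpy as np

def merge_masks(nucleus_masks):
    """Union per-nucleus masks into a single binary map for semantic segmentation."""
    merged = np.zeros_like(nucleus_masks[0], dtype=np.uint8)
    for m in nucleus_masks:
        merged |= (m > 0).astype(np.uint8)  # any labeled pixel becomes foreground
    return merged
```

Note that the union discards instance identity; individual nuclei are recovered later by the watershed post-processing rather than by the network itself.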
## Model Architecture
The model is a Deep U-Net optimized for biomedical segmentation:
- Encoder: 4 downsampling blocks (Conv2D -> BatchNorm -> ReLU -> MaxPool).
- Bottleneck: 512 filters with Dropout (0.3) to capture high-level features.
- Decoder: 4 upsampling blocks with skip connections to preserve spatial resolution.
- Output: Sigmoid activation for pixel-wise probability.
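A single encoder level of the kind described above might look like this in Keras (a sketch assuming TensorFlow 2.x from the requirements; the function names and filter counts are illustrative, not the project's actual code):

```python
import tensorflow as tf
from tensorflow.keras import layers

def conv_block(x, filters):
    """Conv2D -> BatchNorm -> ReLU, applied twice, with He initialization."""
    for _ in range(2):
        x = layers.Conv2D(filters, 3, padding="same",
                          kernel_initializer="he_normal", use_bias=False)(x)
        x = layers.BatchNormalization()(x)
        x = layers.Activation("relu")(x)
    return x

def encoder_block(x, filters):
    """One downsampling step: conv block, then 2x2 max pooling; keep the skip tensor."""
    skip = conv_block(x, filters)
    return layers.MaxPooling2D(2)(skip), skip
```

The pre-pooling tensor is returned alongside the downsampled one so the decoder can concatenate it back in via the skip connections that preserve spatial resolution.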
## Evaluation Metrics
Counting accuracy is validated with Bland-Altman analysis, the standard method-comparison technique in biomedical measurement.

| Metric | Target (approx.) | Description |
|---|---|---|
| Mean IoU | ≥ 0.85 | Intersection over Union (segmentation quality) |
| Counting Bias | < 1.0 | Average difference between predicted and ground-truth counts |
| Pixel Accuracy | > 98% | Accuracy of background/foreground classification |
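For reference, the IoU metric in the table is straightforward to compute for binary masks (a minimal sketch, not the project's evaluation code):

```python
import numpy as np

def iou(pred, gt):
    """Intersection over Union between two binary masks."""
    pred, gt = pred.astype(bool), gt.astype(bool)
    union = np.logical_or(pred, gt).sum()
    if union == 0:          # both masks empty: treat as perfect agreement
        return 1.0
    return np.logical_and(pred, gt).sum() / union
```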
## Requirements
- Python 3.8+
- TensorFlow 2.x
- OpenCV
- scikit-image
- Matplotlib / Seaborn
- Pandas / NumPy
## Contributing
Contributions are welcome! Please feel free to submit a Pull Request.
- Fork the project
- Create your feature branch (`git checkout -b feature/AmazingFeature`)
- Commit your changes (`git commit -m 'Add some AmazingFeature'`)
- Push to the branch (`git push origin feature/AmazingFeature`)
- Open a Pull Request
## License
Distributed under the MIT License. See `LICENSE` for more information.