UNet-and-Hybrid-UNet-for-Semantic-Segmentation

UNet and Hybrid UNet for semantic segmentation on the CityScapes dataset, with the Dice score as the evaluation metric.

First, as in any Python script or IPython notebook, the required libraries are imported: os to handle file paths, numpy for numerical processing, pandas for data manipulation, PyTorch for building and training the model, and matplotlib & seaborn for visualizations.
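
A minimal sketch of these imports, assuming the standard package names (the notebooks may import additional helpers):

```python
# Standard imports used throughout the notebooks (assumes the packages listed
# under "Libraries used" are installed).
import os
import random

import numpy as np
import pandas as pd
import cv2
import matplotlib.pyplot as plt
import seaborn as sns
from tqdm import tqdm

import torch
import torch.nn as nn
import torchvision
```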

Next, the file paths for the dataset are collected, and directories for logging and for model weights are created.
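
A minimal sketch of that setup, with hypothetical directory names (adjust them to wherever the dataset is extracted):

```python
import os

# Hypothetical paths; point these at the extracted Kaggle archive.
DATA_DIR = "cityscapes_data"
TRAIN_DIR = os.path.join(DATA_DIR, "train")
VAL_DIR = os.path.join(DATA_DIR, "val")

LOG_DIR = "logs"        # training logs
WEIGHT_DIR = "weights"  # saved model weights
os.makedirs(LOG_DIR, exist_ok=True)
os.makedirs(WEIGHT_DIR, exist_ok=True)

train_files = sorted(os.path.join(TRAIN_DIR, f) for f in os.listdir(TRAIN_DIR))
val_files = sorted(os.path.join(VAL_DIR, f) for f in os.listdir(VAL_DIR))
```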

A class called Generator is created to transform the training data as it is loaded, based on parameters such as batch size, scaling factor, and data-augmentation options. The class has one function to augment the training data and another to generate a training batch. A few images and their masks from the training set are displayed to get a sense of what the data holds.
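
A rough sketch of what such a generator can look like; class and method names here are illustrative, and the files are assumed to be side-by-side image/mask pairs, which is how the cityscapes-image-pairs dataset is packaged:

```python
import random

import cv2
import numpy as np

class Generator:
    """Illustrative batch generator: loads image/mask pairs, optionally scales
    and augments them, and returns a numpy batch."""

    def __init__(self, file_paths, batch_size=32, scale=1.0, augment=True):
        self.file_paths = file_paths
        self.batch_size = batch_size
        self.scale = scale
        self.augment = augment

    def _augment(self, image, mask):
        # Example augmentation: random horizontal flip applied to both tensors.
        if random.random() < 0.5:
            image, mask = np.fliplr(image).copy(), np.fliplr(mask).copy()
        return image, mask

    def get_batch(self):
        paths = random.sample(self.file_paths, self.batch_size)
        images, masks = [], []
        for p in paths:
            pair = cv2.imread(p)              # left half: photo, right half: mask
            w = pair.shape[1] // 2
            image, mask = pair[:, :w], pair[:, w:]
            if self.scale != 1.0:
                image = cv2.resize(image, None, fx=self.scale, fy=self.scale)
                mask = cv2.resize(mask, None, fx=self.scale, fy=self.scale)
            if self.augment:
                image, mask = self._augment(image, mask)
            images.append(image)
            masks.append(mask)
        return np.stack(images), np.stack(masks)
```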

Next, the UNet model is defined as a class composed of smaller building-block classes, and a model instance is created from it. The Adam optimizer with a binary cross-entropy loss is used for training. A training function is defined, and the model is trained for 10 epochs with a batch size of 32 images.
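
An illustrative training loop under those settings; the function name and tensor-handling details are assumptions rather than code copied from the notebooks:

```python
import torch
import torch.nn as nn

def train(model, generator, epochs=10, steps_per_epoch=100, device=None):
    # Adam optimizer with binary cross-entropy, as described above.
    device = device or ("cuda" if torch.cuda.is_available() else "cpu")
    model = model.to(device)
    optimizer = torch.optim.Adam(model.parameters(), lr=1e-3)
    criterion = nn.BCEWithLogitsLoss()  # BCE applied to raw logits
    for epoch in range(epochs):
        running_loss = 0.0
        for _ in range(steps_per_epoch):
            images, masks = generator.get_batch()  # numpy batches in HWC layout
            images = torch.from_numpy(images).permute(0, 3, 1, 2).float().to(device) / 255.0
            masks = torch.from_numpy(masks).permute(0, 3, 1, 2).float().to(device) / 255.0
            optimizer.zero_grad()
            loss = criterion(model(images), masks)
            loss.backward()
            optimizer.step()
            running_loss += loss.item()
        print(f"epoch {epoch + 1}/{epochs}: mean loss {running_loss / steps_per_epoch:.4f}")
```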

Using the validation dataset, predictions are made with the trained model, and the results are displayed and compared with the ground-truth images.
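
A sketch of such a qualitative check, plotting input, ground truth, and prediction side by side (function name and plotting layout are illustrative):

```python
import matplotlib.pyplot as plt
import torch

def show_predictions(model, generator, n=3, device="cpu"):
    # Run the model on one validation batch and compare against the masks.
    model.eval()
    images, masks = generator.get_batch()
    with torch.no_grad():
        inputs = torch.from_numpy(images).permute(0, 3, 1, 2).float().to(device) / 255.0
        preds = torch.sigmoid(model(inputs)).cpu().permute(0, 2, 3, 1).numpy()
    fig, axes = plt.subplots(n, 3, figsize=(12, 4 * n), squeeze=False)
    for i in range(n):
        for ax, img, title in zip(axes[i],
                                  (images[i], masks[i], preds[i]),
                                  ("image", "ground truth", "prediction")):
            ax.imshow(img)
            ax.set_title(title)
            ax.axis("off")
    plt.tight_layout()
    plt.show()
```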

Libraries used:

  1. numpy
  2. os
  3. seaborn
  4. matplotlib
  5. pandas
  6. tqdm
  7. pytorch
  8. torchvision
  9. cv2
  10. random

The motivation for the project: to check whether a slight change to the structure of the UNet segmentation model improves its performance.

The files in the repository, with a short description of each:

  1. README -> the file you are reading right now
  2. UNet.ipynb -> code for creating, training and testing the UNet model on the CityScapes dataset
  3. Hybrid_UNet.ipynb -> code for creating, training and testing the Hybrid UNet model on the CityScapes dataset
  4. License -> MIT license

Download the dataset from https://www.kaggle.com/dansbecker/cityscapes-image-pairs, extract it, and update the corresponding paths in the code so that it runs successfully.

Summary of the results of the analysis: The Dice score coefficient, which measures accuracy in semantic segmentation, shows a small improvement with the Hybrid model: 0.4549 for the plain UNet versus 0.4588 for the Hybrid UNet. Compared with the benchmark (0.77), both models score low due to the small number of epochs and the very small size of the dataset.
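
For reference, the Dice coefficient is 2·|A ∩ B| / (|A| + |B|) over the predicted and ground-truth masks. A minimal PyTorch sketch of that standard definition (not necessarily the exact implementation used in the notebooks):

```python
import torch

def dice_coefficient(pred, target, eps=1e-7):
    """Dice = 2*|A ∩ B| / (|A| + |B|), computed on flattened binary masks."""
    pred = (pred > 0.5).float().flatten()
    target = (target > 0.5).float().flatten()
    intersection = (pred * target).sum()
    return (2.0 * intersection + eps) / (pred.sum() + target.sum() + eps)
```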

Necessary acknowledgments: None.
