Image Segmentation Using U-Net

Image segmentation is the process of assigning each pixel in a digital image to a class. There are two types of image segmentation: semantic segmentation and instance segmentation.

In semantic segmentation, all pixels belonging to a given class are assigned the same label, whereas in instance segmentation each instance of a class is assigned a separate label.

In this repo I have implemented a simple U-Net to demonstrate image segmentation.
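The difference between the two label types can be illustrated on a toy mask. The sketch below is only an illustration: it derives instance ids from a semantic mask by grouping connected pixels, which is not how instance segmentation models work in practice, and the `label_instances` helper and the mask values are made up for this example.

```python
from collections import deque

# Toy 4x6 semantic mask: 0 = background, 1 = "vehicle".
# Both vehicles share the same class label.
semantic_mask = [
    [0, 1, 1, 0, 0, 0],
    [0, 1, 1, 0, 1, 1],
    [0, 0, 0, 0, 1, 1],
    [0, 0, 0, 0, 0, 0],
]


def label_instances(mask, target_class):
    """Give each 4-connected blob of `target_class` its own id (1, 2, ...)."""
    rows, cols = len(mask), len(mask[0])
    instances = [[0] * cols for _ in range(rows)]
    next_id = 0
    for r in range(rows):
        for c in range(cols):
            if mask[r][c] != target_class or instances[r][c]:
                continue
            # Unvisited pixel of the target class: start a new instance.
            next_id += 1
            instances[r][c] = next_id
            queue = deque([(r, c)])
            while queue:
                y, x = queue.popleft()
                for ny, nx in ((y + 1, x), (y - 1, x), (y, x + 1), (y, x - 1)):
                    if (0 <= ny < rows and 0 <= nx < cols
                            and mask[ny][nx] == target_class
                            and not instances[ny][nx]):
                        instances[ny][nx] = next_id
                        queue.append((ny, nx))
    return instances, next_id


instance_mask, n_instances = label_instances(semantic_mask, target_class=1)
for row in instance_mask:
    print(row)
```

In the semantic mask both vehicles carry the label 1, while in the instance mask each vehicle gets its own id.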

U-Net

U-Net, as the name suggests, follows a U-shaped encoder-decoder design in which the decoder mirrors the encoder, with a bottleneck layer between them. It was first introduced by Olaf Ronneberger et al. in their paper U-Net: Convolutional Networks for Biomedical Image Segmentation, together with a training strategy that relies heavily on data augmentation to use the available annotated samples more efficiently.

Each encoder block has two convolutional layers followed by a max-pooling layer. Each decoder block pairs two convolutional layers with an upsampling layer and also receives the output of the corresponding encoder block, as shown in the image above.
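To make the mirrored structure concrete, the following sketch traces how the spatial size and channel count could change through such a network. It assumes 'same'-padded convolutions (so only pooling and upsampling change the size), a factor of 2 for both pooling and upsampling, and a decoder that upsamples before combining with the encoder output. These are assumptions for illustration, and `trace_unet_shapes` is a hypothetical helper, not a function from this repo.

```python
def trace_unet_shapes(size, filters, bottleneck_filters):
    """Return (stage, spatial size, channels) for each stage of a U-Net pass."""
    trace = []
    skips = []
    # Encoder: two 'same'-padded convs set the channels, max pooling halves the size.
    for f in filters:
        trace.append((f"encoder {f}", size, f))
        skips.append((size, f))  # kept for the mirrored decoder block
        size //= 2
    trace.append(("bottleneck", size, bottleneck_filters))
    # Decoder: upsampling doubles the size; the encoder output is combined with it.
    for f in reversed(filters):
        size *= 2
        skip_size, skip_channels = skips.pop()
        assert skip_size == size, "skip connection must match the upsampled size"
        assert skip_channels == f, "decoder block must mirror its encoder block"
        trace.append((f"decoder {f}", size, f))
    return trace


for stage, size, channels in trace_unet_shapes(128, (64, 128, 256, 512), 1024):
    print(f"{stage:<12} {size:>3}x{size:<3} {channels} channels")
```

The two internal assertions check the property that makes the skip connections possible: every decoder block sees a feature map with the same spatial size as the encoder block it mirrors.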

Dataset

The custom dataset used in this implementation was created by Divam Gupta using CityScape Dataset. He also has an excellent blog on Semantic Segmentation using Keras. It has 367 annotated images of 480 by 360 pixels in the train set and 101 annotated images of the same size in the validation set. Each annotated image has its pixels classified into 12 classes: 'sky', 'building', 'column/pole', 'road', 'side walk', 'vegetation', 'traffic light', 'fence', 'vehicle', 'pedestrian', 'bicyclist', 'void'.
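Annotations of this kind are commonly stored as one integer class id per pixel, while a categorical cross-entropy loss typically expects one-hot targets. The sketch below shows that conversion on a tiny mask. The index order of `CLASS_NAMES` and the `one_hot_mask` helper are assumptions for illustration; the actual id-to-class mapping should be taken from the dataset itself.

```python
# Class names as listed above; the index order is an assumption for illustration.
CLASS_NAMES = [
    "sky", "building", "column/pole", "road", "side walk", "vegetation",
    "traffic light", "fence", "vehicle", "pedestrian", "bicyclist", "void",
]


def one_hot_mask(label_mask, n_classes):
    """Convert an HxW mask of class ids into an HxWxC nested list of 0/1 values."""
    return [
        [[1 if pixel == k else 0 for k in range(n_classes)] for pixel in row]
        for row in label_mask
    ]


# Toy 2x2 annotation: sky, road / vehicle, void (under the assumed order).
toy_labels = [[0, 3], [8, 11]]
encoded = one_hot_mask(toy_labels, len(CLASS_NAMES))

for row_ids, row_vecs in zip(toy_labels, encoded):
    for class_id, vec in zip(row_ids, row_vecs):
        print(CLASS_NAMES[class_id], vec)
```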

Method

A U-Net architecture with input and output shape of 128x128x3 was used. Its downsampler has encoder blocks with 64, 128, 256 and 512 filters, each block using 2 convolutional layers, and its upsampler has decoder blocks with 512, 256, 128 and 64 filters, again with 2 convolutional layers each. The bottleneck uses 2 convolutional layers with 1024 filters. The model had ~34.5M parameters and was trained for 20 epochs with the Adam optimizer and categorical cross-entropy as the loss function.
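As a rough sanity check on the parameter count, the layers can be tallied by hand. The sketch below assumes 3x3 convolutions with biases, 3x3 transposed convolutions for upsampling, concatenation of each skip connection before the decoder convolutions, and a final 1x1 convolution. None of these details are stated above, so treat them as assumptions; `unet_param_count` is a hypothetical helper, not code from `u_net.py`.

```python
def conv_params(kernel, c_in, c_out):
    """Weights plus biases of a 2D (or transposed 2D) convolution layer."""
    return kernel * kernel * c_in * c_out + c_out


def unet_param_count(in_channels, out_channels, filters=(64, 128, 256, 512),
                     bottleneck=1024, kernel=3, up_kernel=3):
    total = 0
    c = in_channels
    # Encoder blocks: two convolutions each (pooling has no parameters).
    for f in filters:
        total += conv_params(kernel, c, f) + conv_params(kernel, f, f)
        c = f
    # Bottleneck: two convolutions.
    total += conv_params(kernel, c, bottleneck)
    total += conv_params(kernel, bottleneck, bottleneck)
    c = bottleneck
    # Decoder blocks: transposed conv, concat with the skip, two convolutions.
    for f in reversed(filters):
        total += conv_params(up_kernel, c, f)   # learned upsampling (assumed)
        total += conv_params(kernel, 2 * f, f)  # input = upsampled + skip channels
        total += conv_params(kernel, f, f)
        c = f
    # Final 1x1 convolution to the output channels (assumed).
    total += conv_params(1, c, out_channels)
    return total


total = unet_param_count(in_channels=3, out_channels=3)
print(f"{total:,} parameters (~{total / 1e6:.1f}M)")
```

Under these assumptions the tally lands at roughly 34.5M, in line with the figure quoted above. The final layer contributes fewer than a thousand parameters, so the choice of `out_channels` barely affects the total.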

Steps

Clone this repo.
Download the dataset from this link.
Install the required libraries using requirements.txt.
Use train.ipynb to retrain the architecture on the custom dataset.
The training log will be stored as TensorBoard logs in the ./log folder.

Results

With the above method, the following results were achieved:

[Figures: accuracy plot and loss plot]
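For reference, accuracy in a segmentation setting is often measured per pixel. The sketch below assumes that definition; the text above does not say which accuracy metric was plotted, and `pixel_accuracy` and the toy masks are made up for illustration.

```python
def pixel_accuracy(pred_mask, true_mask):
    """Fraction of pixels whose predicted class id equals the annotated one."""
    correct = 0
    total = 0
    for pred_row, true_row in zip(pred_mask, true_mask):
        for p, t in zip(pred_row, true_row):
            correct += int(p == t)
            total += 1
    return correct / total


# Toy 2x3 masks of class ids; exactly one pixel is misclassified.
pred = [[0, 3, 3], [8, 8, 11]]
true = [[0, 3, 8], [8, 8, 11]]
acc = pixel_accuracy(pred, true)
print(f"pixel accuracy: {acc:.3f}")
```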

Applications

Image segmentation has applications in different sectors:

Medical Imaging
Autonomous Driving
Satellite Imaging
