Semantic Segmentation using variations of U-Net

Description

In Computer Vision, Semantic Segmentation of any image means classifying each pixel on the image into a particular class (including an unknown class if required).

In the past years of Deep Learning, major advances have been made in Semantic Segmentation and a breakthrough occurred when Fully Convolutional Network (FCN) architectures like FCN-8 and U-Net were designed for an end to end masked-data-based learning. This article makes variations in the original U-Net architecture to increase its performance.

Model Architectures

Original U-Net Architecture

1. Standard U-Net

Standard U-Net architecture is similar to original U-Net with difference in sizes. Also, to avoid copy and crop and do pure concatenation, total symmetry is maintained and hence output of encoder block at a given depth can be directly concatenated with to its decoder counterpart.

2. U-Net TCED (Tightly Connected Encoder and Decoder)

The Copy and Crop layers of U-Net concatenate Spatial Information from the encoder block to the decoder block at the same depth. The concatenated layer is squeezed again on the channel axis. These weights, copied directly from the encoder blocks, have spatial information that helps the decoder. The rationale for concatenating weights from the encoder at the same depth is to provide network some relevant spatial information and adding on this, this approach provides more amount of spatial context.

3. Res-U-Net

Simple Encoder is replaced with Encoders with residual layers like in ResNet architecture. As U-Net may be very deep and ResNet helps in fending off degradation problem in deep architectures.

Data set

CARLA self-driving car dataset with 1060 images and mask.
KITTI SELF DRIVING CARS DATA

Results

Inference on Carla Self Driving Car Dataset

Inference on images clicked in San Francisco

https://miro.medium.com/max/1400/1*f_bZokDyirlArd7m9Vamkw.png https://miro.medium.com/max/1400/1*uDr9aJosrB-zaemea8pRnQ.png

Commands to run

Training

New Training Unet-TCED

python .\src\driver.py -t training -n True -v UNetTCED

New Training UNet STD

python .\src\driver.py -t training -n True

Training from previous checkpoint

python .\src\driver.py -t training

Inference multiple images

python .\src\driver.py -t inference -m True -f .\data\carla\test\test_1\ -e png

Inference single image

python .\src\driver.py -t inference -i .\data\carla\test\test_1\7.png -d True -s True -e png

Name		Name	Last commit message	Last commit date
Latest commit History 6 Commits
src		src
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Semantic Segmentation using variations of U-Net

Description

Model Architectures

1. Standard U-Net

2. U-Net TCED (Tightly Connected Encoder and Decoder)

3. Res-U-Net

Data set

Results

Inference on Carla Self Driving Car Dataset

Inference on images clicked in San Francisco

Commands to run

Training

New Training Unet-TCED

New Training UNet STD

Training from previous checkpoint

Inference multiple images

Inference single image

About

Releases

Packages

Languages

srivastavashobhit/Semantic-Segmentation-using-variations-of-UNet

Folders and files

Latest commit

History

Repository files navigation

Semantic Segmentation using variations of U-Net

Description

Model Architectures

1. Standard U-Net

2. U-Net TCED (Tightly Connected Encoder and Decoder)

3. Res-U-Net

Data set

Results

Inference on Carla Self Driving Car Dataset

Inference on images clicked in San Francisco

Commands to run

Training

New Training Unet-TCED

New Training UNet STD

Training from previous checkpoint

Inference multiple images

Inference single image

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages