
Small-Object Sensitive Segmentation Using Across Feature Map Attention

This is the source code for the method described in our paper, Small-Object Sensitive Segmentation Using Across Feature Map Attention.

Overview of our method. (a) An overview of combining AFMA with a general semantic segmentation method: the encoder's feature maps are fed into the AFMA module, and its output is applied to the output of the segmentation model. (b-i) A detailed illustration of combining AFMA with different semantic segmentation models.

The framework of our method. (a) Calculating the across feature map attention (AFMA): the inputs are the initial image and the i-th layer feature maps of the encoder. (b) Output modification: the AFMA generated in (a) is used to modify the masks predicted by the decoder. (c) The process of generating the gold AFMA.

As shown in the framework figure above, our approach consists of three main parts:

  • Lines 23-176 of encoder_channelatt_img.py calculate the AFMA between the original image and any layer of feature maps (part (a) in the figure above); a minimal sketch follows this list.
  • The files with the _decoder.py suffix in the deeplabv3, fpn, linknet, manet, pan, pspnet, unet, and unetplusplus folders combine AFMA with the existing models (part (b) in the figure above). The AFMA approach is adaptable to different semantic segmentation architectures and can work on different layers of the encoder's feature maps.
  • MyLoss_correction.py in utils calculates the gold-standard AFMA and the training loss (part (c) in the figure above).
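
For orientation, here is a minimal PyTorch sketch of the attention in part (a). The class name, the 1x1 projections, and the patch size are illustrative assumptions, not the exact implementation in encoder_channelatt_img.py:

import torch
import torch.nn as nn
import torch.nn.functional as F

class AcrossFeatureMapAttention(nn.Module):
    # Sketch of part (a): attention between patches of the original image
    # and one layer of encoder feature maps. Patch size and projections
    # are assumptions for illustration.
    def __init__(self, img_channels, feat_channels, patch_size=16):
        super().__init__()
        self.patch_size = patch_size
        self.img_proj = nn.Conv2d(img_channels, feat_channels, kernel_size=1)
        self.feat_proj = nn.Conv2d(feat_channels, feat_channels, kernel_size=1)

    def forward(self, image, feature_map):
        # Average-pool the image so each position summarizes one patch.
        patches = F.avg_pool2d(image, self.patch_size)        # (B, C, Hp, Wp)
        q = self.img_proj(patches).flatten(2)                 # (B, D, Np)
        k = self.feat_proj(feature_map).flatten(2)            # (B, D, Nf)
        # Every image patch attends over every feature-map position.
        afma = torch.softmax(q.transpose(1, 2) @ k, dim=-1)   # (B, Np, Nf)
        return afma

The returned attention matrix is what part (b) then applies to the decoder's predicted masks.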

Requirements

  • albumentations==1.0.0
  • inplace_abn==1.1.0
  • matplotlib==3.4.2
  • numpy==1.22.2
  • opencv_python_headless==4.5.2.54
  • pretrainedmodels==0.7.4
  • segmentation_models_pytorch==0.2.0
  • torch==1.8.0
  • torchvision==0.9.0

Data

  • CamVid The Cambridge-driving Labeled Video Database (CamVid) is the first collection of videos with object class semantic labels, complete with metadata.
  • CityScapes Cityscapes is a large-scale dataset that contains a diverse set of stereo video sequences recorded in street scenes from 50 different cities.
  • Caltech-UCSD Birds Caltech-UCSD Birds 200 (CUB-200) is an image dataset with photos of 200 bird species (mostly North American).
  • LiTS 130 CT scans for segmentation of the liver as well as tumor lesions.
  • SkinLesion Skin Lesion Analysis towards Melanoma Detection Challenge Part I.

To make it easier for readers to reproduce and understand the code, I have provided a small amount of the example data used in our experiments under the dataset folder, which contains six training, validation, and test images for CamVid.
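
To sanity-check the bundled samples, a small loop like the following can enumerate image/mask pairs. The split and annotation folder names below follow a common CamVid convention and are assumptions; adjust them to the actual layout of the dataset folder:

from pathlib import Path

root = Path("dataset")
for split in ("train", "val", "test"):
    images = sorted((root / split).glob("*.png"))           # assumed image folder
    masks = sorted((root / f"{split}annot").glob("*.png"))  # assumed mask folder
    print(f"{split}: {len(images)} images, {len(masks)} masks")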

File declaration

models/attonimage: the code for calculating AFMA.

models/manet: the decoder and segmentation part of the manet_afma model.

models/unet: the decoder and segmentation part of the unet_afma model.

models/unetplusplus: the decoder and segmentation part of the unet++_afma model.

models/deeplabv3: the decoder and segmentation part of the deeplabv3_afma model.

models/fpn: the decoder and segmentation part of the fpn_afma model.

models/pan: the decoder and segmentation part of the pan_afma model.

models/linknet: the decoder and segmentation part of the linknet_afma model.

models/pspnet: the decoder and segmentation part of the pspnet_afma model.

main.py: the code for training, validating, and testing; a toy sketch of the AFMA training loss appears below.
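
The training loss compares the model's predicted AFMA with a gold AFMA derived from the ground-truth masks (part (c) of the framework). The following self-contained toy sketch illustrates the idea; the pooling scheme, resolutions, and MSE penalty are assumptions, not the code in utils/MyLoss_correction.py:

import torch
import torch.nn.functional as F

def gold_afma(gt_mask, patch_size=16, feat_size=(16, 16)):
    # Pool the ground-truth mask at patch resolution and at the
    # feature-map resolution, then correlate the two grids to get a
    # target attention map. gt_mask: (B, 1, H, W), float in {0, 1}.
    g_img = F.avg_pool2d(gt_mask, patch_size).flatten(2)           # (B, 1, Np)
    g_feat = F.adaptive_avg_pool2d(gt_mask, feat_size).flatten(2)  # (B, 1, Nf)
    return torch.softmax(g_img.transpose(1, 2) @ g_feat, dim=-1)   # (B, Np, Nf)

def afma_correction_loss(pred_afma, gt_mask):
    # Penalize the predicted attention against the gold attention.
    return F.mse_loss(pred_afma, gold_afma(gt_mask))

# Toy usage with random tensors matching the shapes above.
mask = (torch.rand(2, 1, 256, 256) > 0.5).float()
pred = torch.softmax(torch.randn(2, 256, 256), dim=-1)
print(afma_correction_loss(pred, mask).item())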

Run the code

Install the dependencies.

pip install -r requirements.txt

Train and test the model.

python main.py
