This is a PyTorch implementation of the paper Axial-DeepLab: Stand-Alone Axial-Attention for Panoptic Segmentation by Huiyu Wang, Yukun Zhu, Bradley Green, Hartwig Adam, Alan Yuille, and Liang-Chieh Chen.
This repository implements the paper's attention mechanism within several ResNet architectures.
Global self-attention on images suffers from the problem that it can only be applied after significant spatial downsampling of the input: each pixel's relation to every other pixel must be computed, which makes learning computationally very expensive and prevents its use across all layers of a fully attentional model.
In this paper the authors mitigate this issue by introducing their Axial-Attention concept, where the attention mechanism for a pixel is applied in two steps, vertically and horizontally:
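The two-step decomposition can be sketched as follows. This is a minimal illustration (no learned projections or positional terms, which the real layers include): attention is first applied along the height axis, then along the width axis, reducing the cost from O((HW)²) for global attention to O(HW(H+W)).

```python
import torch
import torch.nn.functional as F

def attend_last_axis(x):
    """Plain softmax self-attention over the sequence axis L.

    x: (..., L, C). No projections; for illustration only.
    """
    scores = x @ x.transpose(-1, -2) / x.shape[-1] ** 0.5  # (..., L, L)
    return F.softmax(scores, dim=-1) @ x                   # (..., L, C)

def axial_attention(x):
    """x: (B, H, W, C). Attend along height, then along width."""
    # Height axis: swap H and W so each column becomes a sequence of H pixels.
    x = attend_last_axis(x.transpose(1, 2)).transpose(1, 2)
    # Width axis: each row is a sequence of W pixels.
    x = attend_last_axis(x)
    return x
```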
Furthermore, they extend the positional encoding from the query pixels to the keys and values as well.
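A sketch of this position-sensitive attention along one axis is shown below. The relative embeddings r_q, r_k, r_v and their (L, L, C) shapes are illustrative assumptions; in the paper they are learned tables indexed by relative offset, and the key and value terms are the extension mentioned above.

```python
import torch
import torch.nn.functional as F

def axial_attention_1d(q, k, v, r_q, r_k, r_v):
    """Position-sensitive attention along one axis.

    q, k, v: (B, L, C); r_q, r_k, r_v: (L, L, C) relative positional
    embeddings for queries, keys, and values (shapes are illustrative).
    """
    # Content term plus positional terms for queries AND keys.
    logits = q @ k.transpose(-1, -2)                        # q · k
    logits = logits + torch.einsum('blc,lmc->blm', q, r_q)  # q · r_q
    logits = logits + torch.einsum('bmc,lmc->blm', k, r_k)  # k · r_k
    attn = F.softmax(logits, dim=-1)
    # The output aggregates values plus a positional value term.
    return attn @ v + torch.einsum('blm,lmc->blc', attn, r_v)
```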
I have only tested the implementation with ResNet-50 so far. The ResNet V1.5 architectures used are adapted from https://github.com/pytorch/vision/blob/master/torchvision/models/resnet.py
The paper notes: "In order to avoid careful initialization of WQ, WK, WV, rq, rk, rv, we use batch normalizations in all attention layers." Consequently, two batch normalization layers are applied per attention layer.
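One way this can look in practice is sketched below, with positional terms omitted for brevity: one batch norm after the joint q/k/v projection and one on the attention logits before the softmax. The exact placement and channel sizes here are illustrative, not necessarily identical to this repository's layers.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class AxialAttention1D(nn.Module):
    """Minimal 1D attention with two batch norms: one after the joint
    q/k/v projection, one on the (L, L) logit map (sizes illustrative)."""

    def __init__(self, in_ch, out_ch):
        super().__init__()
        self.qkv = nn.Conv1d(in_ch, 3 * out_ch, kernel_size=1, bias=False)
        self.bn_qkv = nn.BatchNorm1d(3 * out_ch)
        self.bn_logits = nn.BatchNorm2d(1)  # normalizes the logit map
        self.out_ch = out_ch

    def forward(self, x):  # x: (B, C, L)
        q, k, v = self.bn_qkv(self.qkv(x)).chunk(3, dim=1)
        logits = torch.einsum('bcl,bcm->blm', q, k) / self.out_ch ** 0.5
        logits = self.bn_logits(logits.unsqueeze(1)).squeeze(1)
        attn = F.softmax(logits, dim=-1)
        return torch.einsum('blm,bcm->bcl', attn, v)
```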
- attention: ResNet stages in which you would like to apply the attention layers
- num_heads: Number of attention heads
- kernel_size: Maximum local field on which Axial-Attention is applied
- inference: Allows inspecting the attention weights of a trained model
See the Jupyter notebook or the example training script.
- PyTorch
- I use fast.ai and the Imagenette dataset for the examples