
Visualizing Attention maps on input images #145

Closed
mineshmathew opened this issue Aug 6, 2019 · 5 comments


mineshmathew commented Aug 6, 2019

❓ Questions and Help

I was wondering if there is any way to visualize the attention weights over the original inputs for VQA or captioning. I see such figures in papers.

Is there a script already available for this?

apsdehal (Contributor) commented Aug 7, 2019

No, we don't have these scripts. But you can save the attention weights from the model and use the functions below to plot them on the image:

import matplotlib
import matplotlib.pyplot as plt
import numpy as np
import skimage.transform  # plain "import skimage" does not load the submodule
import cv2

from PIL import Image

cmap = matplotlib.cm.get_cmap('jet')
cmap.set_bad(color="k", alpha=0.0)

def attention_bbox_interpolation(im, bboxes, att):
    # att: one softmax weight per box; bboxes: (x1, y1, x2, y2) in pixel coords
    softmax = att
    assert len(softmax) == len(bboxes)

    img_h, img_w = im.shape[:2]
    opacity = np.zeros((img_h, img_w), np.float32)
    for bbox, weight in zip(bboxes, softmax):
        x1, y1, x2, y2 = bbox
        opacity[int(y1):int(y2), int(x1):int(x2)] += weight
    opacity = np.minimum(opacity, 1)

    # apply the colormap to the 2-D opacity map: (H, W) -> RGBA (H, W, 4);
    # adding a trailing axis first would give (H, W, 1, 4), which
    # Image.fromarray rejects
    vis_im = np.array(Image.fromarray(cmap(opacity, bytes=True), 'RGBA'))
    vis_im = vis_im.astype(im.dtype)
    vis_im = cv2.addWeighted(im, 0.7, vis_im, 0.5, 0)
    vis_im = vis_im.astype(im.dtype)

    return vis_im


def attention_grid_interpolation(im, att):
    # att: flattened 14x14 grid attention over convolutional features
    softmax = np.reshape(att, (14, 14))
    opacity = skimage.transform.resize(softmax, im.shape[:2], order=3)
    opacity = opacity[..., np.newaxis]
    opacity = opacity * 0.95 + 0.05  # keep the image faintly visible everywhere

    vis_im = opacity * im + (1 - opacity) * 255  # blend toward white
    vis_im = vis_im.astype(im.dtype)
    return vis_im

def visualize_pred(im_path, boxes, att_weights):
    im = cv2.imread(im_path)                   # OpenCV loads images as BGR
    im = cv2.cvtColor(im, cv2.COLOR_BGR2RGBA)  # convert to RGBA for plotting

    M = min(len(boxes), len(att_weights))
    im_ocr_att = attention_bbox_interpolation(im, boxes[:M], att_weights[:M])
    plt.imshow(im_ocr_att)
    plt.show()

Might have some errors though.
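For reference, here is a minimal sketch of the expected inputs, based on my reading of the code above (not a documented API): boxes are pixel-space `(x1, y1, x2, y2)` tuples and `att_weights` is one softmax score per box. The core opacity accumulation can be checked standalone with NumPy, no image needed:

```python
import numpy as np

# hypothetical inputs: two detected regions on a 100x100 image
boxes = [(10, 10, 50, 50), (40, 40, 90, 90)]
att_weights = [0.7, 0.3]  # one softmax weight per box

# same accumulation as attention_bbox_interpolation
opacity = np.zeros((100, 100), np.float32)
for (x1, y1, x2, y2), w in zip(boxes, att_weights):
    opacity[int(y1):int(y2), int(x1):int(x2)] += w
opacity = np.minimum(opacity, 1)  # overlapping boxes are clipped to 1

print(opacity[30, 30])  # inside the first box only -> ~0.7
print(opacity.max())    # overlap region, clipped -> ~1.0
```

Note that where boxes overlap, weights add up and are then clipped, so heavily overlapping detections can saturate the map.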

@CCYChongyanChen

FYI, here is an alternative to the function attention_grid_interpolation(im, att), which generates a white mask with varying opacity:

import skimage.filters

def get_blend_map(img, att_map, blur=True, overlap=True):
    # min-max normalize the attention map to [0, 1]
    att_map -= att_map.min()
    if att_map.max() > 0:
        att_map /= att_map.max()
    att_map = skimage.transform.resize(att_map, img.shape[:2], order=3)
    if blur:
        att_map = skimage.filters.gaussian(att_map, 0.02 * max(img.shape[:2]))
        att_map -= att_map.min()
        att_map /= att_map.max()
    cmap = plt.get_cmap('jet')
    att_map_v = cmap(att_map)
    att_map_v = np.delete(att_map_v, 3, 2)  # drop the alpha channel
    plt.imshow(att_map_v)
    # plt.imshow(img, alpha=0.2)
    plt.show()
    # cv2.imwrite("attentionmap.jpg", att_map_v * 255)
    return att_map_v
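A quick standalone check of the normalization step in this function (input values here are illustrative; only NumPy is needed): a flattened grid-attention vector is reshaped to 14x14 and min-max scaled to [0, 1] before being resized and colormapped:

```python
import numpy as np

att = np.random.RandomState(0).rand(196)  # hypothetical 14*14 grid attention
att_map = att.reshape(14, 14)

# same min-max normalization as get_blend_map
att_map = att_map - att_map.min()
if att_map.max() > 0:
    att_map = att_map / att_map.max()

print(att_map.min(), att_map.max())  # 0.0 1.0
```

The guard on `att_map.max() > 0` avoids a division by zero when the attention is uniform after shifting.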


shamanthak-hegde commented Jul 26, 2022


Hi, I understand what visualize_pred and attention_bbox_interpolation do. But what is get_blend_map or attention_grid_interpolation for, and how do we use them?
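For context, this is just my reading of the code above, not an authoritative answer: attention_bbox_interpolation is for region-based attention, where each weight corresponds to a detected bounding box, while attention_grid_interpolation and get_blend_map are for grid attention, where the model attends over a fixed 14x14 spatial grid of convolutional features. The two therefore expect differently shaped weights:

```python
import numpy as np

# hypothetical shapes for the two attention styles
region_att = np.full(36, 1 / 36)   # e.g. 36 detected boxes -> one weight per box
grid_att = np.full(196, 1 / 196)   # 14*14 = 196 grid cells, flattened

# region weights pair with a boxes array of shape (36, 4);
# grid weights are reshaped to the spatial grid instead
print(region_att.shape, grid_att.reshape(14, 14).shape)
```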


micdist commented Oct 13, 2022

@apsdehal Hi, thanks for sharing the code. I'm trying it with DETR and I get a size mismatch on the weights. What shape of weights did you develop this for? Does it expect the weights already projected onto the image dimensions, from which a bounding-box-sized slice is then extracted? Thanks.


batubb commented Jul 16, 2024

So you always visualize the first attention head? You ignore the others?
