Add attention visualization tool #36630

ArthurZucker · 2025-03-10T14:43:31Z

What does this PR do?

TODOS

Add some tests
make it work on all models

cc @yonigozlan and @zucchini-nlp @molbap

This will be available as:

from transformers.utils.attention_visualizer import AttentionMaskVisualizer
visualizer = AttentionMaskVisualizer("meta-llama/Llama-3.2-3B-Instruct")
visualizer("A normal attention mask")

visualizer = AttentionMaskVisualizer("mistralai/Mistral-Small-24B-Instruct-2501")
visualizer("A normal attention mask with a long text to see how it is displayed, and if it is displayed correctly")

visualizer = AttentionMaskVisualizer("google/paligemma2-3b-mix-224")
visualizer("<img> You are an assistant.", suffix = "What is on the image?")

visualizer = AttentionMaskVisualizer("google/gemma-2b")
visualizer("You are an assistant. Make sure you print me") # we should have slidiing on non sliding side by side

visualizer = AttentionMaskVisualizer("google/gemma-3-27b-it")
visualizer("<img>You are an assistant. Make sure you print me") # we should have slidiing on non sliding side by side

HuggingFaceDocBuilderDev · 2025-03-10T15:11:16Z

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

jmayank23 · 2025-03-13T02:41:04Z

Hi, I was trying to visualize the attention mask for Gemma 3 but I get

AttributeError: 'Gemma3ForConditionalGeneration' object has no attribute 'visualize_attention_mask'

For reference, sharing relevant excerpts of the code:

from transformers import AutoProcessor, Gemma3ForConditionalGeneration, AutoTokenizer
from PIL import Image
import json
import torch

model_id = "google/gemma-3-27b-it"

model = Gemma3ForConditionalGeneration.from_pretrained(
    model_id, device_map="auto"
).eval()

processor = AutoProcessor.from_pretrained(model_id)
tokenizer = AutoTokenizer.from_pretrained(model_id)
.
.
.
inputs = processor.apply_chat_template(
        messages,
        add_generation_prompt=True,
        tokenize=True,
        return_dict=True,
        return_tensors="pt"
    ).to(model.device, dtype=torch.bfloat16)

model.visualize_attention_mask(tokenizer, inputs)

ArthurZucker · 2025-03-17T14:53:25Z

Hey ! Will work a bit to make it more easy to use! THanks 🤗

…tion-utilities

molbap · 2025-03-18T15:55:28Z

Doing a first review, so happy about this

molbap

Amazing

src/transformers/utils/attention_visualizer.py

Co-authored-by: Pablo Montalvo <39954772+molbap@users.noreply.github.com>

…mers into attention-utilities

ArthurZucker added 2 commits March 10, 2025 15:42

add utils fiel

8c7e523

style

c0bcabe

ArthurZucker added 2 commits March 10, 2025 16:26

nits

70d5a1c

nits

1661e2a

ggerganov mentioned this pull request Mar 13, 2025

llama : refactor llama_context, llama_kv_cache, llm_build_context (v2) ggml-org/llama.cpp#12181

Merged

ArthurZucker added 17 commits March 17, 2025 17:39

update

f91474c

updaets

7b0fbfc

update

0a620d7

Merge branch 'main' of github.com:huggingface/transformers into atten…

8183c42

…tion-utilities

fix init issues

1c2548e

big updates

a646377

nits

8745ac5

nits?

d8aa60a

small updates

4f7d7c0

nites

80b4c0b

there were still some models left

2e74e31

style

e6f7815

fixes

ae8fa2f

updates

0595eb7

nits _ fixes

254fa8d

push changes

d7d090f

update

5c8f5e5

ArthurZucker marked this pull request as ready for review March 18, 2025 15:49

github-actions bot requested a review from Rocketknight1 March 18, 2025 15:50

ArthurZucker requested a review from molbap March 18, 2025 15:51

update

2ba5ba9

update

13ef2be

molbap approved these changes Mar 18, 2025

View reviewed changes

ArthurZucker and others added 7 commits March 18, 2025 18:40

Apply suggestions from code review

73bb3ab

Co-authored-by: Pablo Montalvo <39954772+molbap@users.noreply.github.com>

style

e5ff855

Merge branch 'attention-utilities' of github.com:huggingface/transfor…

2fe818d

…mers into attention-utilities

styling and return a string for testing

7c06ea5

small updates

483cb25

always biderectional for now

2a8b67d

update

178ba76

ArthurZucker merged commit fef8b7f into main Mar 19, 2025
19 of 24 checks passed

ArthurZucker deleted the attention-utilities branch March 19, 2025 12:58

stevhliu mentioned this pull request Mar 25, 2025

[Community contributions] Model cards #36979

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add attention visualization tool #36630

Add attention visualization tool #36630

ArthurZucker commented Mar 10, 2025 •

edited

Loading

HuggingFaceDocBuilderDev commented Mar 10, 2025

jmayank23 commented Mar 13, 2025

ArthurZucker commented Mar 17, 2025

molbap commented Mar 18, 2025

molbap left a comment

Add attention visualization tool #36630

Add attention visualization tool #36630

Conversation

ArthurZucker commented Mar 10, 2025 • edited Loading

What does this PR do?

HuggingFaceDocBuilderDev commented Mar 10, 2025

jmayank23 commented Mar 13, 2025

ArthurZucker commented Mar 17, 2025

molbap commented Mar 18, 2025

molbap left a comment

Choose a reason for hiding this comment

ArthurZucker commented Mar 10, 2025 •

edited

Loading