
Decoder-only Attribution Models Support #144

Merged
merged 21 commits into from
Dec 5, 2022
Conversation

@gsarti (Member) commented Oct 20, 2022

Description

This PR introduces support for performing attribution on decoder-only models from Hugging Face (e.g. GPT-2) using the AutoModelForCausalLM abstraction. The implementation involves several generalizations of existing classes and methods, notably:

  • Support for Batch alongside the current EncoderDecoderBatch usage.
  • Support for target-side-only attributions produced by decoder-only models throughout the attribution pipeline and in FeatureAttributionSequenceOutput.
  • Handling ForSeq2SeqLM (current) and ForCausalLM (to be added) in the HuggingfaceModel class.

The new loading pipeline will also support passing pre-loaded models to inseq.load_model, aiming for compatibility with LLM.int8 (see #141).
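The dispatch described above can be sketched as follows. This is a toy illustration of the split the PR introduces, not inseq's actual code: decoder-only checkpoints go through AutoModelForCausalLM with a plain Batch, while seq2seq checkpoints keep AutoModelForSeq2SeqLM with EncoderDecoderBatch (the class bodies here are simplified stand-ins).

```python
# Toy sketch (not inseq's real implementation) of the dispatch this PR adds.
from dataclasses import dataclass


@dataclass
class Batch:
    """Target-side-only inputs, used by decoder-only models."""
    input_ids: list


@dataclass
class EncoderDecoderBatch:
    """Source + target inputs, used by encoder-decoder models."""
    sources: Batch
    targets: Batch


def pick_model_class(is_encoder_decoder: bool) -> str:
    # Mirrors the ForSeq2SeqLM vs ForCausalLM split in HuggingfaceModel.
    return "AutoModelForSeq2SeqLM" if is_encoder_decoder else "AutoModelForCausalLM"


# Decoder-only models (e.g. GPT-2) take the causal LM path with a plain Batch.
batch = Batch(input_ids=[464, 6260])
assert pick_model_class(False) == "AutoModelForCausalLM"
```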

Related Issue

#128

Type of Change

  • 🚀 New feature (non-breaking change which adds functionality)

@gsarti gsarti assigned gsarti and nfelnlp and unassigned gsarti and nfelnlp Oct 20, 2022
@gsarti gsarti requested a review from nfelnlp October 20, 2022 15:06
@gsarti gsarti added the enhancement New feature or request label Oct 20, 2022
@gsarti gsarti added this to the Demo Paper Release milestone Oct 20, 2022
@gsarti gsarti linked an issue Oct 20, 2022 that may be closed by this pull request
@gsarti (Member Author) commented Oct 26, 2022

In the current state, decoder-only models can be instantiated with inseq.load_model and used to generate text via the model.generate wrapper function. Compatibility with EncoderDecoder models is preserved, and all currently existing tests pass. However, the attribution process does not yet work for DecoderOnly models.

@gsarti (Member Author) commented Nov 3, 2022

Major (non-breaking) changes in progress:

  • Disentangled format_attribute_args from attribute_step to enable independent usage and overriding in child classes (see e.g. the DiscretizedIntegratedGradients override)

  • Model architecture classes: separated architecture and framework responsibilities in AttributionModel by introducing framework-agnostic architecture classes (EncoderDecoderAttributionModel, DecoderOnlyAttributionModel) to handle input processing and attribution parameter formatting. Final classes are children combining one framework and one architecture (e.g. HuggingfaceEncoderDecoderModel).

  • Fix partial target-side attribution: these were added early on but didn't keep up with newer features, resulting in wrong slices when non-default attr_pos_start and attr_pos_end parameters were passed. They should now be fully fixed and functional when attribute_target=True.
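The architecture/framework split above can be sketched as a small class hierarchy. This is an illustrative mock, not inseq's actual implementation: the method name format_forward_args and the decoder-only final class name are assumptions for the sketch (the PR only names HuggingfaceEncoderDecoderModel explicitly).

```python
# Toy sketch of the class split described above; bodies are illustrative.
class AttributionModel:
    """Shared, framework-agnostic attribution logic."""


class EncoderDecoderAttributionModel(AttributionModel):
    def format_forward_args(self, sources, targets):
        # Encoder-decoder models keep source and target inputs separate.
        return {"encoder_inputs": sources, "decoder_inputs": targets}


class DecoderOnlyAttributionModel(AttributionModel):
    def format_forward_args(self, sources, targets):
        # Decoder-only models see one concatenated sequence: prefix + generation.
        return {"inputs": sources + targets}


class HuggingfaceModel:
    """Framework-specific logic (tokenization, generate, ...)."""


# Final classes combine one framework with one architecture:
class HuggingfaceEncoderDecoderModel(HuggingfaceModel, EncoderDecoderAttributionModel):
    pass


class HuggingfaceDecoderOnlyModel(HuggingfaceModel, DecoderOnlyAttributionModel):
    pass
```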

Example with full attribution:

[Image: full_attribution]

Example constraining same attributions between steps [6, 10):

[Image: partial_attribution]

Note how the selected slices attributed in the second case match the scores of the same slices in the full attribution.

Todos:

  • Add classes responsibility diagrams with attribution flow to Dev docs
  • Fix contrastive attribution test
  • Add test case for full vs sliced attribution using attribute_target

@gsarti (Member Author) commented Nov 7, 2022

Some tests are currently broken (they need checking), but decoder-only attribution seems to produce the right output. Visualization does not yet support decoder-only formats, so .show is not yet functional.

@gsarti (Member Author) commented Nov 9, 2022

Minor adjustments to viz functions and aggregators to support an empty source_attribution field for decoder-only models. Tests are failing due to issues with attr_pos_start and attr_pos_end that require further investigation: current results are wrong.

@gsarti (Member Author) commented Nov 10, 2022

Fixed the issues with positions, and now both seq2seq and decoder-only attribution are working as expected! 🎉 Also added the first test for decoder-only models to the test suite to ensure continued compatibility. Points still needing discussion/fixing:

  • Currently, combining batched attribution (i.e. more than one input_text provided to attribute) with constrained decoding (i.e. a specified generated_text) doesn't work if the generated texts do not have the same length. Supporting this would require bidirectional padding, while the use of padding="left" for decoder-only models currently forces all batched outputs to have the same length.

EDIT: For now, we opted to enforce a batch size of one (with an info message informing the user) to handle this case. The only downside is a slower attribution process, which is reasonable in this context.

  • Setting start/end positions for decoder-only attributions produces wrong slices (i.e. the only modes in which decoder-only attribution currently works are single-example generated/constrained, or batched generated/constrained with fixed length). This will need fixing before merging.

EDIT: We now enforce a batch size of one when multiple texts with custom start/end positions are passed to a decoder-only model.
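The padding constraint and the batch-size-one fallback described above can be illustrated with a minimal sketch. This is not inseq's code: left_pad and attribute_batch are hypothetical stand-ins showing why mixed-length generations under left padding force per-example processing.

```python
# Illustrative sketch (not inseq code) of the left-padding constraint for
# decoder-only models and the batch-size-one fallback chosen in this PR.
def left_pad(seqs, pad_id=0):
    """Left-pad token sequences to a common length, as padding='left' does."""
    max_len = max(len(s) for s in seqs)
    return [[pad_id] * (max_len - len(s)) + s for s in seqs]


def attribute_batch(generations):
    lengths = {len(g) for g in generations}
    if len(lengths) > 1:
        # Mixed-length constrained generations would need bidirectional
        # padding; instead, fall back to attributing one example at a time.
        return [attribute_batch([g]) for g in generations]
    # Equal lengths: the batch can be left-padded and processed together.
    return left_pad(generations)
```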

@nfelnlp (Collaborator) commented Nov 13, 2022

This looks amazing! :)

When I tested it with the following setup:

gpt_model = inseq.load_model("gpt2", "saliency")
gpt_out = gpt_model.attribute(
    "The developer argued with the designer because her idea",
    show_progress=True,
    pretty_progress=True,
    generation_args={"max_new_tokens": 10}
)
gpt_out.show()

...I ran into this assertion error:

assert all(
    generated_texts[idx].startswith(input_texts[idx]) for idx in range(len(input_texts))
), "Forced generations with decoder-only models must start with the input texts."

It turns out that generated_texts is a str, while input_texts is a list (of length one).
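The mismatch can be reproduced in isolation: when generated_texts is a plain str, indexing it yields single characters, so the startswith check compares a one-character string against the full input text and fails. (The generation text below is a made-up continuation, just for illustration.)

```python
# Minimal reproduction of the reported type mismatch.
input_texts = ["The developer argued with the designer because her idea"]
# Bug: generate returned a bare str instead of a list of length one.
generated_texts = "The developer argued with the designer because her idea was"

# Indexing a str gives a single character, not the full generation:
assert generated_texts[0] == "T"
assert not generated_texts[0].startswith(input_texts[0])

# With generate consistently returning a list, the check passes:
generated_texts = [generated_texts]
assert all(
    generated_texts[idx].startswith(input_texts[idx])
    for idx in range(len(input_texts))
)
```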

[Screenshot: assertion error traceback]

I could produce the following after simply commenting out those three lines:

[Screenshot: attribution output with the assertion commented out]

Should I run into this error or should we make the assertion less strict?

@gsarti (Member Author) commented Nov 14, 2022

Thanks for spotting this bug @nfelnlp! The format_input_texts call in attribute is supposed to ensure that both input and generated texts are lists of strings, but if the generated text is not forced and there is only one, generate wrongly returns a string instead of a list of length 1. To solve the issue, it's sufficient to remove this line

texts = texts[0] if len(texts) == 1 else texts

from HuggingfaceModel.generate.
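The fix can be sketched as follows. This is a toy stand-in for HuggingfaceModel.generate (the decoding step is faked), showing only the relevant change: dropping the single-element unwrap so the function consistently returns a list of strings.

```python
# Toy version of the fix; the decoding step below is a stand-in, not
# HuggingfaceModel.generate's real logic.
def generate(texts):
    outputs = [t + " was completed" for t in texts]  # stand-in for decoding
    # Buggy line removed, so a single input no longer collapses to a bare str:
    # outputs = outputs[0] if len(outputs) == 1 else outputs
    return outputs
```

With the unwrap removed, downstream checks like the startswith assertion can always index outputs as a list, regardless of batch size.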

@gsarti (Member Author) commented Nov 15, 2022

@nfelnlp with the last commit, both the case you mentioned and batched decoder-only attribution should work; our choice is documented in the comment above!

@gsarti (Member Author) commented Nov 18, 2022

@nfelnlp only some robustness tests are left, then we can proceed with the merge. Could you help with that? 🙂

@nfelnlp (Collaborator) commented Dec 5, 2022

Just tested the latest version and it works flawlessly for the GPT-2 use case I described above.
Looks good to me! ✅

@gsarti gsarti changed the title [WIP] Decoder-only Attribution Models Support Decoder-only Attribution Models Support Dec 5, 2022
@gsarti gsarti merged commit a08af94 into main Dec 5, 2022
@gsarti gsarti deleted the decoder-only-support branch December 5, 2022 09:22
@gsarti gsarti added this to the v0.5 milestone Jul 21, 2023
Successfully merging this pull request may close these issues.

Add support for decoder-only models