fix adapters with *adapter_attention* #29

Closed
4 tasks

JoPfeiff opened this issue Jul 13, 2020 · 2 comments · Fixed by adapter-hub/Hub#9 or adapter-hub/Hub#20
Labels
bug Something isn't working

Comments

@JoPfeiff
Member

🐛 Bug

Old versions of the adapters initialized *adapter_attention* parameters, which were never used but were still stored in the checkpoints.
I propose a two-stage fix:

  • a hot fix that suppresses the warning about the uninstantiated parameters (a sketch follows this list)

  • removing the parameters from all adapter checkpoints
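
A minimal sketch of what the hot fix could look like (a hypothetical helper, not the actual library code), assuming the adapter weights are loaded as a plain state dict: the legacy keys are filtered out before they reach load_state_dict, so no unexpected-key warning is ever triggered.

import torch

def load_adapter_state_dict(path):
    # Load the adapter weights and silently drop the legacy
    # *adapter_attention* parameters so they never trigger a warning.
    state_dict = torch.load(path, map_location="cpu")
    return {k: v for k, v in state_dict.items() if "adapter_attention" not in k}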

Information

Model I am using (Bert, XLNet ...): e.g. RoBERTa-Base

Language I am using the model on (English, Chinese ...): English

Adapter setup I am using (if any):
many, but e.g.:

from transformers import AutoModel

model = AutoModel.from_pretrained("roberta-base")
model.load_adapter("comsense/csqa@ukp", "text_task", config="{'using': 'pfeiffer'}")

The problem arises when using:

  • [x] the official example scripts: (give details below)
  • [ ] my own modified scripts: (give details below)

The task I am working on is:

  • [x] an official GLUE/SQuAD task: (give the name)
  • [ ] my own task or dataset: (give details below)

To reproduce

Steps to reproduce the behavior:

  1. Load the adapter; a warning is logged for the *adapter_attention* parameters, which are not required (a quick way to check a checkpoint for these keys is sketched below).
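
For illustration, a sketch of how to check whether a downloaded adapter checkpoint still carries the legacy keys (the file name pytorch_adapter.bin and the path are assumptions; adjust them to your setup):

import torch

# Path is a placeholder; point it at the downloaded adapter weights.
state_dict = torch.load("path/to/pytorch_adapter.bin", map_location="cpu")
legacy_keys = [k for k in state_dict if "adapter_attention" in k]
print(legacy_keys)  # non-empty for affected adapters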

Expected behavior

No warning should be logged for the unused *adapter_attention* parameters.

Environment info

  • transformers version:
  • Platform:
  • Python version:
  • PyTorch version (GPU?):
  • Tensorflow version (GPU?):
  • Using GPU in script?:
  • Using distributed or parallel set-up in script?:
JoPfeiff added the bug (Something isn't working) label on Jul 13, 2020
@calpt
Member

calpt commented Jul 13, 2020

hot fix which does not log the warning that the parameters were not instantiated

Can we keep the logging and reduce the log level from "warn" to "info"?
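
A minimal sketch of that suggestion, assuming a standard module-level logger (the actual call site in the library may look different):

import logging

logger = logging.getLogger(__name__)

def report_skipped_keys(skipped_keys):
    # INFO instead of WARNING: still visible at verbose log levels,
    # but no longer alarms users by default.
    if skipped_keys:
        logger.info("Skipping unused legacy parameters: %s", skipped_keys)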

@arueckle
Member

Can we just remove the weights from the checkpoints?

import sys
from shutil import copyfile

import torch

# Checkpoint path is passed as the first command-line argument.
chckpt_path = sys.argv[1]

# Keep a backup so the original checkpoint can be restored.
copyfile(chckpt_path, chckpt_path + '.backup')

chckpt = torch.load(chckpt_path, map_location=torch.device('cpu'))
chckpt_new = dict()

# Copy every weight except the unused legacy *adapter_attention* parameters.
for k, w in chckpt.items():
    if 'adapter_attention' not in k:
        chckpt_new[k] = w
    else:
        print('unwanted key: {}'.format(k))

# Overwrite the checkpoint with the cleaned state dict.
torch.save(chckpt_new, chckpt_path)
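
Saved as e.g. strip_adapter_attention.py (file name hypothetical), the script would be invoked with the checkpoint path as its only argument:

python strip_adapter_attention.py path/to/pytorch_adapter.bin

Since a .backup copy is written first, the original checkpoint can be restored if the cleanup goes wrong.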
