
about transformers version compatibility #4

Closed
jianbohuang opened this issue Apr 19, 2023 · 6 comments

Comments

@jianbohuang

Is this compatible with higher transformers versions?

When I run it in my environment with transformers 4.28.0, the following error occurs:

./models/bert.py", line 229, in forward
RuntimeError: The size of tensor a (3) must match the size of tensor b (9) at non-singleton dimension 0
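
For context, here is a minimal standalone sketch of the failing matmul. Only the 3-vs-9 batch mismatch comes from the error above; the head and sequence dimensions are made up:

    # Hypothetical shapes; only the batch sizes (3 vs. 9) come from the traceback.
    import torch

    query_layer = torch.randn(3, 12, 10, 64)  # decoder-side queries, batch 3
    key_layer = torch.randn(9, 12, 20, 64)    # cross-attention keys, batch 9

    # Raises: RuntimeError: The size of tensor a (3) must match the size of
    # tensor b (9) at non-singleton dimension 0
    attention_scores = torch.matmul(query_layer, key_layer.transpose(-1, -2))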

@xinyu1205
Owner

Thank you for your very valuable feedback. I will look into this issue and get back to you within the next few days.

@deyiluobo

I had the same problem; lowering the transformers version did not help.

@xinyu1205
Owner

The reason behind the issue is that our code is modified from BLIP. To resolve it quickly, you can refer to the simple solution provided in this GitHub comment: salesforce/BLIP#142 (comment).

Further modifications to Tag2Text/models/bert.py are required to align it with the newer transformers version. I have added this to my pending task list, but due to my current workload I cannot promise a quick fix. I really hope you can understand.

Thank you once again for bringing this issue to my attention.

@bloodhunt3r

Replace the following code:

https://github.com/xinyu1205/Tag2Text/blob/9f6866e115ed3026d748bc67de67a9d428df1016/models/bert.py#L224-L229

with the following code to align the batch dimensions:

        # Newer transformers versions can hand the cross-attention a key/value
        # batch that is larger than the query batch (e.g. after beam expansion);
        # truncate the keys, values, and attention mask to match the queries.
        if key_layer.shape[0] > query_layer.shape[0]:
            key_layer = key_layer[:query_layer.shape[0], :, :, :]
            attention_mask = attention_mask[:query_layer.shape[0], :, :]
            value_layer = value_layer[:query_layer.shape[0], :, :, :]
        attention_scores = torch.matmul(query_layer, key_layer.transpose(-1, -2))
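
To sanity-check the fix outside the model, here is a self-contained sketch (all shapes hypothetical apart from the 3-vs-9 mismatch seen in the traceback) showing that the truncation lets the matmul broadcast cleanly:

    import torch

    query_layer = torch.randn(3, 12, 10, 64)
    key_layer = torch.randn(9, 12, 20, 64)
    value_layer = torch.randn(9, 12, 20, 64)
    attention_mask = torch.zeros(9, 1, 20)

    # Drop the extra copies of the encoder states so batch dimensions line up.
    if key_layer.shape[0] > query_layer.shape[0]:
        key_layer = key_layer[:query_layer.shape[0], :, :, :]
        attention_mask = attention_mask[:query_layer.shape[0], :, :]
        value_layer = value_layer[:query_layer.shape[0], :, :, :]

    attention_scores = torch.matmul(query_layer, key_layer.transpose(-1, -2))
    print(attention_scores.shape)  # torch.Size([3, 12, 10, 20])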

@xinyu1205
Owner

Thank you very much for sharing! If you have tested it against the corresponding version, you are very welcome to become a contributor to this project by opening a Pull Request (please add comments in the corresponding area). We appreciate your willingness to share and look forward to your contributions.

@Qiliqing

The fix above works, thx!
