fix Internvl-int8 sft bug #932

hjh0119 · 2024-05-14T11:54:49Z

PR type

Bug Fix
New Feature
Document Updates
More Models or Datasets Support

PR information

fix bug described in OpenGVLab/InternVL#129

bitsandbytes/autograd/_functions.py", line 479, in backward
    .mul_(state.SCB.unsqueeze(1).mul(1.0 / 127.0))
RuntimeError: The size of tensor a (92576) must match the size of tensor b (92553) at non-singleton dimension 0

…_url * commit '84826bdda4cbd58f51a3c84ad55787cb1c723f4f': update doc (modelscope#934) fix Internvl-int8 sft bug (modelscope#932)

* commit '6e5b58a8af8e1fb92b1498d5c45cfbea11da1b36': fix Internvl-int8 device map (modelscope#937) support ms-agent-roleplay dataset (modelscope#936) FIx eval url (modelscope#941) update doc (modelscope#934) fix Internvl-int8 sft bug (modelscope#932)

jinghan added 28 commits May 13, 2024 14:12

init

7b8b5ac

fix

9fe8e6f

fix

59dc739

fix

9b91384

fix

b0d41e8

fix

03770b6

fix

1e34aed

fix

7c93aa0

fix

eafcbae

fix

71f6be1

fix

72cd9b1

update

d52b6bb

fix

142e48e

fix

e6dba83

fix

3f085cd

update

e9eb893

update

a7de550

fix

5981711

fix dtype

cfc10ea

update

62a2745

test

eca737d

test

49296d4

update

2090e89

fix infer bnb judge

ed84f39

update

bac2625

init

84504fa

update

5357c72

update

eb462c9

tastelikefeet approved these changes May 14, 2024

View reviewed changes

hjh0119 merged commit cef448b into modelscope:main May 14, 2024
1 of 2 checks passed

hjh0119 deleted the internvl branch May 14, 2024 12:15

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

fix Internvl-int8 sft bug #932

fix Internvl-int8 sft bug #932

hjh0119 commented May 14, 2024

fix Internvl-int8 sft bug #932

fix Internvl-int8 sft bug #932

Conversation

hjh0119 commented May 14, 2024

PR type

PR information