
How to load model with load_8_bit #24

Closed
stevezhang88 opened this issue Apr 18, 2023 · 3 comments

Comments

@stevezhang88

My GPU is a 3090 Ti with 24G of memory, so I have to use load_8_bit to load Vicuna 13B. Could you tell me how to do this with MiniGPT-4?

@TsuTikgiau
Collaborator

Thank you for your interest! Currently, you can set low_resource to True in the evaluation config file eval_configs/minigpt4_eval.yaml. Besides, you may need to set the number of beams to 1 in demo.py to save GPU memory.
As another user reported in a related issue, this reduces memory usage to about 24G, although it may still OOM when the output is long. We are working on a solution that fits within 24G of memory and will get back to you once we finish.
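For context, the low_resource flag corresponds to the standard 8-bit loading path in Hugging Face transformers (via bitsandbytes). Below is a minimal sketch of that general technique, not MiniGPT-4's internal code; the checkpoint name and prompt are placeholders for illustration:

```python
# Sketch of loading a causal LM in 8-bit and generating with a single beam.
# Assumes transformers, accelerate, and bitsandbytes are installed.
from transformers import AutoModelForCausalLM, AutoTokenizer

model_name = "lmsys/vicuna-13b-v1.1"  # placeholder checkpoint name

tokenizer = AutoTokenizer.from_pretrained(model_name)
model = AutoModelForCausalLM.from_pretrained(
    model_name,
    load_in_8bit=True,   # quantize weights to int8 to roughly halve GPU memory
    device_map="auto",   # let accelerate place layers on the available GPU
)

# Beam search keeps one candidate sequence per beam in memory,
# so num_beams=1 is the cheapest setting for a 24G card.
inputs = tokenizer("Describe this image:", return_tensors="pt").to(model.device)
output = model.generate(**inputs, num_beams=1, max_new_tokens=64)
print(tokenizer.decode(output[0], skip_special_tokens=True))
```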

@stevezhang88
Author

Thanks, you're so nice. I'll wait for your solution. I suppose many people like me only have a single-GPU computer.

@TsuTikgiau
Collaborator

We updated the default settings of the demo, and it should now load Vicuna in 8-bit by default. The demo should be able to run on a single 3090 if you set the beam search width to 1 (which is also the default value now).
