
Loading from a local path and quantizing, the program does not respond #37

Closed
wweyl opened this issue Jul 12, 2023 · 3 comments

Comments

wweyl commented Jul 12, 2023

```python
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer
from transformers.generation.utils import GenerationConfig

tokenizer = AutoTokenizer.from_pretrained("/root/autodl-tmp/model/Baichuan-13B-Base", use_fast=False, trust_remote_code=True)
model = AutoModelForCausalLM.from_pretrained("/root/autodl-tmp/model/Baichuan-13B-Base", torch_dtype=torch.float16, trust_remote_code=True)
model = model.quantize(8).cuda()
```

Loading the model from a local path and then quantizing it, the program does not respond and memory usage does not grow. I am not sure what is wrong with this code.

@jameswu2014 (Collaborator)

What is your machine configuration?

wweyl commented Jul 13, 2023

GPU: a V100 with 32 GB of VRAM; 29 GB of system RAM.

@jameswu2014 (Collaborator)

You could try increasing the swap space.
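For context (not advice given in the thread itself): a 13B-parameter model in float16 is roughly 26 GB of weights, so loading it into 29 GB of host RAM and then quantizing can stall while the OS pages to swap. Below is a minimal sketch, assuming a transformers version that supports the `low_cpu_mem_usage` flag (which typically requires accelerate to be installed), showing how peak host memory during loading can be reduced before calling Baichuan's `quantize(8)`:

```python
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_path = "/root/autodl-tmp/model/Baichuan-13B-Base"

tokenizer = AutoTokenizer.from_pretrained(
    model_path, use_fast=False, trust_remote_code=True
)

# low_cpu_mem_usage=True loads the checkpoint incrementally instead of first
# materializing a fully initialized model in host RAM, which lowers the peak
# CPU memory needed before quantization.
model = AutoModelForCausalLM.from_pretrained(
    model_path,
    torch_dtype=torch.float16,
    trust_remote_code=True,
    low_cpu_mem_usage=True,
)

# Same int8 quantization step as in the original snippet.
model = model.quantize(8).cuda()
```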
