You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
(py3.9_env) weidong@weidongdeMacBook-Pro-2 ~ % python /Volumes/Data/LLM/Chinese-LLaMA-Alpaca-2-main/scripts/openai_server_demo/openai_api_server.py --base_model /Volumes/Data/LLM/chinese-alpaca-2-7b-hf --only_cpu
===================================BUG REPORT===================================
Welcome to bitsandbytes. For bug reports, please run
python -m bitsandbytes
and submit this information together with your error trace to: https://github.com/TimDettmers/bitsandbytes/issues
================================================================================
bin /Users/weidong/anaconda3/envs/py3.9_env/lib/python3.9/site-packages/bitsandbytes/libbitsandbytes_cpu.so
/Users/weidong/anaconda3/envs/py3.9_env/lib/python3.9/site-packages/bitsandbytes/cextension.py:34: UserWarning: The installed version of bitsandbytes was compiled without GPU support. 8-bit optimizers, 8-bit multiplication, and GPU quantization are unavailable.
warn("The installed version of bitsandbytes was compiled without GPU support. "
'NoneType' object has no attribute 'cadam32bit_grad_fp32'
CUDA SETUP: Loading binary /Users/weidong/anaconda3/envs/py3.9_env/lib/python3.9/site-packages/bitsandbytes/libbitsandbytes_cpu.so...
dlopen(/Users/weidong/anaconda3/envs/py3.9_env/lib/python3.9/site-packages/bitsandbytes/libbitsandbytes_cpu.so, 0x0006): tried: '/Users/weidong/anaconda3/envs/py3.9_env/lib/python3.9/site-packages/bitsandbytes/libbitsandbytes_cpu.so' (not a mach-o file), '/System/Volumes/Preboot/Cryptexes/OS/Users/weidong/anaconda3/envs/py3.9_env/lib/python3.9/site-packages/bitsandbytes/libbitsandbytes_cpu.so' (no such file), '/Users/weidong/anaconda3/envs/py3.9_env/lib/python3.9/site-packages/bitsandbytes/libbitsandbytes_cpu.so' (not a mach-o file)
Xformers is not installed correctly. If you want to use memory_efficient_attention use the following command to install Xformers
pip install xformers.
USE_MEM_EFF_ATTENTION: False
STORE_KV_BEFORE_ROPE: False
Apply NTK scaling with ALPHA=1.0
The value of scaling factor will be read from model config file, or set to 1.
Loading checkpoint shards: 100%|███████████████████████████████████████████████████████| 2/2 [00:06<00:00, 3.34s/it]
/Users/weidong/anaconda3/envs/py3.9_env/lib/python3.9/site-packages/transformers/generation/configuration_utils.py:362: UserWarning: `do_sample` is set to `False`. However, `temperature` is set to `0.9` -- this flag is only used in sample-based generation modes. You should set `do_sample=True` or unset `temperature`. This was detected when initializing the generation config instance, which means the corresponding file may hold incorrect parameterization and should be fixed.
warnings.warn(
/Users/weidong/anaconda3/envs/py3.9_env/lib/python3.9/site-packages/transformers/generation/configuration_utils.py:367: UserWarning: `do_sample` is set to `False`. However, `top_p` is set to `0.6` -- this flag is only used in sample-based generation modes. You should set `do_sample=True` or unset `top_p`. This was detected when initializing the generation config instance, which means the corresponding file may hold incorrect parameterization and should be fixed.
warnings.warn(
Vocab of the base model: 55296
Vocab of the tokenizer: 55296
Traceback (most recent call last):
File "/Volumes/Data/LLM/Chinese-LLaMA-Alpaca-2-main/scripts/openai_server_demo/openai_api_server.py", line 105, in <module>
model.float()
File "/Users/weidong/anaconda3/envs/py3.9_env/lib/python3.9/site-packages/transformers/modeling_utils.py", line 2068, in float
raise ValueError(
ValueError: `.float()` is not supported for quantized model. Please use the model as it is, since the model has already been casted to the correct `dtype`.
The text was updated successfully, but these errors were encountered:
This issue has been automatically marked as stale because it has not had recent activity. It will be closed if no further activity occurs. Thank you for your consideration.
提交前必须检查以下项目
问题类型
模型量化和部署
基础模型
Chinese-Alpaca-2 (7B/13B)
操作系统
macOS
详细描述问题
依赖情况(代码类问题务必提供)
运行日志或截图
The text was updated successfully, but these errors were encountered: