We read every piece of feedback, and take your input very seriously.
To see all available qualifiers, see our documentation.
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
llamafactory version ==v0.7.1 torch ==2.1 torch_npu == 2.1.0.post3
ubuntu22.0.4
华为800T A2 8卡 Ascend 910b CANN toolkit = Ascend-cann-toolkit_8.0.RC1_linux-aarch64.run CANN kernels = Ascend-cann-kernels-910b_8.0.RC1_linux.run
运行代码 ASCEND_RT_VISIBLE_DEVICES=0 llamafactory-cli chat llama2.yaml llama2.yaml内容:
model_name_or_path: /mnt/nvme1/models/Llama-2-7b-chat-hf/ template: llama2 do_sample: false
命令成功运行·但是提示使用cpu推理而不是npu
能执行单卡推理llama2 多卡推理llama2
No response
The text was updated successfully, but these errors were encountered:
之前也是用过readme中的cann toolkit和kernel 报一样的错误
Sorry, something went wrong.
推理速度正常吗?正常的话那就是在 npu 上面,cpu 会特别慢
不正常,一秒几个token吧,看htop用了两核cpu,npu把模型推进去了,显存有占用但是功率没有变化
一秒几个应该是正常速度
No branches or pull requests
Reminder
System Info
依赖版本
llamafactory version ==v0.7.1
torch ==2.1
torch_npu == 2.1.0.post3
系统版本
ubuntu22.0.4
机器信息
华为800T A2
8卡 Ascend 910b
CANN toolkit = Ascend-cann-toolkit_8.0.RC1_linux-aarch64.run
CANN kernels = Ascend-cann-kernels-910b_8.0.RC1_linux.run
Reproduction
运行代码
ASCEND_RT_VISIBLE_DEVICES=0 llamafactory-cli chat llama2.yaml
llama2.yaml内容:
命令成功运行·但是提示使用cpu推理而不是npu
![image](https://private-user-images.githubusercontent.com/28583005/337858229-b99d46cb-b784-4181-8ee2-48c73f417e14.png?jwt=eyJhbGciOiJIUzI1NiIsInR5cCI6IkpXVCJ9.eyJpc3MiOiJnaXRodWIuY29tIiwiYXVkIjoicmF3LmdpdGh1YnVzZXJjb250ZW50LmNvbSIsImtleSI6ImtleTUiLCJleHAiOjE3MjEyMTQ4MjgsIm5iZiI6MTcyMTIxNDUyOCwicGF0aCI6Ii8yODU4MzAwNS8zMzc4NTgyMjktYjk5ZDQ2Y2ItYjc4NC00MTgxLThlZTItNDhjNzNmNDE3ZTE0LnBuZz9YLUFtei1BbGdvcml0aG09QVdTNC1ITUFDLVNIQTI1NiZYLUFtei1DcmVkZW50aWFsPUFLSUFWQ09EWUxTQTUzUFFLNFpBJTJGMjAyNDA3MTclMkZ1cy1lYXN0LTElMkZzMyUyRmF3czRfcmVxdWVzdCZYLUFtei1EYXRlPTIwMjQwNzE3VDExMDg0OFomWC1BbXotRXhwaXJlcz0zMDAmWC1BbXotU2lnbmF0dXJlPTJjYWYwNjVhMWY4Y2JhOWY2NjBjNmRmZDY5ZTA4M2UzMDU5OGU5MDVhOGNlMDZlZGRhMWM0MDAyYjQ5NWY2MjEmWC1BbXotU2lnbmVkSGVhZGVycz1ob3N0JmFjdG9yX2lkPTAma2V5X2lkPTAmcmVwb19pZD0wIn0.prAHmsX59-C28RE14GOvCBCYZXq7LlAnaLzoChJLqJA)
Expected behavior
能执行单卡推理llama2 多卡推理llama2
Others
No response
The text was updated successfully, but these errors were encountered: