We read every piece of feedback, and take your input very seriously.
To see all available qualifiers, see our documentation.
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
No description provided.
The text was updated successfully, but these errors were encountered:
通过transformers的自动模型切分功能可以很方便地实现多卡推理,您可以参考 moss_cli_demo.py 中的第 24 至第 29 行以获得更多帮助。
transformers
Sorry, something went wrong.
可以多卡 sft 么?目前试了下,似乎基于 accelerate 的 device_map 做 pipeline 并行的方法会造成 input 跟 下一个 block 的 layernorm 发生 different device 的错误。这个如何解决呢?谢谢~
device_map
No branches or pull requests
No description provided.
The text was updated successfully, but these errors were encountered: