Skip to content

ChatGLM-6B ONNX Models

Pre-release
Pre-release

Choose a tag to compare

@wangzhaode wangzhaode released this 29 Mar 03:17
· 193 commits to master since this release
6229dc3

ChatGLM-6B导出的ONNX模型,权重使用fp32保存;使用顺序如下:

embedding -> block_0 -> ... -> block_27 -> lm_head

提供ONNX便于在其他框架上进行部署与测试。

onnx模型转移到 https://github.com/wangzhaode/llm-export 项目中。