We read every piece of feedback, and take your input very seriously.
To see all available qualifiers, see our documentation.
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
你好,请问有W4A16与FP16矩阵乘的具体性能对比数据吗?
The text was updated successfully, but these errors were encountered:
在A30上,m=1,n=16384,k=4096 FP16带宽大概750GB/s, W4A16大概是600GB/s。
Sorry, something went wrong.
谢谢回复,请问具体的FP16矩阵乘和W4A16,W2A16矩阵乘,总体时间上加速如何呢?
No branches or pull requests
你好,请问有W4A16与FP16矩阵乘的具体性能对比数据吗?
The text was updated successfully, but these errors were encountered: