Hi, I have a question: `vld1q_f32` loads four f32 values, so how does that work for this 3x3 kernel, where each row has only three weights?
ncnn/src/layer/arm/convolution_3x3.h
Lines 66 to 68 in e2e8e1b
It looks to me like there is a problem in the multiply-accumulate step.
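The kernel weights are loaded as a vector, and the convolution input is also loaded as a vector. In the computation itself, however, a single scalar taken from the kernel weight vector is multiplied by the input vector; it is not a direct vector-by-vector multiply.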
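To make the scalar-times-vector pattern concrete, here is a minimal portable sketch in plain C++ that only models the lane semantics. It is not the ncnn source. `Vec4`, `load4`, `mla_lane`, and `conv_row4` are hypothetical names introduced for illustration: `load4` stands in for `vld1q_f32`, and `mla_lane` stands in for a by-lane multiply-accumulate such as `vmlaq_lane_f32`.

```cpp
#include <array>
#include <cstddef>

// Hypothetical stand-in for float32x4_t (four f32 lanes).
using Vec4 = std::array<float, 4>;

// Models vld1q_f32: loads four consecutive floats starting at p.
inline Vec4 load4(const float* p) {
    return Vec4{p[0], p[1], p[2], p[3]};
}

// Models a by-lane multiply-accumulate (for example vmlaq_lane_f32):
// every lane of `in` is multiplied by the single scalar k[lane].
inline Vec4 mla_lane(Vec4 acc, const Vec4& in, const Vec4& k, std::size_t lane) {
    for (std::size_t i = 0; i < 4; ++i) {
        acc[i] += in[i] * k[lane];
    }
    return acc;
}

// Accumulates one kernel row into four adjacent output pixels.
// Assumes `row` has at least 6 readable floats and `krow` at least 4.
// Only k[0], k[1], k[2] are used; the fourth loaded lane is ignored.
inline Vec4 conv_row4(Vec4 acc, const float* row, const float* krow) {
    const Vec4 k = load4(krow);
    acc = mla_lane(acc, load4(row),     k, 0);  // inputs x..x+3   times k[0]
    acc = mla_lane(acc, load4(row + 1), k, 1);  // inputs x+1..x+4 times k[1]
    acc = mla_lane(acc, load4(row + 2), k, 2);  // inputs x+2..x+5 times k[2]
    return acc;
}
```

Calling `conv_row4` once per kernel row (three times in total) accumulates a full 3x3 window for four outputs. Because the fourth lane of each weight load never enters the arithmetic, a four-wide load of a three-weight row does not change the result, provided the extra element is readable memory.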
See https://github.com/Tencent/ncnn/wiki/armv7-neon-intrinsics-%E5%92%8C%E5%86%85%E5%B5%8C%E6%B1%87%E7%BC%96%E6%B7%B7%E7%94%A8. In the 3x3 case, the q registers are used in a single-path fashion here.