Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[TOPI] Add dense schedule for fp16 and fp32 using gemm #17091

Merged
merged 15 commits into from
Jul 9, 2024

Conversation

eirenevp
Copy link
Contributor

Add a new schedule for the dense operator
based on the gemm algorithm.

Add a new schedule for the dense operator
based on the gemm algorithm.

Change-Id: Iaf4423d21d20b5813c77a0a27c4751f8cbd1d8b8
@ekalda
Copy link
Contributor

ekalda commented Jun 13, 2024

cc @Anndrey24 @lhutton1 @leandron

Change-Id: I545a805f40197db01d91ce622a890754d4a3b901
Change-Id: Ica02890f132b67e3c9bd64f3fb98ee8c060e8c05
Copy link
Contributor

@Anndrey24 Anndrey24 left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Thanks for the changes, @eirenevp ! It's great to finally have an arm_cpu-specific dense schedule which we could possibly extend to also generate SVE instructions in the future.
I've just left a few comments based on my experience with the conv2d schedule.

cmake/config.cmake Outdated Show resolved Hide resolved
python/tvm/relay/op/strategy/arm_cpu.py Outdated Show resolved Hide resolved
python/tvm/relay/op/strategy/arm_cpu.py Outdated Show resolved Hide resolved
python/tvm/topi/arm_cpu/dense_alter_op.py Outdated Show resolved Hide resolved
python/tvm/topi/arm_cpu/dense_gemm.py Outdated Show resolved Hide resolved
python/tvm/topi/arm_cpu/dense_gemm.py Outdated Show resolved Hide resolved
tests/python/relay/test_dense.py Outdated Show resolved Hide resolved
tests/python/relay/test_dense.py Outdated Show resolved Hide resolved
tests/python/relay/test_dense.py Outdated Show resolved Hide resolved
python/tvm/topi/arm_cpu/dense_gemm.py Outdated Show resolved Hide resolved
Change-Id: I11126864bdcfec1c88dbc39bafa18c8b78bbe5fc
Change-Id: Ibbcfa2a94d14c9013e231c95725b0f651ae4ff46
Change-Id: Iaefc79b7c415d4cdca2917b5c5fa5842eba2f71d
Change-Id: Ia6468fe6cf113781819c871e1f110bf4e98a3d59
Change-Id: I3e5c532ed041c691237bf04fda4645bf25befbcc
Change-Id: I1217eaec3b63eccfa32bcbf3b410f09fa0997f36
Change-Id: Iccd59c8afdd70a98da031ff2d0cf8f5d3378a700
Change-Id: I8851b227a9dacf90046e4d3d1b0418355145e79a
Change-Id: Ib50606a829142202142b326a111580f9eec72f8c
Change-Id: I44bf4bfbf85b5ab4fa91d653af25ae22f01ed79a
Change-Id: Ie85f7d5c6d0e99fc389bd9b3f29787065e1e0cd0
Change-Id: I170e09358b61217586adc3591c020b065e2f5d3e
Copy link
Contributor

@ekalda ekalda left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM, thanks for all the work on this @eirenevp!

@ekalda ekalda merged commit c4e6f96 into apache:main Jul 9, 2024
19 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

4 participants