Skip to content

Conversation

@ZuseZ4
Copy link
Collaborator

@ZuseZ4 ZuseZ4 commented Jun 15, 2023

No description provided.

@ZuseZ4
Copy link
Collaborator Author

ZuseZ4 commented Jun 15, 2023

@wsmoses are we sure that this is correct?

bool is_normal(llvm::IRBuilder<> &B, llvm::Value *trans) {

I just added handling for transposed matrices and my tests indicate that it's not.
I did modify the gemm_f_c_transpose_lacpy.ll testcase by setting transa to 'T' instead of 'N' (and A is modified, thus cached). In the T case the lacpy args to cache A should be switched as compared to the old 'N" testcase, but that's not happening.

Edit. sry, nvm. mixed up tests. Caching now works correct under transpositions.

@wsmoses
Copy link
Member

wsmoses commented Jun 15, 2023 via email

@wsmoses
Copy link
Member

wsmoses commented Jun 15, 2023 via email

@wsmoses
Copy link
Member

wsmoses commented Jun 15, 2023 via email

@ZuseZ4
Copy link
Collaborator Author

ZuseZ4 commented Jun 15, 2023

@wsmoses thanks. Behaves better now.

@ZuseZ4 ZuseZ4 requested a review from wsmoses June 16, 2023 20:57
@ZuseZ4 ZuseZ4 marked this pull request as ready for review June 16, 2023 20:58
@ZuseZ4 ZuseZ4 changed the title wip, use tg declarations to link mat or vec dimensions use tg declarations to handle mat and vec dimensions Jun 16, 2023
@ZuseZ4 ZuseZ4 enabled auto-merge (squash) June 16, 2023 21:22
@ZuseZ4
Copy link
Collaborator Author

ZuseZ4 commented Jun 16, 2023

This now also has enough to remove this one: (ld $A, $transa, $lda, $m, $n),,
but since we already solved that I'll focus on nested rules before cleaning working code up.

@ZuseZ4 ZuseZ4 merged commit 8bd86b8 into main Jun 19, 2023
@tgymnich
Copy link
Member

this broke both macOS build and macOS tests

@ZuseZ4
Copy link
Collaborator Author

ZuseZ4 commented Jun 20, 2023

weren't they broken already for a while @tgymnich ?
Ah I see, they got fixed in between.

@tgymnich
Copy link
Member

tgymnich commented Jun 20, 2023

Yeah i fixed them last WE. The Xcode build is the only one still broken (but thats just the cmake arg order).

@ZuseZ4 ZuseZ4 deleted the tblgen-trans branch August 9, 2023 18:50
MilesCranmer pushed a commit to MilesCranmer/Enzyme that referenced this pull request Jul 24, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

4 participants