Skip to content

OCP FP8 support for gfx12.#1710

Merged
illsilin merged 360 commits into
developfrom
merge_from_internal
Dec 3, 2024
Merged

OCP FP8 support for gfx12.#1710
illsilin merged 360 commits into
developfrom
merge_from_internal

Conversation

@illsilin
Copy link
Copy Markdown
Collaborator

@illsilin illsilin commented Dec 2, 2024

These changes will enable the OCP FP8 data type support on gfx12 architectures.

aska-0096 added 30 commits May 19, 2023 06:45
* sanity pass

* sanity pass 2

* confirm significant performance regression.

* turn on all instances

* turn off instance format

* Fix bug & tunning & format

* DML meta, self_attn+cross_attn

* sanity pass

* remove useless flag

* update tile and problem size used in AIT attention

* bug fix in grouped conv supporting check
1. example, fmha
2. gridwise pipeline
3. deviceop, fmha, change some containers from vector to array
@illsilin illsilin merged commit 08d5c02 into develop Dec 3, 2024
@illsilin illsilin deleted the merge_from_internal branch December 7, 2024 01:36
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

5 participants