[GPU] Micro sdpa #24656

Open
wants to merge 4 commits into
base: master


vladimir-paramuzov
Contributor

@vladimir-paramuzov vladimir-paramuzov commented May 23, 2024

Details:

  • Added an SDPA implementation based on microkernels, using the internal oneDNN API and related infrastructure
  • Current limitations:
    • A fused transpose must not change the position of the innermost dimension (head size)
    • is_causal = true is not supported
    • fp16 only
    • The num heads dimension must be static
    • Indirect KV is not supported
  • Initial version of a KV Cache + SDPA functional test
  • Enabled Transpose+SDPA fusion for static shapes as well
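
The limitations listed above can be read as a set of preconditions for selecting the micro-kernel path. Below is a minimal sketch of such a check, not the plugin's actual code: `SdpaConfig`, its fields, and `micro_sdpa_unsupported_reasons` are hypothetical names introduced for illustration only. It also assumes that "must not change the position of the innermost dimension" means the last entry of the transpose order still points at the last axis.

```python
from dataclasses import dataclass
from typing import List, Optional, Sequence


@dataclass
class SdpaConfig:
    """Hypothetical description of one SDPA node (not an OpenVINO type)."""
    dtype: str                   # e.g. "f16" or "f32"
    is_causal: bool
    indirect_kv: bool            # True if K/V are accessed indirectly
    num_heads: Optional[int]     # None models a dynamic dimension
    # One fused transpose order per input, e.g. (0, 2, 1, 3);
    # None means that input has no fused transpose.
    transpose_orders: Sequence[Optional[Sequence[int]]] = ()


def micro_sdpa_unsupported_reasons(cfg: SdpaConfig) -> List[str]:
    """Return the listed limitations that cfg violates (empty list if none)."""
    reasons = []
    if cfg.dtype != "f16":
        reasons.append("fp16 only")
    if cfg.is_causal:
        reasons.append("is_causal = true is not supported")
    if cfg.indirect_kv:
        reasons.append("indirect KV is not supported")
    if cfg.num_heads is None:
        reasons.append("num heads dimension must be static")
    for order in cfg.transpose_orders:
        # Assumption: the head-size axis must stay innermost after the transpose.
        if order is not None and order[-1] != len(order) - 1:
            reasons.append("fused transpose moves the innermost dim (head size)")
            break
    return reasons
```

With this sketch, a config such as `SdpaConfig("f16", False, False, 32, [(0, 2, 1, 3)])` yields an empty list, while an fp32 or causal config yields one reason per violated limitation.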

Tickets:

@vladimir-paramuzov vladimir-paramuzov added this to the 2024.3 milestone May 23, 2024
@github-actions github-actions bot added the category: build OpenVINO cmake script / infra label May 23, 2024
@github-actions github-actions bot added the category: IE Tests OpenVINO Test: plugins and common label Jun 4, 2024
@vladimir-paramuzov vladimir-paramuzov force-pushed the micro_sdpa branch 5 times, most recently from 5dbb9c7 to d149dd4 Compare June 13, 2024 13:03
@github-actions github-actions bot removed the category: IE Tests OpenVINO Test: plugins and common label Jun 13, 2024
@vladimir-paramuzov vladimir-paramuzov force-pushed the micro_sdpa branch 2 times, most recently from 5cc640c to 6c596a9 Compare June 14, 2024 11:06
@vladimir-paramuzov vladimir-paramuzov marked this pull request as ready for review June 14, 2024 11:08
@vladimir-paramuzov vladimir-paramuzov requested review from a team as code owners June 14, 2024 11:08
@vladimir-paramuzov vladimir-paramuzov changed the title from "[WIP][GPU] Micro sdpa draft" to "[GPU] Micro sdpa draft" Jun 14, 2024
@vladimir-paramuzov vladimir-paramuzov changed the title from "[GPU] Micro sdpa draft" to "[GPU] Micro sdpa" Jun 14, 2024
Labels
category: build (OpenVINO cmake script / infra), category: GPU (OpenVINO GPU plugin), under_perf_check
Projects
None yet

2 participants