vulkan: add Vulkan backend for GGML_OP_GATED_DELTA_NET #2

Closed
ProgenyAlpha wants to merge 1 commit into am17an:gated_delta_net from ProgenyAlpha:vulkan-for-pr19504

@ProgenyAlpha

Adds Vulkan compute shader and ggml-vulkan.cpp wiring for the gated delta net recurrence op.

Changes

  • gated_delta_net.comp — new compute shader with dynamic head size, multi-token loop, GQA stride-based broadcast, inline exp(g) and sigmoid(beta) transforms
  • ggml-vulkan.cpp — push constants, pipeline creation, dispatch function, build graph case, supports_op
  • vulkan-shaders-gen.cpp — shader registration
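
For reviewers, here is a minimal CPU sketch of the per-head recurrence that the inline exp(g) and sigmoid(beta) transforms feed into. It assumes the commonly published gated delta rule, S_t = exp(g_t) * S_{t-1} + beta_t * (v_t - exp(g_t) * S_{t-1} k_t) k_t^T with output o_t = S_t q_t. This PR does not spell out the math, so the function name `gated_delta_net_ref`, the state layout, and the absence of any q scaling or normalization are all assumptions, not the shader's actual code.

```cpp
#include <cassert>
#include <cmath>
#include <cstddef>
#include <vector>

// Hypothetical CPU reference for ONE head; not code from this PR.
// state: row-major S x S matrix, state[i * S + j] (i: value dim, j: key dim).
// q, k, v: n_tokens * S floats each; g_raw, beta_raw: one float per token.
std::vector<float> gated_delta_net_ref(std::vector<float> & state,
                                       const std::vector<float> & q,
                                       const std::vector<float> & k,
                                       const std::vector<float> & v,
                                       const std::vector<float> & g_raw,
                                       const std::vector<float> & beta_raw,
                                       size_t S, size_t n_tokens) {
    assert(state.size() == S * S);
    assert(q.size() == n_tokens * S && k.size() == n_tokens * S && v.size() == n_tokens * S);
    assert(g_raw.size() == n_tokens && beta_raw.size() == n_tokens);

    std::vector<float> out(n_tokens * S, 0.0f);
    for (size_t t = 0; t < n_tokens; ++t) {           // multi-token loop
        const float * qt = &q[t * S];
        const float * kt = &k[t * S];
        const float * vt = &v[t * S];
        const float decay = std::exp(g_raw[t]);                        // inline exp(g)
        const float beta  = 1.0f / (1.0f + std::exp(-beta_raw[t]));    // inline sigmoid(beta)

        for (size_t i = 0; i < S; ++i) {
            float * row = &state[i * S];
            // Decay the state row, then predict v[i] from the decayed state.
            float pred = 0.0f;
            for (size_t j = 0; j < S; ++j) {
                row[j] *= decay;
                pred   += row[j] * kt[j];
            }
            // Delta rule: move the row toward the target value along k.
            const float delta = beta * (vt[i] - pred);
            float o = 0.0f;
            for (size_t j = 0; j < S; ++j) {
                row[j] += delta * kt[j];
                o      += row[j] * qt[j];
            }
            out[t * S + i] = o;
        }
    }
    return out;
}
```

Each row of the state is independent of the others, which is what makes one invocation per row (or per column, depending on layout) a natural mapping for a compute shader. The state is updated in place, so calling the function again continues the sequence.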

Test Results

7/7 test-backend-ops cases pass on NVIDIA RTX 3080 Ti:

  • head_size=128, n_seq_tokens=1, n_seqs=1, v_repeat=1, permuted=0 OK
  • head_size=64, n_seq_tokens=1, n_seqs=2, v_repeat=1, permuted=0 OK
  • head_size=64, n_seq_tokens=4, n_seqs=1, v_repeat=1, permuted=0 OK
  • head_size=64, n_seq_tokens=4, n_seqs=2, v_repeat=1, permuted=0 OK
  • head_size=32, n_seq_tokens=4, n_seqs=2, v_repeat=2, permuted=0 OK
  • head_size=64, n_seq_tokens=4, n_seqs=2, v_repeat=1, permuted=1 OK
  • head_size=64, n_seq_tokens=4, n_seqs=1, v_repeat=1, permuted=1 OK
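
The v_repeat=2 and permuted=1 cases exercise the stride-based GQA broadcast. As a rough illustration of the idea, the sketch below computes where a value head reads its q/k vector without materializing repeated heads. The helper `qk_offset`, the `qk_strides` struct, and the integer-division head mapping are hypothetical; the shader's real mapping is not shown in this PR, and a tiled layout would use a modulo by the q/k head count instead.

```cpp
#include <cassert>
#include <cstddef>

// Hypothetical strides of the q/k tensor, in elements (not bytes).
struct qk_strides {
    size_t head;   // distance between consecutive q/k heads
    size_t token;  // distance between consecutive tokens
};

// Element offset of the q/k vector that value head h_v reads at token t.
// ASSUMPTION: v_repeat consecutive value heads share one q/k head.
inline size_t qk_offset(size_t h_v, size_t t, size_t v_repeat, qk_strides s) {
    assert(v_repeat > 0);
    const size_t h_qk = h_v / v_repeat;      // broadcast: several v heads map to one q/k head
    return h_qk * s.head + t * s.token;      // strides absorb contiguous vs. permuted layouts
}
```

Because only the strides change between a contiguous and a permuted tensor, the same indexing expression covers both permuted=0 and permuted=1 inputs.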

Adds compute shader and full ggml-vulkan.cpp wiring for the gated delta
net recurrence op. Supports multi-token sequences, GQA stride-based
broadcast, and inline sigmoid/exp transforms.

Tested 7/7 on NVIDIA RTX 3080 Ti via test-backend-ops:
- head_size 32/64/128, single/multi token, single/multi seq, GQA, permuted
@ProgenyAlpha ProgenyAlpha deleted the vulkan-for-pr19504 branch February 16, 2026 23:42
@ProgenyAlpha ProgenyAlpha restored the vulkan-for-pr19504 branch February 16, 2026 23:42