graph : ensure DS32 kq_mask_lid is F32 by CISC · Pull Request #23864 · ggml-org/llama.cpp

CISC · 2026-05-29T10:47:57Z

Overview

Additional information

Since build_attn_inp_kq_mask returns F16 mask when flash attention is enabled, pass a modified copy of cparams for kq_mask_lid.

llama.cpp/src/models/deepseek32.cpp

Lines 341 to 344 in 1f0aa2a

    
           // mask indexer scores 
        
           ggml_tensor * indexer_kq_mask = inp_attn_dsa->get_kq_mask_lid(); 
        
           indexer_score = ggml_add(ctx0, indexer_score, indexer_kq_mask); 
        
           cb(indexer_score, "indexer_score", il);

This is a bit hacky, open for better solutions. cc/ @am17an

Requirements

I have read and agree with the contributing guidelines
AI usage disclosure: mboten

am17an · 2026-05-29T11:17:07Z

Does this mask need to be f32?

CISC · 2026-05-29T11:21:34Z

Does this mask need to be f32?

Either that or we have to cast indexer_score to F16.

fairydreaming · 2026-05-29T16:03:15Z

So... I checked how DeepSeek V3.2 works in master (a couple of hours too late) and ended up here. But this PR helps, ggml_cuda_op_add error is gone.

ensure DS32 kq_mask_lid is F32

d19c6cb

CISC requested a review from ggerganov May 29, 2026 10:48

am17an approved these changes May 29, 2026

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

graph : ensure DS32 kq_mask_lid is F32#23864

graph : ensure DS32 kq_mask_lid is F32#23864
CISC wants to merge 1 commit into
masterfrom
cisc/graph-ds32-lid-mask-fix

CISC commented May 29, 2026 •

edited

Loading

Uh oh!

am17an commented May 29, 2026

Uh oh!

CISC commented May 29, 2026

Uh oh!

fairydreaming commented May 29, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

	// mask indexer scores
	ggml_tensor * indexer_kq_mask = inp_attn_dsa->get_kq_mask_lid();
	indexer_score = ggml_add(ctx0, indexer_score, indexer_kq_mask);
	cb(indexer_score, "indexer_score", il);

Conversation

CISC commented May 29, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Overview

Additional information

Requirements

Uh oh!

am17an commented May 29, 2026

Uh oh!

CISC commented May 29, 2026

Uh oh!

fairydreaming commented May 29, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

CISC commented May 29, 2026 •

edited

Loading