Improves API parameter naming consistency #96

LoserCheems · 2025-08-09T03:36:10Z

Renames q/k/v parameters to query/key/value in flash attention functions for better readability and standardization.

Updates parameter documentation to reflect the new naming convention and fixes GQA condition description to use <= instead of <.

Removes outdated footer reference to integration docs.

Renames q/k/v parameters to query/key/value in flash attention functions for better readability and standardization. Updates parameter documentation to reflect the new naming convention and fixes GQA condition description to use <= instead of <. Removes outdated footer reference to integration docs.

Copilot

Pull Request Overview

This PR improves API consistency by renaming abbreviated parameter names in flash attention functions to their full descriptive names (q/k/v → query/key/value). This enhances code readability and follows standard naming conventions.

Renames abbreviated parameter names to full descriptive names in function signatures
Updates parameter documentation to reflect the new naming convention
Fixes GQA condition description and removes outdated footer reference

Copilot · 2025-08-09T03:36:35Z

docs/api_reference.md

+- key: (B, K, H_kv, D). Same dtype/device as query; GQA when H_kv <= H
+- value: (B, K, H_kv, D). Same dtype/device as query; GQA when H_kv <= H


The GQA condition description should clarify what 'H' refers to. It should specify 'H_kv <= num_heads' or reference the query tensor's head dimension to avoid ambiguity.

Suggested change

- key: (B, K, H_kv, D). Same dtype/device as query; GQA when H_kv <= H

- value: (B, K, H_kv, D). Same dtype/device as query; GQA when H_kv <= H

- key: (B, K, H_kv, D). Same dtype/device as query; GQA when H_kv <= H (number of query heads in the query tensor)

- value: (B, K, H_kv, D). Same dtype/device as query; GQA when H_kv <= H (number of query heads in the query tensor)

Copilot · 2025-08-09T03:36:36Z

docs/api_reference.md

+- key: (B, K, H_kv, D). Same dtype/device as query; GQA when H_kv <= H
+- value: (B, K, H_kv, D). Same dtype/device as query; GQA when H_kv <= H


The GQA condition description should clarify what 'H' refers to. It should specify 'H_kv <= num_heads' or reference the query tensor's head dimension to avoid ambiguity.

Suggested change

- key: (B, K, H_kv, D). Same dtype/device as query; GQA when H_kv <= H

- value: (B, K, H_kv, D). Same dtype/device as query; GQA when H_kv <= H

- key: (B, K, H_kv, D). Same dtype/device as query; GQA when H_kv <= H (number of query heads)

- value: (B, K, H_kv, D). Same dtype/device as query; GQA when H_kv <= H (number of query heads)

Copilot AI review requested due to automatic review settings August 9, 2025 03:36

LoserCheems assigned LoserCheems and Copilot Aug 9, 2025

Copilot AI reviewed Aug 9, 2025

View reviewed changes

LoserCheems assigned SNHuan and wubingheng111 Aug 9, 2025

LoserCheems merged commit 5ba76a0 into main Aug 9, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Improves API parameter naming consistency #96

Improves API parameter naming consistency #96

Uh oh!

LoserCheems commented Aug 9, 2025

Uh oh!

Copilot AI left a comment

Uh oh!

Copilot AI Aug 9, 2025

Uh oh!

Copilot AI Aug 9, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

		- key: (B, K, H_kv, D). Same dtype/device as query; GQA when H_kv <= H
		- value: (B, K, H_kv, D). Same dtype/device as query; GQA when H_kv <= H

Improves API parameter naming consistency #96

Improves API parameter naming consistency #96

Uh oh!

Conversation

LoserCheems commented Aug 9, 2025

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull Request Overview

Uh oh!

Copilot AI Aug 9, 2025

Choose a reason for hiding this comment

Uh oh!

Copilot AI Aug 9, 2025

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants