Simplify attention mask and bias parameter naming #76
Conversation
Removes redundant `attn_` prefixes from mask and bias parameter names to improve code readability and consistency. Also removes the unused `keep_window_size` field from the `Mask_params` struct.
Renames attention mask and bias parameters from `attn_mask`/`attn_bias` to `mask`/`bias` for improved clarity and consistency throughout the flash attention API. Removes the unused `keep_window_size` parameter from function signatures and parameter structures to clean up the interface.
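A minimal sketch of what the renamed parameter structs might look like after this change. The member list is an illustrative assumption, not the exact contents of `csrc/src/flash.h`; only the `attn_` prefix removal and the dropped field are from the PR itself:

```cpp
#include <cstdint>

using index_t = int64_t;  // assumed, matching the index type used by the kernels

struct Mask_params {
    void *__restrict__ mask_ptr;   // was attn_mask_ptr
    index_t mask_batch_stride;     // was attn_mask_batch_stride
    index_t mask_head_stride;      // was attn_mask_head_stride
    index_t mask_row_stride;       // was attn_mask_row_stride
    // keep_window_size removed: it was never read by the kernels
};

struct Bias_params {
    void *__restrict__ bias_ptr;   // was attn_bias_ptr
    index_t bias_batch_stride;     // was attn_bias_batch_stride
    index_t bias_head_stride;      // was attn_bias_head_stride
    index_t bias_row_stride;       // was attn_bias_row_stride
};
```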
Renames `attn_mask_offset` to `mask_offset` and `attn_bias_offset` to `bias_offset` to improve code readability and reduce verbosity while maintaining the same functionality.
Simplifies parameter naming by removing the `attn_` prefix from mask- and bias-related variables throughout the flash attention kernel. Updates all references to use the shorter naming convention:
- `attn_mask_*` becomes `mask_*`
- `attn_bias_*` becomes `bias_*`

Improves code readability and maintains consistency across parameter names while preserving all existing functionality.
Changes the `row_stride` and `col_stride` parameters from `int` to the `index_t` template type in the `mask_offset` and `bias_offset` methods. This ensures type consistency across all stride parameters and eliminates potential type mismatches in offset calculations.
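The widening matters in practice: with 32-bit strides, an offset such as `row * row_stride` can exceed `INT_MAX` for long sequences. A self-contained host-side illustration, with made-up sizes, of the truncation an `int` offset would suffer:

```cpp
#include <cstdint>
#include <cstdio>

using index_t = int64_t;  // assumed, matching the kernels' index type

int main() {
    index_t row_stride = 1 << 17;  // e.g. seqlen_k = 131072 elements per row
    index_t row        = 1 << 15;  // query row 32768

    index_t offset  = row * row_stride;              // 2^32: needs 64 bits
    int32_t wrapped = static_cast<int32_t>(offset);  // what an int offset would hold

    std::printf("index_t offset: %lld\n", static_cast<long long>(offset));  // 4294967296
    std::printf("int offset:     %d\n", wrapped);                           // 0 (wrapped)
    return 0;
}
```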
Pull Request Overview
Refactors attention mask and bias parameter names by removing redundant `attn_` prefixes to improve code readability and consistency across the flash attention implementation. Additionally removes the unused `keep_window_size` parameter to streamline the API interface.
- Remove the `attn_` prefix from mask and bias parameter names throughout the codebase
- Update corresponding function signatures and struct member names for consistency
- Eliminate the unused `keep_window_size` parameter from the API
Reviewed Changes
Copilot reviewed 4 out of 4 changed files in this pull request and generated 2 comments.
| File | Description |
|---|---|
| csrc/src/flash_fwd_kernel.h | Update parameter references to use the simplified mask/bias naming in kernel functions |
| csrc/src/flash.h | Rename struct members in `Mask_params` and `Bias_params`; remove the unused `keep_window_size` field |
| csrc/src/block_info.h | Rename `attn_mask_offset`/`attn_bias_offset` to `mask_offset`/`bias_offset` |
| csrc/flash_api.cpp | Update function parameters and internal references to use the simplified naming; remove the `keep_window_size` parameter |
```diff
 __forceinline__ __device__ index_t mask_offset(const index_t batch_stride, const index_t row_stride, const index_t col_stride, const int bidb) const {
     index_t offset = sum_s_q == -1 ? bidb * batch_stride : uint32_t(sum_s_q) * row_stride;
     sum_s_k == -1 ? offset += leftpad_k * col_stride : offset += uint32_t(sum_s_k + leftpad_k) * col_stride;
     return offset;
 }

 template <typename index_t>
-__forceinline__ __device__ index_t attn_bias_offset(const index_t batch_stride, const int row_stride, const int col_stride, const int bidb) const {
+__forceinline__ __device__ index_t bias_offset(const index_t batch_stride, const index_t row_stride, const index_t col_stride, const int bidb) const {
```
Copilot AI · Jul 28, 2025
The parameter type for `row_stride` was changed from `int` to `index_t`, but this is inconsistent with the `col_stride` parameter, which remains `int`. For consistency, all stride parameters should have the same type.
```diff
 template <typename index_t>
-__forceinline__ __device__ index_t attn_bias_offset(const index_t batch_stride, const int row_stride, const int col_stride, const int bidb) const {
+__forceinline__ __device__ index_t bias_offset(const index_t batch_stride, const index_t row_stride, const index_t col_stride, const int bidb) const {
```
Copilot AI · Jul 28, 2025
The parameter types for `row_stride` and `col_stride` were changed from `int` to `index_t`, but this creates an inconsistency: `bidb` remains `int` while the other parameters use `index_t`. Consider using consistent types for all index-related parameters.
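A sketch of the fully consistent variant the reviewer is suggesting, shown as a hypothetical standalone struct. The surrounding `BlockInfo` members (`sum_s_q`, `sum_s_k`, `leftpad_k`) are reproduced minimally from the diff context, and widening `bidb` is the reviewer's proposal, not something this PR adopts:

```cuda
#include <cstdint>

struct BlockInfoSketch {
    int sum_s_q;    // -1 when sequences are not packed (assumed semantics)
    int sum_s_k;
    int leftpad_k;

    // Every index-like argument widened to index_t, per the review comment.
    template <typename index_t>
    __forceinline__ __device__ index_t bias_offset(const index_t batch_stride,
                                                   const index_t row_stride,
                                                   const index_t col_stride,
                                                   const index_t bidb) const {
        index_t offset = sum_s_q == -1 ? bidb * batch_stride
                                       : index_t(uint32_t(sum_s_q)) * row_stride;
        offset += sum_s_k == -1 ? index_t(leftpad_k) * col_stride
                                : index_t(uint32_t(sum_s_k + leftpad_k)) * col_stride;
        return offset;
    }
};
```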
Remove redundant prefixes from attention mask and bias parameters to enhance code readability and consistency. Eliminate unused parameters to streamline the interface across the flash attention API.