@LoserCheems
Collaborator

Adds missing row stride parameters to offset calculations for ZOH and active mask tensors, ensuring proper memory layout access.

Improves code readability by standardizing comment formatting and alignment for tensor partition declarations.

Removes template parameter from DynamicMask class instantiation, simplifying the interface.
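
To illustrate why the row stride matters in an offset calculation, here is a minimal sketch. The `MaskStrides` struct and both helper functions are hypothetical names made up for this example; they are not the signatures of `zoh_offset` or `active_mask_offset` in this repository. The sketch assumes a dense layout indexed by batch, head, and query row.

```cpp
#include <cassert>
#include <cstdint>

// Hypothetical stand-in for the stride fields a kernel parameter struct
// might carry. All strides are counted in elements, not bytes.
struct MaskStrides {
    int64_t batch_stride;  // elements between consecutive batches
    int64_t head_stride;   // elements between consecutive heads
    int64_t row_stride;    // elements between consecutive query rows
};

// Offset of the first element of query-row block `m_block` for (batch, head).
// `block_m` is the number of rows per block. The last term is where a row
// stride enters under the assumed layout.
inline int64_t mask_offset(const MaskStrides& s, int64_t batch, int64_t head,
                           int64_t m_block, int64_t block_m) {
    assert(batch >= 0 && head >= 0 && m_block >= 0 && block_m > 0);
    return batch * s.batch_stride
         + head * s.head_stride
         + m_block * block_m * s.row_stride;
}

// The same calculation with the row term dropped, for comparison only.
// Every row block of a given (batch, head) maps to the same address.
inline int64_t mask_offset_without_row_stride(const MaskStrides& s,
                                              int64_t batch, int64_t head) {
    assert(batch >= 0 && head >= 0);
    return batch * s.batch_stride + head * s.head_stride;
}
```

With strides `{4096, 512, 64}` (8 heads, 8 query rows, 64 key columns), batch 1, head 2, row block 1, and 4 rows per block, the first function yields 5376 while the second yields 5120, which is the start of row block 0. An offset helper that never receives the row stride cannot express that difference.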

Contributor

Copilot AI left a comment

Pull Request Overview

Updates tensor offset calculations by including row stride parameters for ZOH and active mask tensors, standardizes comment formatting for tensor partition declarations, and simplifies the DynamicMask interface by removing an unnecessary template parameter.

  • Added #include "rotary.h" to support rotary embeddings.
  • Updated zoh_offset and active_mask_offset calls to accept row stride arguments.
  • Removed the Kernel_traits::kNThreads template parameter from DynamicMask.

Comments suppressed due to low confidence (2)

csrc/src/flash_fwd_kernel.h:387

  • Please update any related documentation or usage examples to remove the second template parameter from DynamicMask, reflecting the new simplified interface.
    FLASH_NAMESPACE::DynamicMask<Is_causal> dynamic_mask(

csrc/src/flash_fwd_kernel.h:22

  • [nitpick] Consider placing this new include in the correct alphabetical or grouped order to match the project's include style conventions.
#include "rotary.h"

@LoserCheems LoserCheems merged commit f65a85b into main Jun 23, 2025
Labels: bug (Something isn't working)
