[Flex Attention] Change the semantics of BlockMask/Adjust  #141435

@drisspg

Description

Summary

Currently, BlockMasks chunk sequences along seq_len_q and seq_len_kv. We need to update the attributes to also store the valid seq_len_q and the valid seq_len_kv, to remove ambiguity and make the current abstraction zero-cost with opt-in properties.
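To make the ambiguity concrete, here is a minimal sketch using the existing `create_block_mask` API (run on CPU for portability). The `seq_len_q` / `seq_len_kv` attribute names at the end are hypothetical stand-ins for the proposed opt-in properties, not an existing API:

```python
import torch
from torch.nn.attention.flex_attention import create_block_mask

def causal(b, h, q_idx, kv_idx):
    # Standard causal mask: a query may attend to keys at or before it.
    return q_idx >= kv_idx

# BlockMask metadata is stored at BLOCK_SIZE granularity (128 by default),
# so a valid length of 1000 rounds up to 8 blocks (1024 padded slots) --
# the same metadata as any valid length in (896, 1024].
block_mask = create_block_mask(causal, B=1, H=1, Q_LEN=1000, KV_LEN=1000, device="cpu")

# Existing attributes describe blocks, not valid element counts:
print(block_mask.kv_num_blocks.shape)  # per-(B, H, q_block) block counts

# Proposed (hypothetical names): also record the valid lengths so callers
# can recover them without out-of-band bookkeeping:
#   block_mask.seq_len_q  -> 1000
#   block_mask.seq_len_kv -> 1000
```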

Will fill out the rest.

Metadata

Labels

module: flex attention, module: higher order operators (torch.cond and similar), module: pt2-dispatcher (PT2 dispatcher-related issues, e.g. aotdispatch, functionalization, faketensor, custom op), oncall: pt2, triaged (this issue has been looked at by a team member, and triaged and prioritized into an appropriate module)
