This repository contains the code (not yet cleaned up) for the paper "Efficiently Dispatching Flash Attention For Partially Filled Attention Masks".
- Please use requirements.txt to install the necessary libraries. The code should work with other package versions, but requirements.txt lists the versions we tested on.
- The folder "binBlkMask_codes" contains the Triton kernels for binary block masking. Within it, the "base_" and "dense_" variants are the versions used in the paper; for most applications, stick to "base_".
- "triton_kernels" folder has the triton kernel for pre-preprocessing masks into binary blocks. Please use "binBlkMask_kernels"
- For details on how to use these functions, please refer to "benchmarking_codes"; within that folder, "benchmark_longformer.py" is relatively easy to follow. A rough end-to-end sketch is also given below.
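
As a conceptual illustration of what the preprocessing computes, here is a minimal pure-PyTorch sketch (not the repo's Triton kernel; the function name `dense_mask_to_binary_blocks` and the block sizes are ours): it marks which attention-mask tiles contain at least one attended position, which is the binary block structure the attention kernels dispatch on.

```python
# Minimal pure-PyTorch sketch of the preprocessing step (illustration only;
# the repo's actual implementation is the Triton kernel in "triton_kernels").
import torch

def dense_mask_to_binary_blocks(mask: torch.Tensor,
                                block_m: int = 128,
                                block_n: int = 64) -> torch.Tensor:
    """mask: (seq_q, seq_k) boolean attention mask (True = attend).

    Returns a (seq_q // block_m, seq_k // block_n) boolean tensor that is
    True for every tile containing at least one attended position, so a
    flash-attention-style kernel can skip the all-False tiles entirely.
    """
    seq_q, seq_k = mask.shape
    assert seq_q % block_m == 0 and seq_k % block_n == 0, "pad the mask first"
    tiles = mask.view(seq_q // block_m, block_m, seq_k // block_n, block_n)
    # A tile is "active" if any of its block_m * block_n entries is True.
    return tiles.any(dim=3).any(dim=1)

# Example: a mask where every query attends only to a 256-token prefix.
mask = torch.zeros(1024, 1024, dtype=torch.bool)
mask[:, :256] = True
blocks = dense_mask_to_binary_blocks(mask)  # shape (8, 16); only blocks[:, :4] are True
```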
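
And here is a rough sketch of how the pieces fit together. The import paths and the function names `create_binary_block_mask` and `binary_block_attention` are hypothetical placeholders (hence commented out); check "benchmark_longformer.py" for the real module names and signatures.

```python
# Hypothetical end-to-end workflow; the real function names and signatures
# live in "binBlkMask_codes" and "triton_kernels" -- see benchmark_longformer.py.
import torch

# Placeholder imports (assumed names, substitute the actual ones from the repo):
# from triton_kernels.binBlkMask_kernels import create_binary_block_mask
# from binBlkMask_codes.base_binBlkMask import binary_block_attention

q = torch.randn(1, 8, 4096, 64, device="cuda", dtype=torch.float16)  # (batch, heads, seq, dim)
k, v = torch.randn_like(q), torch.randn_like(q)
mask = torch.rand(4096, 4096, device="cuda") < 0.05  # a sparse, partially filled mask

# 1) Preprocess the dense mask into binary blocks (done once per mask):
# block_mask = create_binary_block_mask(mask)
# 2) Run the masked flash-attention kernel, which skips the empty blocks:
# out = binary_block_attention(q, k, v, mask, block_mask)
```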