Skip to content

Commit

Permalink
Update README.md
Browse files Browse the repository at this point in the history
  • Loading branch information
manzilz committed Dec 15, 2020
1 parent c568f08 commit 103a334
Showing 1 changed file with 2 additions and 0 deletions.
2 changes: 2 additions & 0 deletions README.md
Expand Up @@ -152,5 +152,7 @@ used to set the number of random blocks. The code currently uses window size of

Important points to note:
* Hidden dimension should be divisible by the number of heads.
* Currently the code only handles tensors of static shape as it is primarily designed
for TPUs which only works with statically shaped tensors.
* For sequene length less than 1024, using `original_full` is advised as there
is no benefit in using sparse BigBird attention.

0 comments on commit 103a334

Please sign in to comment.