Gated Attention

Implementation of the paper "Not All Attention Is Needed: Gated Attention Network for Sequence Data" (GA-Net).

Flow Diagram for the network:

There are two networks in the model:

Backbone Network
Auxiliary Network
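
As a rough illustration of how the two networks could fit together, the NumPy sketch below has an auxiliary network produce a gate probability per token and sample a binary gate, and a backbone network compute attention only over tokens whose gate is open. Everything here is an assumption made for illustration: the function names (`auxiliary_network`, `backbone_attention`), the single linear layer, the dot-product scoring, and the hard Bernoulli sampling are hypothetical and are not taken from the code in this repository.

```python
import numpy as np


def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))


def auxiliary_network(tokens, w_gate, rng):
    """Hypothetical auxiliary network: a single linear layer that maps each
    token to the probability that its gate is open, then samples a gate."""
    gate_prob = sigmoid(tokens @ w_gate)  # shape: (seq_len,)
    # Hard Bernoulli sample (an assumption; not differentiable as written).
    gates = (rng.random(gate_prob.shape) < gate_prob).astype(float)
    return gate_prob, gates


def backbone_attention(tokens, query, gates):
    """Hypothetical backbone attention: softmax over dot-product scores,
    restricted to the tokens whose gate is open."""
    scores = tokens @ query  # shape: (seq_len,)
    exp_scores = np.exp(scores - scores.max()) * gates
    total = exp_scores.sum()
    if total == 0.0:
        # Every gate is closed, so no token receives attention.
        return np.zeros_like(scores)
    return exp_scores / total


rng = np.random.default_rng(0)
tokens = rng.normal(size=(6, 4))  # 6 tokens, 4-dimensional embeddings
w_gate = rng.normal(size=4)       # auxiliary network parameters
query = rng.normal(size=4)        # backbone query vector

gate_prob, gates = auxiliary_network(tokens, w_gate, rng)
weights = backbone_attention(tokens, query, gates)

print("gate probability:", np.round(gate_prob, 3))
print("sampled gates:   ", gates)
print("gated attention: ", np.round(weights, 3))
```

Tokens with a closed gate receive a weight of exactly zero, and the remaining weights are renormalized over the open tokens.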

Comparison with soft attention network:

Soft attention assigns a non-zero weight (low or high) to every input token, whereas the gated attention network selects the most important tokens and attends only to those.
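
The difference can be seen on a toy example. The sketch below is a minimal illustration, assuming hand-picked attention scores and a hand-picked binary gate vector; the helper names `soft_attention` and `gated_attention` are hypothetical and do not refer to functions in this repository.

```python
import numpy as np


def soft_attention(scores):
    """Standard softmax: every token gets a strictly positive weight."""
    e = np.exp(scores - np.max(scores))
    return e / e.sum()


def gated_attention(scores, gates):
    """Softmax restricted to tokens with an open gate (gate == 1).
    Assumes at least one gate is open."""
    e = np.exp(scores - np.max(scores)) * gates
    return e / e.sum()


scores = np.array([2.0, 0.5, -1.0, 1.5])  # illustrative attention scores
gates = np.array([1.0, 0.0, 0.0, 1.0])    # illustrative gates: tokens 0 and 3 open

soft = soft_attention(scores)
gated = gated_attention(scores, gates)

print("soft attention: ", np.round(soft, 3))
print("gated attention:", np.round(gated, 3))
```

In the soft case all four tokens share the probability mass. In the gated case the two closed tokens get exactly zero weight and the mass is split between the two open tokens.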

Gate Probability and gated attention:

Visualization of the probability that the gate is open for each input token, alongside the resulting gated attention weight.

keya-desai/Gated-Attention
