Han Xiao artex.xh@gmail.com
This is a TensorFlow implementation of Wenpeng Yin's paper "Attentive Convolution", published in TACL 2018. Wenpeng's original code is written in Theano.
I only implement the light attentive convolution described in Sect. 3.1 of the paper. The authors argue that even this light version of AttConv outperforms some pioneering attentive RNNs in both the intra-context (context = query, i.e. self-attention) and the extra-context (context != query) settings. A figure in the paper illustrates this idea.
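To make the mechanism concrete, here is a minimal numpy sketch of the light AttConv idea: each query position attends over the context, and the resulting attentive context vector enters the convolution alongside the usual k-gram window. All names (`light_attentive_conv`, `W`, `Wa`) are hypothetical, and plain dot-product energies with a tanh nonlinearity are assumed for illustration; the paper's exact energy function and the repo's TF code may differ.

```python
import numpy as np

def softmax(x, axis=-1):
    e = np.exp(x - x.max(axis=axis, keepdims=True))
    return e / e.sum(axis=axis, keepdims=True)

def light_attentive_conv(query, context, W, Wa, b, k=3):
    """Sketch of light AttConv.

    query:   (Tq, d) hidden states of the query sequence
    context: (Tc, d) hidden states of the context (== query for self-attention)
    W:  (h, k*d) filter over the concatenated k-gram window
    Wa: (h, d)   weights applied to the attentive context vector
    """
    scores = query @ context.T          # (Tq, Tc) dot-product energies (assumed)
    attn = softmax(scores, axis=-1)     # row-wise attention weights
    attended = attn @ context           # (Tq, d) attentive context per position
    pad = k // 2
    q = np.pad(query, ((pad, pad), (0, 0)))  # same-length output
    out = []
    for i in range(query.shape[0]):
        window = q[i:i + k].reshape(-1)      # concatenated k-gram window
        out.append(np.tanh(W @ window + Wa @ attended[i] + b))
    return np.stack(out)
```

Setting `context = query` gives the intra-context (self-attention) case; passing a different sequence gives the extra-context case.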
Nothing big, but I did add some features:

- a `dropout-resnet-layernorm` block before the output;
- masking to ensure causality, so that the layer can also be used for decoding.

By default, these features are all disabled.
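The two optional features above can be sketched as follows. This is a hedged numpy illustration, not the repo's TF code: the causal mask sets attention energies for future positions to a large negative value before the softmax, and the `dropout-resnet-layernorm` block applies dropout to a sublayer output, adds the residual, and layer-normalizes. All function names here are hypothetical.

```python
import numpy as np

def causal_mask_scores(scores):
    """Mask attention energies so position i only sees positions j <= i,
    which is what makes the layer usable for decoding."""
    T = scores.shape[0]
    future = np.triu(np.ones((T, T), dtype=bool), k=1)  # strictly upper triangle
    return np.where(future, -1e9, scores)

def layer_norm(x, eps=1e-6):
    mu = x.mean(-1, keepdims=True)
    var = x.var(-1, keepdims=True)
    return (x - mu) / np.sqrt(var + eps)

def dropout_resnet_layernorm(x, sublayer_out, keep_prob=0.9, train=True):
    """dropout -> residual add -> layer norm, in that order (assumed)."""
    if train:
        mask = (np.random.rand(*sublayer_out.shape) < keep_prob) / keep_prob
        sublayer_out = sublayer_out * mask  # inverted dropout
    return layer_norm(x + sublayer_out)
```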
Run `app.py` for a simple test on toy data.