Three-attention QANet with ELMo

This is a TensorFlow implementation of a three-attention model with ELMo. The input embedding layer is similar to the one in QANet, except that we add ELMo embeddings. The embedding encoder layer is the same as in QANet. We also use a layer similar to QANet's context-query attention layer, extended with candidate-answer information: we first compute the bi-attention between the query and the context, and then compute the bi-attention between the candidate answers and the query-context representation. After that, the model has only an output layer. Once we have the three-attention vectors, we multiply them by the transformed candidates matrix. Training uses the cross-entropy loss between the predicted vector and the ground-truth label vector. At test time, we select the candidate with the maximum probability as the answer.
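
The output step described above can be sketched in plain Python. This is a minimal illustration and not the code in `model.py`: it assumes the three-attention output has already been reduced to a single vector, that each transformed candidate is one vector of the same size, and that scoring is a dot product. The names `score_candidates`, `cross_entropy`, `predict`, `attended`, and `candidates` are hypothetical.

```python
import math


def softmax(scores):
    """Turn raw scores into probabilities (shifted by the max for stability)."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def score_candidates(attended, candidates):
    """Dot product of the attended vector with each candidate vector.

    Assumption: one attended vector and one vector per candidate; the real
    model works on batched tensors.
    """
    return [sum(a * c for a, c in zip(attended, cand)) for cand in candidates]


def cross_entropy(probs, gold_index):
    """Cross-entropy against a one-hot ground-truth label vector."""
    return -math.log(probs[gold_index])


def predict(probs):
    """Index of the candidate with the maximum probability."""
    return max(range(len(probs)), key=lambda i: probs[i])


# Toy values, chosen only for illustration.
attended = [0.5, -1.0, 2.0]
candidates = [
    [1.0, 0.0, 0.0],
    [0.0, 0.0, 1.0],
    [0.0, 1.0, 0.0],
]

logits = score_candidates(attended, candidates)
probs = softmax(logits)
loss = cross_entropy(probs, gold_index=1)  # used during training
answer = predict(probs)                    # used at test time
```

In this toy case the second candidate has the largest dot product with the attended vector, so it gets the highest probability and is chosen as the answer.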


colinsongf/elmo_qanet
