Code for the paper: "Event Detection using Hierarchical Multi-Aspect Attention" by Sneha Mehta, Mohammad Raihanul Islam, Huzefa Rangwala, Naren Ramakrishnan.
The model presented in the paper can be used for general text classification tasks, especially for long texts and low-resource settings where training simple models from scratch is more feasible than fine-tuning large models such as BERT. The model assigns relative importance to different sentences and different weights to the words in each sentence to construct a document representation. To build sentence representations, it introduces a new multi-head self-attention mechanism ([1, 2]) that is shown to be computationally more efficient than the Self-Attentive Sentence Embedding [3] and the self-attention mechanism in Transformers.
For the event detection task, the proposed model beats the previous state-of-the-art Multi-Instance Learning based model [4] on the datasets used in the paper.
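For intuition about the model described above, here is a minimal PyTorch sketch of a generic hierarchical attention classifier: word-level attention builds sentence vectors, and sentence-level attention builds the document vector. This is an illustration, not the repository's implementation; in particular, the single-aspect additive attention below stands in for the paper's multi-aspect mechanism, and all dimensions and names are placeholders.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class AdditiveAttention(nn.Module):
    """Single-aspect additive attention pooling over a sequence of states.
    The paper's multi-aspect mechanism uses several heads with a low-rank
    factorization; one aspect is shown here for brevity."""
    def __init__(self, dim):
        super().__init__()
        self.proj = nn.Linear(dim, dim)
        self.score = nn.Linear(dim, 1, bias=False)

    def forward(self, h):                                  # h: (batch, steps, dim)
        weights = F.softmax(self.score(torch.tanh(self.proj(h))), dim=1)
        return (weights * h).sum(dim=1)                    # (batch, dim)

class HierarchicalEncoder(nn.Module):
    """Word-level attention -> sentence vectors; sentence-level attention
    -> document vector; linear classifier on top."""
    def __init__(self, vocab_size, emb_dim=100, hid=50, n_classes=2):
        super().__init__()
        self.emb = nn.Embedding(vocab_size, emb_dim)
        self.word_rnn = nn.GRU(emb_dim, hid, batch_first=True, bidirectional=True)
        self.word_attn = AdditiveAttention(2 * hid)
        self.sent_rnn = nn.GRU(2 * hid, hid, batch_first=True, bidirectional=True)
        self.sent_attn = AdditiveAttention(2 * hid)
        self.out = nn.Linear(2 * hid, n_classes)

    def forward(self, docs):                               # docs: (batch, n_sents, n_words)
        b, s, w = docs.shape
        h, _ = self.word_rnn(self.emb(docs.view(b * s, w)))
        sents = self.word_attn(h).view(b, s, -1)           # one vector per sentence
        h, _ = self.sent_rnn(sents)
        return self.out(self.sent_attn(h))                 # logits: (batch, n_classes)

# Usage: logits = HierarchicalEncoder(vocab_size=50000)(docs)
# where docs is a LongTensor of word ids, shape (batch, n_sents, n_words).
```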
Please open an Issue if stuck!
All commands must be run from the project root. The following environment variables can be optionally defined:
```bash
export HOME_DIR="project path"
export PYTHONPATH=$PYTHONPATH:.
```
```bash
sudo apt install python3-dev python3-virtualenv
virtualenv -p python3 --system-site-packages env3
. env3/bin/activate
pip install -r requirements.txt
```
For licensing reasons we cannot make the datasets used in the paper available. Your data must be prepared as described below. It must be split into `train.csv`, `val.csv`, and `test.csv` files. Each file must contain tokenized, lemmatized text and the class label. Sentences in the text must be separated by a `<s>` marker. The csv header must contain two fields: `text` and `label`.
Example record (the first field is the record text, the second is the record label):

```
baghdad national iraqi news agency nina the armed forces have killed dozens of terrorists including
three suicide bombers during repelling an attack by daesh on gonna village in sharqat <s> a source
in the defense ministry told the national iraqi news agency/ nina that daesh terrorist gangs attacked
this morning the military units stationed in the district of sharqat khanokah village where they
were addressed and inflicted them heavy losses killing gonna large number of the terrorist enemy
including 3 suicide bombers and dismantled 20 explosive devices planted by terrorists to hinder
the progress of the armed forces end, 0
```
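Below is a minimal sketch of how such a record could be produced. The spaCy pipeline and model name are assumptions; any tokenizer/lemmatizer that yields lowercased, lemmatized tokens with sentences joined by `<s>` will do.

```python
import csv
import spacy

# en_core_web_sm is an assumed model; any lemmatizing pipeline works.
nlp = spacy.load("en_core_web_sm")

def to_record(raw_text, label):
    """Lowercase and lemmatize the text, joining sentences with <s>."""
    doc = nlp(raw_text)
    sents = [" ".join(tok.lemma_.lower() for tok in sent if not tok.is_space)
             for sent in doc.sents]
    return {"text": " <s> ".join(sents), "label": label}

with open("train.csv", "w", newline="") as f:
    writer = csv.DictWriter(f, fieldnames=["text", "label"])
    writer.writeheader()
    writer.writerow(to_record(
        "The armed forces repelled an attack. Three bombers were killed.", 0))
```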
Train word embeddings:

```bash
python train_word2vec.py --path <path_to_store_embeddings> \
    --data_dir <path_to_processed_data> \
    --dim <embedding_dimension>
```
This should generate a `word2vec.100d.[x]k.w2v` file, where `[x]` refers to the vocabulary size.
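To sanity-check the trained embeddings you can load the file with gensim. Whether the script writes gensim's native format or the classic word2vec format is an assumption here; adjust the loader accordingly.

```python
from gensim.models import KeyedVectors

path = "word2vec.100d.50k.w2v"  # substitute the actual [x]k file name

# Assumes the script saves with gensim's native .save(); if it writes the
# classic word2vec text/binary format, use
# KeyedVectors.load_word2vec_format(path) instead.
wv = KeyedVectors.load(path)
print(wv.most_similar("attack", topn=5))
```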
The main script creates the dictionary, trains and saves the model, and runs inference on the test set:
```bash
python main.py \
    --data_dir <path_to_processed_data> \
    --emb_path <path_to_word2vec.100d.[x]k.w2v_file> \
    --aspects 16 \
    --epochs 40 \
    --experiment <dataset_name>
```
`data_dir` should contain the `train.csv`, `val.csv`, and `test.csv` files.
You can point TensorBoard to the training folder (by default `./runs`) to monitor the training process:
```bash
tensorboard --port 6007 --logdir ./runs
```
Running the main script will create `stats` and `output` directories. It will store the performance of the current run inside `stats/experiment` (note: the experiment name can be changed using the `--experiment` flag). It will create `EVAL` and `TEST` directories inside the experiment directory, containing `eval.json` and `test.json` respectively. Performance metrics of the model, such as precision, recall, and F1, can be found inside these files.
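For example, the test metrics can be read back programmatically. The exact JSON schema (key names such as `precision`, `recall`, `f1`) is an assumption; inspect the file to confirm.

```python
import json
from pathlib import Path

# Layout follows the description above; "my_dataset" stands in for whatever
# name was passed via --experiment.
with (Path("stats") / "my_dataset" / "TEST" / "test.json").open() as f:
    metrics = json.load(f)

# Key names are assumed; inspect the JSON to confirm the actual schema.
print({k: metrics.get(k) for k in ("precision", "recall", "f1")})
```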
[1] Low Rank Factorization for Compact Multi-Head Self-Attention
[2] Event Detection Using Hierarchical Multi-Aspect Attention
[3] A Structured Self-attentive Sentence Embedding
[4] A Multiple Instance Learning Framework for Identifying Key Sentences and Detecting Events
```
@inproceedings{Mehta:2019:EDU:3308558.3313659,
  author = {Mehta, Sneha and Islam, Mohammad Raihanul and Rangwala, Huzefa and Ramakrishnan, Naren},
  title = {Event Detection Using Hierarchical Multi-Aspect Attention},
  booktitle = {The World Wide Web Conference},
  series = {WWW '19},
  year = {2019},
  isbn = {978-1-4503-6674-8},
  location = {San Francisco, CA, USA},
  pages = {3079--3085},
  numpages = {7},
  url = {http://doi.acm.org/10.1145/3308558.3313659},
  doi = {10.1145/3308558.3313659},
  acmid = {3313659},
  publisher = {ACM},
  address = {New York, NY, USA},
  keywords = {Event Encoding, Hierarchical Attention, Multi-Aspect Attention, Neural Networks},
}
```