Attention Realignment and Pseudo-Labelling for Interpretable Cross-Lingual Classification of Crisis Tweets
Purpose: A cross-lingual neural network model built on XLM-R, with the capability to attend to similar words across languages (e.g., *dlo* in Haitian Creole versus *water* in English).
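The core idea of attending to semantically similar tokens can be illustrated with a minimal dot-product-attention sketch. This is purely illustrative and not the paper's exact realignment procedure; the embeddings and names below are toy placeholders:

```python
import numpy as np

def attention_weights(token_embs, query):
    """Softmax over dot products between a query vector and each token embedding."""
    scores = token_embs @ query          # one similarity score per token
    scores = scores - scores.max()       # shift for numerical stability
    w = np.exp(scores)
    return w / w.sum()

# Toy vocabulary with one-hot-like embeddings so similarities are unambiguous.
embs = np.eye(5, 8)          # 5 tokens, 8-dimensional embeddings
query = 3.0 * embs[2]        # a query aligned with token 2 (think "water"/"dlo")
weights = attention_weights(embs, query)
```

With aligned multilingual embeddings (as XLM-R provides), the same query concentrates attention on translationally similar tokens in either language.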
http://ceur-ws.org/Vol-2657/paper3.pdf (KiML @ KDD 2020)
@inproceedings{krishnanAttentionRealignment,
  title={Attention Realignment and Pseudo-Labelling for Interpretable Cross-Lingual Classification of Crisis Tweets},
  author={Krishnan, Jitin and Purohit, Hemant and Rangwala, Huzefa},
  booktitle={Proceedings of the KDD Workshop on Knowledge-infused Mining and Learning},
  year={2020}
}
- Python 3.6, Keras, TensorFlow.
- Install fairseq for XLM-R. Apex is not needed.
Download the Appen dataset consisting of Multilingual Disaster Response Messages.
python get_xlmr_embeddings.py en train
python get_xlmr_embeddings.py en val
python get_xlmr_embeddings.py en test
python get_xlmr_embeddings.py ml train
python get_xlmr_embeddings.py ml val
python get_xlmr_embeddings.py ml test
This step produces 6 `.npy` files with embeddings and 6 `.txt` files with the corresponding tweets. Caching the embeddings speeds up training, since running XLM-R is slow.
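The cached splits can then be reloaded for training without re-running XLM-R. A minimal sketch of this save/load pattern (the file-naming scheme below is hypothetical; the actual scripts may differ):

```python
import numpy as np
import os
import tempfile

def save_split(path_prefix, embeddings, tweets):
    """Cache XLM-R embeddings (.npy) alongside the raw tweets (.txt)."""
    np.save(path_prefix + ".npy", embeddings)
    with open(path_prefix + ".txt", "w", encoding="utf-8") as f:
        f.write("\n".join(tweets))

def load_split(path_prefix):
    """Reload a cached split: an (n, d) embedding matrix and n tweet strings."""
    embeddings = np.load(path_prefix + ".npy")
    with open(path_prefix + ".txt", encoding="utf-8") as f:
        tweets = f.read().splitlines()
    return embeddings, tweets

# Round-trip a tiny synthetic split.
prefix = os.path.join(tempfile.mkdtemp(), "en_train")
save_split(prefix, np.zeros((2, 4)), ["need water", "bezwen dlo"])
X, texts = load_split(prefix)
```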
python baseline.py en ml
python modelA.py en ml
python modelB.py en ml
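Each script trains on the source language and evaluates on the target language (source-language performance is reported in parentheses below). The evaluation protocol can be sketched with a tiny NumPy logistic-regression stand-in for the real Keras models; this is purely illustrative, and the actual architectures live in the scripts above:

```python
import numpy as np

def train_logreg(X, y, lr=0.5, steps=200):
    """Fit a logistic-regression classifier by gradient descent (stand-in model)."""
    w = np.zeros(X.shape[1])
    for _ in range(steps):
        p = 1.0 / (1.0 + np.exp(-(X @ w)))
        w -= lr * X.T @ (p - y) / len(y)
    return w

def accuracy(w, X, y):
    return float((((X @ w) > 0).astype(int) == y).mean())

# Synthetic "source" and "target" splits sharing one embedding space,
# mimicking the en --> ml transfer setup.
rng = np.random.default_rng(0)
Xs = rng.normal(size=(200, 8)); ys = (Xs[:, 0] > 0).astype(int)
Xt = rng.normal(size=(100, 8)); yt = (Xt[:, 0] > 0).astype(int)
w = train_logreg(Xs, ys)
src_acc, tgt_acc = accuracy(w, Xs, ys), accuracy(w, Xt, yt)
```

Because XLM-R maps both languages into a shared space, a classifier trained only on source-language embeddings can transfer to the target, which is what the table below measures.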
Source --> Target (Source --> Source)

| S --> T | Baseline | Model A | Model B |
|---|---|---|---|
| en --> ml | 59.98 (80.57) | 62.53 (77.02) | 66.79 (82.39) |
| ml --> en | 60.93 (70.07) | 65.69 (63.50) | 70.95 (73.84) |
See the accompanying Jupyter notebook for the attention heat map visualization.
For help or issues, please submit a GitHub issue or contact Jitin Krishnan (jkrishn2@gmu.edu).