MUSIED: A Benchmark for Event Detection from Multi-Source Heterogeneous Informal Texts

myeclipse/MUSIED

MUSIED

Dataset and baselines for the paper "MUSIED: A Benchmark for Event Detection from Multi-Source Heterogeneous Informal Texts".

Data

The dataset is available in the “data” folder. The data format is described in this document.

Data preprocessing

Run preprocessing.py to generate the sentence-level input for the models. The results are saved in the data directory:

├── data
│     ├── train_sentence.json
│     ├── dev_sentence.json
│     └── test_sentence.json
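
As a quick sanity check after preprocessing, the generated files can be loaded and counted. The snippet below is only a minimal sketch and not part of the released code: the helper names `load_split` and `summarize` are hypothetical, and because the record schema is not described here, the code makes no assumption about field names. It also assumes each file is either a single JSON document or JSON Lines, which may not match the actual output of preprocessing.py.

```python
import json
from pathlib import Path


def load_split(path):
    """Load one split file (hypothetical helper, not from the repository).

    Assumption: the file is either one JSON document or JSON Lines.
    """
    text = Path(path).read_text(encoding="utf-8")
    try:
        return json.loads(text)
    except json.JSONDecodeError:
        # Fall back to one JSON object per non-empty line.
        return [json.loads(line) for line in text.splitlines() if line.strip()]


def summarize(data_dir="data"):
    """Return {split: number of top-level items} for the splits found in data_dir.

    If a file holds a JSON object rather than a list, the count is its number of keys.
    """
    counts = {}
    for split in ("train", "dev", "test"):
        path = Path(data_dir) / f"{split}_sentence.json"
        if path.exists():
            counts[split] = len(load_split(path))
    return counts


if __name__ == "__main__":
    print(summarize())
```

Run it from the repository root once preprocessing.py has finished; a split that is missing from the data directory is simply left out of the summary.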

Code

We release the source code for the baselines, including:

sentence-level models:

--DMCNN

--BiLSTM

--BERT

--C-BiLSTM

--DMBERT

document-level models:

--HBTNGMA

--MLBiNet
