- matplotlib==3.6.2
- numpy==1.23.5
- scikit_learn==1.2.2
- torch==1.10.0
- tqdm==4.64.1
- transformers==4.28.1
pip install -r requirements.txt
For ACE2005, we adopt 30 classes in ACE2005 which have more than 10 instances and randomly divide them into subsets with 10, 10 and 10 classes for training, validation and test, respectively.
For MAVEN , we adopt 100 classes in MAVEN which have more than 200 instances and randomly divide them into subsets with 64, 16 and 20 classes for training, validation and test, respectively.
For FewEvent, we adopt the version split by Cong et al., 2021, which contains 80, 10 and 10 event types for training, validation and test, respectively.
For ERE, we adopt the top 30 classes with the most samples and split into 10/10/10 for training, validation and test, respectively.
Dataset | #Class | #Train | #Dev | #Test |
---|---|---|---|---|
ACE2005 | 30 | 10 | 10 | 10 |
ERE | 30 | 10 | 10 | 10 |
FewEvent | 100 | 80 | 10 | 20 |
MAVEN | 100 | 64 | 16 | 20 |
Metric-based:
-
Match: uses cosine function to measure the similarity.
-
Proto: uses Euclidean Distance as the similarity metric.
-
Proto-dot: uses dot product to compute the similarity
-
Relation: building a two-layer neural networks to measure the similarity.
CRF-based:
- Vanilla CRF: adopt the vanilla CRF in the FSED task without considering the adaptation problem.
- PA-CRF: exploring event type dependencies via Gaussian distribution for approximation.
- SpanFSED: introducing a global boundary matrix and an external knowledge base.