This is the repository for the ACL 2024 paper Argument Mining in Data Scarce Settings: Cross-lingual Transfer and Few-shot Techniques
In the paper, we explore different strategies for dealing with data scarcity in Argument Mining tasks, namely fine-tuning multilingual BERT and adapting EntLM, a template-free few-shot approach to sequence labeling. In our experiments, we generate few-shot medical data from the AbstRCT corpus in four languages (English, Spanish, Italian and French).
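The exact sampling procedure lives in this repository's scripts; purely as a hedged illustration of how a k-shot split for sequence labeling can be drawn (all names below are hypothetical, not the repo's API), one can sample sentences per entity label:

```python
import random
from collections import defaultdict

def sample_few_shot(sentences, k, seed=0):
    """Draw an approximate k-shot set for sequence labeling (illustrative sketch).

    sentences: list of (tokens, labels) pairs with BIO labels.
    A sentence counts toward every entity label it contains, so the
    result is approximately k sentences per label, as is common in
    few-shot sequence-labeling setups.
    """
    rng = random.Random(seed)
    by_label = defaultdict(list)
    for i, (_, labels) in enumerate(sentences):
        # Strip the B-/I- prefix to group by entity class.
        for lab in {l.split("-")[-1] for l in labels if l != "O"}:
            by_label[lab].append(i)
    chosen = set()
    for lab, idxs in by_label.items():
        chosen.update(rng.sample(idxs, min(k, len(idxs))))
    return [sentences[i] for i in sorted(chosen)]
```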
- Run `pip install -r requirements.txt` to install the required packages.
All the data used for the experiments can be found in the `dataset` folder.
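Assuming the data files follow a CoNLL-style one-token-per-line format with the label in the last column (check the actual files before relying on this sketch), they can be read with something like:

```python
def read_conll(path):
    """Hypothetical reader for a CoNLL-style file: token ... label per line,
    sentences separated by blank lines. Adjust columns/separator to the
    actual files in the dataset folder."""
    sentences, tokens, labels = [], [], []
    with open(path, encoding="utf-8") as f:
        for line in f:
            line = line.strip()
            if not line:
                # Blank line ends the current sentence.
                if tokens:
                    sentences.append((tokens, labels))
                    tokens, labels = [], []
            else:
                cols = line.split()
                tokens.append(cols[0])
                labels.append(cols[-1])
    if tokens:
        sentences.append((tokens, labels))
    return sentences
```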
- Run `sh scripts/count_freq.sh` to generate the label words for EntLM.
- Run `sh scripts/run_fewshot.sh` to launch few-shot learning with EntLM.
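The label-word generation step above is defined by the repo's own script; as a hedged sketch of the general idea behind EntLM label words, one can pick, for each entity class, the token(s) that co-occur with that label most frequently in the training data:

```python
from collections import Counter, defaultdict

def label_words(sentences, top_n=1):
    """Illustrative label-word selection (not the repo's script).

    sentences: list of (tokens, labels) pairs with BIO labels.
    Returns, for each entity class, the top_n most frequent tokens
    observed under that class, a simple frequency-based choice of
    EntLM-style label words.
    """
    freq = defaultdict(Counter)
    for tokens, labels in sentences:
        for tok, lab in zip(tokens, labels):
            if lab != "O":
                # Group B-X and I-X under the same class X.
                freq[lab.split("-")[-1]][tok.lower()] += 1
    return {lab: [w for w, _ in c.most_common(top_n)] for lab, c in freq.items()}
```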
- Run `sh fine-tuning/finetune_fewshot.sh` to fine-tune the model with a small amount of data.
- Alternatively, run `sh fine-tuning/finetune_full.sh` to fine-tune the model on the full data.
If you find this work useful, please cite:

```bibtex
@article{yeginbergen2024argument,
  title={Argument Mining in Data Scarce Settings: Cross-lingual Transfer and Few-shot Techniques},
  author={Yeginbergen, Anar and Oronoz, Maite and Agerri, Rodrigo},
  journal={arXiv preprint arXiv:2407.03748},
  year={2024}
}
```