XAL

XAL: EXplainable Active Learning Makes Classifiers Better Low-resource Learners

Requirements

The original project is tested under the following environments:

python==3.9.16
torch==1.10.1+cu111
transformers==4.28.1
numpy==1.24.2
scikit-learn==1.2.2
openai==0.27.7

Quick Start

To train XAL models or baseline models, you can directly use the bash scripts demo_XAL.sh and demo_baseline.sh.

Note that for the codes in this repository, we fix the number of initial data points to 100, and select data in five rounds where 100 unlabeled data are selected.

If you expect to implement this framework in other text classification tasks, you need to add a new processor in glue_utils.py, and change the number of iterations and data selection in run_active_rank.py.

Generate Explanations

The scripts can be found in chatgpt_query.ipynb using jupyter notebook. You need to add your openai key to the codes.

Datasets

We conduct experiments on six different text classification tasks. :

Natural Language Inference aims to detect whether the meaning of one text is entailed (can be inferred) from the other text; (RTE)
Paraphrase Detection requires identifying whether each sequence pair is paraphrased; (MRPC)
Category Sentiment Classification aims to identify the sentiment (Positive/Negative/Neutral) of a given review to a category of the target such as food and staff; (MAMS)
Stance Detection aims to identify the stance (Favor/Against/Neutral) of a given text to a target; (COVID19)
(Dis)agreement Detection aims to detect the stance (Agree/Disagree/Neutral) of one reply to a comment; (DEBA)
Relevance Classification aims to detect whether a scientific document is relevant to a given topic. (CLEF)

You can directly download the processed data together with explanatios from link.

Citation

@misc{luo2023xal,
      title={XAL: EXplainable Active Learning Makes Classifiers Better Low-resource Learners}, 
      author={Yun Luo and Zhen Yang and Fandong Meng and Yingjie Li and Fang Guo and Qinglin Qi and Jie Zhou and Yue Zhang},
      year={2023},
      eprint={2310.05502},
      archivePrefix={arXiv},
      primaryClass={cs.CL}
}

Name		Name	Last commit message	Last commit date
Latest commit History 16 Commits
README.md		README.md
api_request_parallel_processor.py		api_request_parallel_processor.py
baseline_helper.py		baseline_helper.py
chatgpt_query.ipynb		chatgpt_query.ipynb
demo_XAL.sh		demo_XAL.sh
demo_baseline.sh		demo_baseline.sh
glue_utils.py		glue_utils.py
model.py		model.py
run_active_baselines.py		run_active_baselines.py
run_active_rank.py		run_active_rank.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

XAL

Requirements

Quick Start

Generate Explanations

Datasets

Citation

About

Releases

Packages

Languages

LuoXiaoHeics/XAL

Folders and files

Latest commit

History

Repository files navigation

XAL

Requirements

Quick Start

Generate Explanations

Datasets

Citation

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages