CANDLE: Iterative Conceptualization and Instantiation Distillation from Large Language Models for Commonsense Reasoning
This is the official code and data repository for the ACL 2024 (main conference) paper: CANDLE: Iterative Conceptualization and Instantiation Distillation from Large Language Models for Commonsense Reasoning.
The distilled conceptualizations and instantiations (based on the ATOMIC and AbstractATOMIC datasets) and model checkpoints for all downstream tasks can be downloaded at this link.
The code for distilling conceptualizations with ChatGPT is in CANDLE_Distillation/ChatGPT_conceptualization.py, and the code for distilling instantiations with LLaMA-2 is in CANDLE_Distillation/LLAMA2_instantiation.py.
Replace the OpenAI key and the HuggingFace Hub key in the code with your own before running the distillation scripts.
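For reference, below is a minimal sketch of how such a conceptualization query could look with the OpenAI SDK and how the two keys might be supplied. The exact prompt, model choice, and key handling in the repository's scripts may differ; the event and prompt here are illustrative placeholders only.

```python
# Hedged sketch: a single conceptualization query, loosely mirroring what
# CANDLE_Distillation/ChatGPT_conceptualization.py does. The actual prompt,
# model, and post-processing used by the authors may differ.
from openai import OpenAI
from huggingface_hub import login

client = OpenAI(api_key="sk-...")  # replace with your own OpenAI key
login(token="hf_...")              # HuggingFace Hub token (needed for gated LLaMA-2 weights)

event = "PersonX drinks a cup of coffee"  # illustrative ATOMIC-style event
prompt = (
    "Abstract the instance in the following event into a more general concept.\n"
    f"Event: {event}\nConcept:"
)

response = client.chat.completions.create(
    model="gpt-3.5-turbo",
    messages=[{"role": "user", "content": prompt}],
)
print(response.choices[0].message.content.strip())
```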
For the CSKB Conceptualization and COMET tasks, please refer to this repository. To train LLaMA-2 for the COMET task, we use the code from LLaMA-Factory.
For the zero-shot Commonsense QA task, please refer to this repository.
Please use the BibTeX entry below to cite our paper:
@inproceedings{CANDLE,
author = {Weiqi Wang and
Tianqing Fang and
Chunyang Li and
Haochen Shi and
Wenxuan Ding and
Baixuan Xu and
Zhaowei Wang and
Jiaxin Bai and
Xin Liu and
Jiayang Cheng and
Chunkit Chan and
Yangqiu Song},
title = {{CANDLE:} Iterative Conceptualization and Instantiation Distillation
from Large Language Models for Commonsense Reasoning},
year = {2024},
booktitle = {Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics, {ACL} 2024}
}
The authors of this paper were supported by the NSFC Fund (U20B2053) from the NSFC of China, the RIF (R6020-19 and R6021-20) and the GRF (16211520 and 16205322) from the RGC of Hong Kong. We also acknowledge the support from the UGC Research Matching Grants (RMGS20EG01-D, RMGS20CR11, RMGS20CR12, RMGS20EG19, RMGS20EG21, RMGS23CR05, RMGS23EG08).