Are Multilingual LLMs Culturally-Diverse Reasoners? An Investigation into Multicultural Proverbs and Sayings
This repository contains our code for experiments and experiment logs.
To access the data, please fill the form here. A link to our dataset will show automatically after submit the form.
If you encounter any issues or have suggestions, please do not hesitate to email us at: chen.liu AT tu-darmstadt DOT de
https://www.ukp.tu-darmstadt.de/
This repository contains experimental software and is published for the sole purpose of giving additional background details on the respective publication.
Install all Python requirements listed in requirements.txt
(check here
to see how to install Pytorch on your system).
You can install the requirements.txt like:
pip install --upgrade pip
pip install -r requirements.txt
Please put the data into the dataset folder.
- Memorization experiment:
memorization.py
. - Inference experiment:
qa_experiments.py
. - Fine-tuning baseline:
ft_standard.py
.
To run:
python EXPERIMENT_NAME.py
The corresponding hyperparameters are in the configs
folder.
Each experiment entry point has its own config file, managed through Hydra.
To add models, checkout the model_collection.py
.
To change prompt patterns, checkout the patterns.py
.
If you find this repository helpful, consider to cite the following paper:
@article{liu2023multilingual,
author = {Chen Cecilia Liu and
Fajri Koto and
Timothy Baldwin and
Iryna Gurevych},
title = {Are Multilingual LLMs Culturally-Diverse Reasoners? An Investigation
into Multicultural Proverbs and Sayings},
journal = {CoRR},
volume = {abs/2309.08591},
year = {2023},
url = {https://doi.org/10.48550/arXiv.2309.08591},
doi = {10.48550/ARXIV.2309.08591},
eprinttype = {arXiv},
eprint = {2309.08591}
}