This repository provides code for knowledge probing of large language models. It accompanies the papers "Why Do Neural Language Models Still Need Commonsense Knowledge?" and "Impact of Co-occurrence on Factual Knowledge of Large Language Models" (EMNLP 2023 Findings) (project page).
The following setup script creates a conda environment named 'factual_knowledge_probing'.
bash scripts/installation/setup_conda.sh
The original dataset is saved in 'data/original_LAMA', and the preprocessed dataset in 'data/LAMA_TREx'.
bash scripts/installation/download_LAMA.sh
bash scripts/installation/preprocess_LAMA_TREx.sh
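For intuition about what the preprocessing works with: LAMA TREx pairs subject-object facts with relation templates containing '[X]' and '[Y]' placeholders, which are filled to form cloze-style prompts. A minimal sketch (the helper name, template, and masking token below are illustrative, not the repository's exact code):

```python
# Fill a LAMA-style relation template with a subject and mask the object.
# The template string, function name, and "[MASK]" token are illustrative.
def fill_template(template: str, subject: str, obj: str):
    prompt = template.replace("[X]", subject).replace("[Y]", "[MASK]")
    return prompt, obj

prompt, answer = fill_template("[X] was born in [Y].", "Albert Einstein", "Ulm")
# prompt == "Albert Einstein was born in [MASK]."
```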
Check the number of samples for each relation.
# dataset_name: ['LAMA_TREx', 'ConceptNet']
bash scripts/installation/check_number_of_samples.sh {dataset_name}
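If you want to inspect the counts yourself, a minimal Python sketch is shown below. It assumes the dataset is stored as JSON Lines with a relation identifier under the key 'predicate_id'; that field name is an assumption, not confirmed by the repository.

```python
import json
from collections import Counter

def count_samples_per_relation(path: str) -> Counter:
    """Count examples per relation in a JSON Lines dataset file.

    Assumes each line is a JSON object holding its relation identifier
    under the key 'predicate_id' (hypothetical field name).
    """
    counts = Counter()
    with open(path) as f:
        for line in f:
            line = line.strip()
            if line:
                counts[json.loads(line)["predicate_id"]] += 1
    return counts
```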
The pretrained models (e.g. 'gpt_j_6B') are saved in 'results/{model_name}'.
bash scripts/installation/download_pretrained_models.sh
The prediction file (e.g. 'pred_{dataset_name}_test.jsonl') is saved in 'results/{model_name_or_path}_{dataset_name}'.
# Zero-shot test
# model_type: ['clm', 'mlm']
# model_name_or_path: ['EleutherAI/gpt-neo-125m', 'EleutherAI/gpt-j-6b', 'bert-base-uncased', 'bert-large-uncased', ...]
# dataset_type: ['test', 'train', ...]
bash scripts/factual_knowledge_probing/test/test_zeroshot.sh {model_type} {model_name_or_path} {dataset_name} {dataset_type}
# (Optional) Run on multi-gpus with DeepSpeed ZeRO-3
bash scripts/factual_knowledge_probing/test/test_zeroshot_ds_zero3.sh {model_type} {model_name_or_path} {dataset_name} {dataset_type}
# Test finetuned models
bash scripts/factual_knowledge_probing/test/test_finetuned.sh {model_type} {model_name_or_path} {dataset_name} {dataset_type}
# Test prompt-tuned models (Note that this script tests the model for every relation iteratively.)
bash scripts/factual_knowledge_probing/test/test_prompt_tuned.sh {model_type} {model_name_or_path} {dataset_name} {dataset_type}
# Aggregate predictions of prompt-tuned models
bash scripts/factual_knowledge_probing/test/aggregate_predictions_for_prompt_tuning.sh {model_name_or_path} {dataset_name} {dataset_type}
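Conceptually, aggregation concatenates the per-relation prediction files into a single file. A minimal sketch, assuming JSON Lines prediction files selected by a glob pattern (the pattern and function name are illustrative, not the repository's actual implementation):

```python
import glob

def aggregate_predictions(pattern: str, out_path: str) -> int:
    """Concatenate per-relation JSON Lines prediction files into one file.

    'pattern' is a glob such as 'results/.../pred_P*_test.jsonl'
    (illustrative path). Returns the number of aggregated records.
    """
    n = 0
    with open(out_path, "w") as out:
        for path in sorted(glob.glob(pattern)):
            with open(path) as f:
                for line in f:
                    if line.strip():
                        out.write(line if line.endswith("\n") else line + "\n")
                        n += 1
    return n
```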
This evaluation script computes the scores and saves the results in 'results/{model_name_or_path}/score_{dataset_name}_{dataset_type}.json'.
# prediction_file: ['results/gpt-neo-125m_LAMA_TREx_zeroshot/pred_LAMA_TREx_test.jsonl', ...]
bash scripts/factual_knowledge_probing/test/compute_score.sh {prediction_file} {dataset_name}
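For reference, the core of such scoring is a top-1 accuracy over the prediction file. A minimal sketch, assuming each JSON Lines record stores the model prediction under 'pred' and the gold object under 'gold' (hypothetical field names, not confirmed by the repository):

```python
import json

def top1_accuracy(pred_path: str) -> float:
    """Compute top-1 accuracy over a JSON Lines prediction file.

    Assumes each line holds the model prediction under 'pred' and the
    gold object under 'gold' (hypothetical field names).
    """
    correct = total = 0
    with open(pred_path) as f:
        for line in f:
            if not line.strip():
                continue
            record = json.loads(line)
            total += 1
            correct += record["pred"].strip() == record["gold"].strip()
    return correct / total if total else 0.0
```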
Run the following script to finetune the model.
The finetuned models and prediction files are saved in 'results/{model_name_or_path}_{dataset_name}_finetuning'.
bash scripts/factual_knowledge_probing/finetuning/finetuning.sh {model_type} {model_name_or_path} {dataset_name}
Run the following script to train the model with prompt tuning.
The finetuned models and prediction files are saved in 'results/{model_name_or_path}_{dataset_name}_prompt_tuning'.
Note that this script runs prompt tuning for every relation iteratively.
bash scripts/factual_knowledge_probing/prompt_tuning/prepare_for_prompt_tuning.sh {dataset_name}
bash scripts/factual_knowledge_probing/prompt_tuning/prompt_tuning.sh {model_type} {model_name_or_path} {dataset_name}
The following script generates test data augmented with demonstrations (few-shot prompts); you can then test the models on it.
bash scripts/factual_knowledge_probing/in_context_learning/prepare_for_in_context_learning.sh {dataset_name}
bash scripts/factual_knowledge_probing/test/test_zeroshot.sh {model_type} {model_name_or_path} {dataset_name} test_4_shot
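For intuition, few-shot test data prepends k completed demonstrations to each query. A minimal sketch (the joining format and function name are illustrative choices, not the repository's exact prompt format):

```python
def build_few_shot_prompt(demonstrations, query):
    """Prepend demonstrations to a cloze-style query.

    'demonstrations' is a list of (prompt, answer) pairs; joining with
    spaces and newlines is an illustrative formatting choice.
    """
    shots = [f"{p} {a}" for p, a in demonstrations]
    return "\n".join(shots + [query])

demos = [("The capital of France is", "Paris"),
         ("The capital of Japan is", "Tokyo")]
prompt = build_few_shot_prompt(demos, "The capital of Italy is")
# 'prompt' ends with the unanswered query "The capital of Italy is".
```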