We experiment with different tasks revolving around causality in LLMs such as GPT-3. Our goal is to measure the quality of their causal modelling capabilities on real-world tasks, toy problems, and adversarial examples.
The final report can be found on the Alignment Forum.

To reproduce the results:
- Set your OpenAI key as an environment variable by typing `export OPENAI_KEY="<insert_your_key_here>"` in your console before running your experiments.
- (optional) Check the playground notebooks to get a better feeling for the experiments.
- Run all of the `experiment.py` scripts to produce the results (don't forget that running experiments costs money).
- Run the evaluation Jupyter notebooks.
- (optional) Run the analysis for report.ipynb to reproduce the exact figures of the report.
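As a sanity check before spending money on API calls, it can help to verify that the key from the setup step above is actually visible to Python. A minimal sketch (the helper name `load_api_key` is hypothetical; the variable name `OPENAI_KEY` matches the export command above):

```python
import os

def load_api_key(env_var: str = "OPENAI_KEY") -> str:
    """Read the OpenAI key from the environment, failing loudly if absent."""
    key = os.environ.get(env_var)
    if not key:
        raise RuntimeError(
            f"{env_var} is not set; export it before running experiments"
        )
    return key
```

Calling this at the top of each experiment script gives a clear error message instead of a failed API request partway through a paid run.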
Please note that this is just a small side project and the code has not been optimized for efficiency or readability.