
When to Make Exceptions: Exploring Language Models as Accounts of Human Moral Judgment

This is the repository for the paper "When to Make Exceptions: Exploring Language Models as Accounts of Human Moral Judgment" (2022) by Zhijing Jin*, Sydney Levine*, Fernando Gonzalez*, Ojasv Kamal, Maarten Sap, Mrinmaya Sachan, Rada Mihalcea, Josh Tenenbaum, and Bernhard Schölkopf.

The MoralExceptQA dataset is hosted on the Hugging Face Hub; see the Dataset section below.

Repository layout:

models -- scripts to generate the model predictions with GPT-3 and the baseline predictions
extra_analyses -- scripts for additional analyses, e.g. domain-feature evaluation, price estimation, and the dogmatic score
input_data -- the dataset ("complete_file.csv") and the item costs estimated by humans (a quick-look sketch for loading the CSV follows this list)
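As a quick sanity check, the main CSV can be inspected with pandas. This is a minimal sketch, assuming it is run from the repository root; no column names are assumed:

```python
import pandas as pd

# Load the dataset file shipped with the repo.
# The path assumes the script is run from the repository root.
df = pd.read_csv("input_data/complete_file.csv")

# Print basic shape and column names without assuming a particular schema.
print(df.shape)
print(df.columns.tolist())
print(df.head())
```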

Instructions to run the models

Installation

  1. conda create -n moralcot python=3.7
  2. conda activate moralcot
  3. pip install -r requirements.txt
  4. export base_folder=path_to_the_project
  5. export OPENAI_API_KEY=your_gpt3_key (required to query GPT-3; see the sketch below)
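The scripts in models read the key from this environment variable via the openai Python package. The snippet below is a rough sketch of that pattern only; the engine name, prompt, and decoding parameters are illustrative assumptions, not the settings actually used by the paper's scripts (those live in models):

```python
import os
import openai

# The scripts expect the key in the OPENAI_API_KEY environment variable.
openai.api_key = os.environ["OPENAI_API_KEY"]

# Illustrative query only: engine, prompt, and decoding settings are assumptions,
# not the configuration used in the repo.
response = openai.Completion.create(
    engine="text-davinci-002",
    prompt="Is it OK to cut in line at the airport if you might miss your flight? Answer yes or no.",
    max_tokens=16,
    temperature=0,
)
print(response["choices"][0]["text"].strip())
```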

Generating predictions

To generate predictions for all models, including the paraphrase variants, run:

./main_models/paraphrases/run_models_ensemble.sh

Dataset

feradauto/MoralExceptQA -- https://huggingface.co/datasets/feradauto/MoralExceptQA
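The dataset can also be loaded with the Hugging Face datasets library. A minimal sketch, assuming it loads without an explicit configuration name (if the loader asks for one, choose from the configurations listed on the dataset page):

```python
from datasets import load_dataset

# Load MoralExceptQA from the Hugging Face Hub.
# Assumes the default configuration; pass a config name if the loader requires one.
ds = load_dataset("feradauto/MoralExceptQA")

# Print the available splits and one example without assuming split or field names.
for split_name, split in ds.items():
    print(split_name, split.num_rows)
print(next(iter(ds.values()))[0])
```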

Reference

When to Make Exceptions: Exploring Language Models as Accounts of Human Moral Judgment -- https://arxiv.org/abs/2210.01478

@misc{jin2022make,
      title={When to Make Exceptions: Exploring Language Models as Accounts of Human Moral Judgment}, 
      author={Zhijing Jin and Sydney Levine and Fernando Gonzalez and Ojasv Kamal and Maarten Sap and Mrinmaya Sachan and Rada Mihalcea and Josh Tenenbaum and Bernhard Schölkopf},
      year={2022},
      eprint={2210.01478},
      archivePrefix={arXiv},
      primaryClass={cs.CL}
}
