lingjunzhao/HEAR
Successfully Guiding Humans with Imperfect Instructions by Highlighting Potential Errors and Suggesting Corrections

This repository contains the code for model development and the human evaluation interface. To assist human navigation with AI-generated instructions, we develop the HEAR model to highlight hallucination spans and suggest possible corrections. In the interface, the green box marks the ground-truth destination, and the orange highlight shows the hallucination span with correction suggestions in a dropdown menu.

Visit the project's website to learn more.

🛠️ Getting Started: Setup

Clone this repo, then install the dependencies through pip:

git clone https://github.com/lingjunzhao/HEAR.git

pip install -r requirements.txt

🐾 Train and Validation Dataset Link

Download the synthetic training dataset from this link, unzip the file, and put the JSON files inside the cal_data/ folder.
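The unzip-and-place step can be sketched in a couple of shell commands. Note that `hear_train_data.zip` is a placeholder name for the downloaded archive, not the real filename; substitute whatever the link actually serves:

```shell
# Create the target folder and unpack the dataset archive into it.
# NOTE: "hear_train_data.zip" is a placeholder; use the actual archive name.
mkdir -p cal_data
if [ -f hear_train_data.zip ]; then
    unzip -o hear_train_data.zip -d cal_data/
fi
```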

🍀 Pre-trained Checkpoints

Download the pretrained Airbert model checkpoint from this link and put it under the model-zoos/ folder.
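For example (the checkpoint filename below is a placeholder, not the real one from the link):

```shell
# Place the downloaded checkpoint under model-zoos/.
# NOTE: "airbert_pretrained.bin" is a placeholder checkpoint filename.
mkdir -p model-zoos
if [ -f airbert_pretrained.bin ]; then
    mv airbert_pretrained.bin model-zoos/
fi
```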

🌎 Training the HEAR Model

Use the following bash script to train the hallucination detection classifier:

run/train_hallucination_detection_parallel.sh

Use the following bash script to train the hallucination type classifier:

run/train_hallucination_type_parallel.sh

The above scripts will generate folders in data/runs/ containing model checkpoints.
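From the repository root, the two training runs can be launched back to back; the invocations below are guarded with a file check so the sketch is safe to run even outside a checkout:

```shell
# Train the detection classifier, then the type classifier.
for script in run/train_hallucination_detection_parallel.sh \
              run/train_hallucination_type_parallel.sh; do
    if [ -f "$script" ]; then
        bash "$script"
    else
        echo "missing $script - run this from the repository root"
    fi
done
```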

If you want to use pretrained models instead, download the pretrained checkpoints from this link, unzip them, and put them under the data/runs/ folder.
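This alternative can be sketched as follows; `hear_pretrained_runs.zip` is a placeholder name for the downloaded archive:

```shell
# Unpack the pretrained run folders into data/runs/.
# NOTE: "hear_pretrained_runs.zip" is a placeholder; use the actual archive name.
mkdir -p data/runs
if [ -f hear_pretrained_runs.zip ]; then
    unzip -o hear_pretrained_runs.zip -d data/runs/
fi
```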

🤖 Decoding the HEAR Model

Use the bash script to decode the hallucination detection classifier to obtain hallucination scores:

run/test_hallucination_detection.sh

The output will be saved in data/runs/run-test_hallucination_detection/ folder.

Use the bash script to decode the hallucination type classifier to obtain intrinsic vs extrinsic scores:

run/test_hallucination_type.sh

The output will be saved in data/runs/run-test_hallucination_type/ folder.
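The two decoding passes can also be run back to back from the repository root; as above, the sketch is guarded so it fails gracefully outside a checkout:

```shell
# Decode hallucination scores, then intrinsic vs. extrinsic type scores.
for script in run/test_hallucination_detection.sh \
              run/test_hallucination_type.sh; do
    if [ -f "$script" ]; then
        bash "$script"
    fi
done
```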

Merge the two-stage outputs with the following Python script:

utils/process_alternative_output.py
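A sketch of the merge invocation, assuming the script takes no arguments (check the script itself for any flags it expects):

```shell
# Merge the detection and type outputs into a single JSON file.
merge_script=utils/process_alternative_output.py
if [ -f "$merge_script" ]; then
    python "$merge_script"
else
    echo "missing $merge_script - run this from the repository root"
fi
```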

The merged output will be saved as:

data/runs/run-test_hallucination_detection/_sigmoid_scores_t5_val_seen_highlighted_phrase_alters_gpt4_direction_dev_test_merged.json
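A quick sanity check that the merged file exists and parses as JSON, using only the path given above:

```shell
# Verify the merged output is present and well-formed JSON.
merged="data/runs/run-test_hallucination_detection/_sigmoid_scores_t5_val_seen_highlighted_phrase_alters_gpt4_direction_dev_test_merged.json"
if [ -f "$merged" ]; then
    python -m json.tool "$merged" > /dev/null && echo "merged output parses as JSON"
else
    echo "merged output not found - run the decoding and merge steps first"
fi
```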

📊 Human Evaluation Interface

Refer to human_evaluation/README.md for instructions.
