VALUE: The VernAcular Language Understanding Evaluation benchmark

This repository contains source code necessary to build the VALUE datasets.

Feel free to contact Caleb Ziems with any questions.

Setup

Prerequisites:

Create a virtual environment

conda create --name value python=3.7
conda activate value

Install requirements:

pip install -r requirements.txt

Install spaCy English pipeline and nltk wordnet

python -m spacy download en_core_web_sm
python 
>>> import nltk
>>> nltk.download('wordnet')
>>> quit()

Example Transformation Usage

Inside of repl.py, you'll find a minimal example using the VALUE transformations for a simple Read-Evaluate-Print-Loop.

This can be run and used with python repl.py --transform=aave_like.

Run Experiments

Modify the run_glue.sh script accordingly. The script automatically downloads GLUE and runs the transformations, but if you would like to manually complete the transformation pipeline, see the next section Manually Build VALUE variants (optional)

Manually Build VALUE variants (optional)

Note: This can take a while to run. To create only a single task, replace the --all tag with the task-specific tag (e.g. --MNLI) in each of the following commands.

Download the datasets from the GLUE benchmark where each task is a subdirectory of data/GLUE

python download_glue_data.py --data_dir "data/GLUE" --tasks all

Move to src (cd src) and complete the following:

Build VALUE base variant with column for HTML tagging (to be used in MTurk validation)

python -m src.build_value --all --VALUE 'data/VALUE' --lexical_mapping 'resources/sae_aave_mapping_dict.pkl' --morphosyntax --html --dialect aave

Build VALUE_no_morpho variant

python -m src.build_value --all --VALUE 'data/VALUE' --lexical_mapping 'resources/sae_aave_mapping_dict.pkl' --html --dialect aave

Build VALUE_no_lex variant

python -m src.build_value --all --VALUE 'data/VALUE' --morphosyntax --html --dialect aave

Build the VALUE_style_transfer variant by cloning the style-transfer-paraphrase repo and running the following for each task dataframe

from style_paraphrase.inference_utils import GPT2Generator
import pandas as pd

paraphraser = GPT2Generator('pretrained_style_transfer/models/paraphraser_gpt2_large/', upper_length="same_5")
paraphraser.modify_p(top_p=0.6)
sae_to_aave = GPT2Generator('pretrained_style_transfer/models/cds_models/aae', upper_length="same_5")
sae_to_aave.modify_p(top_p=0.6)

df = pd.read_csv('path/to/specific/task') # FILL THIS PATH IN
converted = []
batch_size = 32
for i in range(int(len(df)/batch_size)+1):
    sub_df = df.iloc[batch_size*(i):batch_size*(i+1)].copy()
    for col in df.columns:
        if (('sentence' in col) or ('question' in col)) and ('parse' not in col):
            sub_df = sub_df[[type(c)==str for c in sub_df[col].values]].copy()
            consider = sub_df[col].values
            para, prob = paraphraser.generate_batch(consider)
            aave, prob = sae_to_aave.generate_batch(para)
            sub_df[col+'-glue'] = consider
            sub_df[col] = aave
            converted.append(sub_df)
            
converted_df = pd.concat(converted)
converted_df.to_csv('path/to/transformed/task', sep='\t') # FILL THIS PATH IN

Name		Name	Last commit message	Last commit date
Latest commit History 10 Commits
coqa_exp		coqa_exp
hit		hit
resources		resources
src		src
README.md		README.md
download_glue_data.py		download_glue_data.py
repl.py		repl.py
requirements.txt		requirements.txt
run_coqa_sae.sh		run_coqa_sae.sh
run_coqa_value.sh		run_coqa_value.sh
run_coqa_value_combined.sh		run_coqa_value_combined.sh
run_glue.py		run_glue.py
run_glue.sh		run_glue.sh

SALT-NLP/value

Folders and files

Latest commit

History

Repository files navigation

VALUE: The VernAcular Language Understanding Evaluation benchmark

Setup

Prerequisites:

Example Transformation Usage

Run Experiments

Manually Build VALUE variants (optional)

About

Resources

Stars

Watchers

Forks

Languages