safeNLP - Safety-critical NLP benchmark for VNNCOMP

Structure

.
├── data
│   ├── medical                                      - folder containing the data for the Medical dataset
│   └── ruarobot                                     - folder containing the data for the R-U-A-Robot dataset
│   
├── onnx
│   ├── medical                                      - folder containing the onnx network for the Medical dataset
│   └── ruarobot                                     - folder containing the onnx network for the R-U-A-Robot dataset
│   
├── vnnlib
│   ├── medical                                      - folder containing the queries in vnnlib format for the Medical dataset
│   └── ruarobot                                     - folder containing the queries in vnnlib format for the R-U-A-Robot dataset
│
└── generate_properties.py                           - python script to generate the vnnlib queries and the instances.csv file (which is a list of network-query-timeout instances)

Generate benchmarks

To generate the .vnnlib and instances.csv files run:

python generate_properties.py SEED

Details

Pre-processing: by following the best methodology and results from our paper, we use ANTONIO to pre-process the data. The pre-processing comprehends embedding, reducing dimensionality and rotating the data.

Embedding model	Original embedding dimension	Dimension after PCA dimensionality reduction	Space manipulations
all-MiniLM-L6-v2	384	30	Eigenspace rotation

Datasets:

Name	Number of instances	Description
Medical	2886	The Medical safety dataset is a written English dataset consisting of 2,917 risk-graded medical and non medical queries (1,417 and 1,500 examples respectively). The dataset was constructed via collecting questions posted on reddit, such as r/AskDocs. The medical queries have been labelled by experts and crowd annotators for both relevance and levels of risk (i.e. non-serious, serious to critical) following established World Economic Forum (WEF) risk levels designated for chatbots in healthcare. We merge the medical queries of different risk-levels into one class, given the high scarcity of the latter 2 labels to create an in-domain/out-of-domain classification task for medical queries. Additionally, we consider only the medical queries that were labelled as such by expert medical practitioners. Thus this dataset will facilitate discussion on how to guarantee a system recognises medical queries, in order to avoid generating medical output.
R-U-A-Robot	7926	The R-U-A-Robot dataset is a written English dataset consisting of 6800 variations on queries relating to the intent of ‘Are you a robot?’, such as ‘I’m a man, what about you?’. The dataset was created via a context-free grammar template, crowd-sourcing and pre-existing data sources. It consists of 2,720 positive examples (where given the query, it is appropriate for the system to state its non-human identity), 3,400 negative/adversarial examples and 680‘ambiguous-if-clarify’ examples (where it is unclear whether the system is required to state its identity). The dataset was created to promote transparency which may be required when the user receives unsolicited phone calls from artificial systems. Given systems like Google Duplex, and the criticism it received for human-sounding outputs, it is also highly plausible for the user to be deceived regarding the outputs generated by other NLP-based systems. Thus we choose this dataset to understand how to enforce such disclosure requirements. We collapse the positive and ambiguous examples into one label, following the principle of ‘better be safe than sorry’, i.e. prioritising a high recall system.

Networks:

Input dimension Layers type Layers number Layers size Activation functions

30 Fully connected 2 (128, 2) ReLU

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

safeNLP - Safety-critical NLP benchmark for VNNCOMP

Structure

Generate benchmarks

Details

About

Releases

Packages

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 10 Commits
data		data
onnx		onnx
vnnlib		vnnlib
README.md		README.md
generate_properties.py		generate_properties.py

ANTONIONLP/safeNLP

Folders and files

Latest commit

History

Repository files navigation

safeNLP - Safety-critical NLP benchmark for VNNCOMP

Structure

Generate benchmarks

Details

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages