Data Contamination Can Cross Language Barriers
Overview • Quick Start • Data Release • 🤗 Models • Paper
**Deep Contam** refers to cross-lingual contamination that inflates LLMs' benchmark performance while evading existing detection methods. This repository also provides an effective method to detect it.
To detect potential hidden contamination in a specific model, follow the steps below.
- Install dependencies.

  ```shell
  pip install -r requirements.txt
  ```
- Specify `model_path` and run the following command.

  ```shell
  python detect.py --model_path MODEL_PATH --dataset_name DATA_NAME
  ```
For example,

```shell
python detect.py --model_path 'microsoft/phi-2' --dataset_name MMLU,ARC-C,MathQA
```
The output would be:

```
MMLU
original: 23.83    generalized: 25.02    difference: +1.20
----------------------
ARC-C
original: 42.92    generalized: 47.27    difference: +4.35
----------------------
MathQA
original: 31.32    generalized: 38.70    difference: +7.38
```
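For each benchmark, the reported `difference` is the generalized-benchmark accuracy minus the original-benchmark accuracy. A minimal sketch of that arithmetic is below; `contamination_gap` is an illustrative helper, not part of the repository's `detect.py` API, and the paper describes how the gap should be interpreted.

```python
def contamination_gap(original_acc: float, generalized_acc: float) -> float:
    """Return the gap reported as `difference` in the detect.py output.

    It is simply the accuracy on the generalized benchmark minus the
    accuracy on the original benchmark, rounded to two decimal places.
    """
    return round(generalized_acc - original_acc, 2)

# Example values taken from the MathQA row of the sample output above.
print(contamination_gap(31.32, 38.70))  # 7.38
```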
The generalized versions of the benchmarks that we constructed to detect potential contamination are released as follows.
Checkpoints of the models into which we deliberately injected cross-lingual contamination are provided as follows.