Adversarial Sber

Usage

Step 0. Install dependencies

poetry install
poetry shell

Reproducibility

To reproduce all our experiments, please, run all bash scripts from ./scripts in numerical order:

01_build_datasets.sh
02_build_vocabs_discretizers.sh
03_train_all_classifiers.sh
04_train_all_lm.sh
05_attack_all_classifiers.sh

Step 1. Building datasets

We are working with following transactional datasets:

Age 1 and Age 1 Short: https://drive.google.com/drive/u/0/folders/1oTkPI5Z091JbXHmOR0N7D-KKpN_9Qiyp
Age 2 (Tinkoff): https://drive.google.com/drive/u/0/folders/1BES-FKzGuTvnmXKYeSKFyDVPIymLR8Fo
Client Leaving (Rosbank): https://drive.google.com/drive/u/0/folders/122DETMAOT_cVKiRnXsVLiJ8gBHyUOhfw

Scoring (Sberbank) dataset is not available for public use due to NDA.

To get the processed datasets, you need to run

bash scripts/01_build_datasets.sh

As a result, in the directory ../data you will get the data for the next experiments in following directories:

.
├── lm
│   ├── train.jsonl
│   └── valid.jsonl
├── substitute_clf
│   ├── train.jsonl
│   └── valid.jsonl
├── target_clf
│   ├── train.jsonl
│   └── valid.jsonl
└── test.jsonl

Each row in .jsonl -- is a dictionary with keys transactions, amounts, label, client_id.

Step 2. Building vocabs and discretizers.

To build vocabulary and train discretizer run:

bash scripts/02_build_vocabs_discretizers.sh

Trained discretizers will be stored in ./presets/${DATASET_NAME}/discretizers/100_quantile, and vocabs in ./presets/${DATASET_NAME}/vocabs/100_quantile.

Experiments

All results will be at ../experiments:

Trained models: ../experiments/trained_models
Result of attacks: ../experiments/attacks

Step 3. Training all classifiers.

To train all classifiers (LSTM, CNN, GRU) run:

bash scripts/03_train_all_classifiers.sh

As a result, all trained models will be stored in ../experiments/trained_models.

If you want to train a certain model, use:

bash scripts/local/train_clf.sh ${config_name} ${clf_type} "100_quantile" ${dataset_name},

where clf_type is "substitute" or "target" and config_name is "gru_with_amounts"/"lstm_with_amounts"/"cnn_with_amounts".

Step 4. Training language models.

To train all lanuage models run:

bash scripts/03_train_all_lm.sh

As a result, all trained language models will be stored in ../experiments/trained_models.

Step 5. Attacking all models

To attack all models run:

bash scripts/05_attack_all_classifiers.sh

The results will be stored in ../experiments/trained_models/attacks. There metrics of resulted attacks will be available at .metrics.json and adversarial data in adversarial.json.

If you want to attack a certain model for fixed dataset, you can use:

bash scripts/local/attack.sh ${subst_clf} ${targ_clf} ${number of samples to attack} ${dataset_name},

where subst_clf and targ_clf are "gru_with_amounts"/"lstm_with_amounts"/"cnn_with_amounts".

Name		Name	Last commit message	Last commit date
Latest commit History 407 Commits
.github/workflows		.github/workflows
advsber		advsber
bin		bin
configs		configs
notebooks		notebooks
presets		presets
scripts		scripts
.flake8		.flake8
.gitignore		.gitignore
.pre-commit-config.yaml		.pre-commit-config.yaml
Makefile		Makefile
README.md		README.md
poetry.lock		poetry.lock
pyproject.toml		pyproject.toml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Adversarial Sber

Usage

Step 0. Install dependencies

Reproducibility

Step 1. Building datasets

Step 2. Building vocabs and discretizers.

Experiments

Step 3. Training all classifiers.

Step 4. Training language models.

Step 5. Attacking all models

About

Releases

Packages

Contributors 4

Languages

fursovia/adversarial_sber

Folders and files

Latest commit

History

Repository files navigation

Adversarial Sber

Usage

Step 0. Install dependencies

Reproducibility

Step 1. Building datasets

Step 2. Building vocabs and discretizers.

Experiments

Step 3. Training all classifiers.

Step 4. Training language models.

Step 5. Attacking all models

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Contributors 4

Languages

Packages