Odyssey: Creation, Analysis and Detection of Trojan Models

Marzieh Edraki¹, Nazmul Karim ²
Nazanin Rahnavard^1,2, Ajmal Mian ³ and Mubarak Shah¹
¹ Center for Research in Computer Vision
²Department of Electrical and Computer Engineering
The University of Central Florida
³ School of Computer Science and Software Engineering
The University of Western Australia

Odyssey is a comprehensive study on creating, analaysing and detecting Trojan models conducted jointly by CRCV group, MIG group and LCWN lab.

Introduction

Trojan attack is one of the recent variant of data poisoning attacks that involves manipulation or modification of the model to act balefully. This can occur when an attacker interferes with the training pipeline by inserting triggers into some of the training samples and trains the model to act maliciously only for samples that are stamped with trigger. Since the knowledge of such triggers is only privy to the attacker, detection of Trojan behavior is a challenge task.

A major reason for the lack of a realistic Trojan detection method has been the unavailability of a large-scale benchmark dataset, consisting of clean and Trojan models. Here we introduce Odysseus the largest public dataset that contains over 3,000 trained clean and Tojan models based on Pytorch.

Odysseus

While creating Odysseus, we focused on several factors such as mapping type, model architectures, fooling rate and validation accuracy of each model and also type of trigger. These models are trained on CIFAR10, Fashion-MNIST and MNIST datasets. For each dataset, clean and Trojan models are trained for 4 different architectures. Namely Resent18, VGG19, Densenet and GoogleNet for CIFAR10 and Fashion-MNIST and 4 custom designed architectures for MNIST. We also considered various source to target label mapping for the Trojan models.

The overall statistics of clean and Trojan models per datasets is as follow:

Trojan Detector

By analyzing the models in Odysseus, we found that Trojan attacks affect the intrinsic properties of NN models. Specifically they can affect

The classifier margin.
The non-linearity of the deciscion boundaries around clean data.

Based on these findings, we propose a new Trojan detector that sets a high baseline for the Odysseus dataset.

The details of this study can be found here

Accessing Odysseus

You can download the whole dataset or part of it based on your need.

Name		Name	Last commit message	Last commit date
Latest commit History 21 Commits
Odysseus		Odysseus
fig		fig
Dataset Descritption.pdf		Dataset Descritption.pdf
README.md		README.md
Readme_detector.txt		Readme_detector.txt
fake_trojan_detector.py		fake_trojan_detector.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Odyssey: Creation, Analysis and Detection of Trojan Models

Introduction

Odysseus

Trojan Detector

Accessing Odysseus

About

Releases

Packages

Languages

nazmul-karim170/Odessey

Folders and files

Latest commit

History

Repository files navigation

Odyssey: Creation, Analysis and Detection of Trojan Models

Introduction

Odysseus

Trojan Detector

Accessing Odysseus

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages