Skip to content

Official Implementation of IEEE TIFS paper Odyssey: Creation, Analysis and Detection of Trojan Models

Notifications You must be signed in to change notification settings

nazmul-karim170/Odessey

 
 

Repository files navigation

Odyssey: Creation, Analysis and Detection of Trojan Models

Marzieh Edraki1, Nazmul Karim 2
Nazanin Rahnavard1,2, Ajmal Mian 3 and Mubarak Shah1
1 Center for Research in Computer Vision
2Department of Electrical and Computer Engineering
The University of Central Florida
3 School of Computer Science and Software Engineering
The University of Western Australia

Odyssey is a comprehensive study on creating, analaysing and detecting Trojan models conducted jointly by CRCV group, MIG group and LCWN lab.

Introduction

Trojan attack is one of the recent variant of data poisoning attacks that involves manipulation or modification of the model to act balefully. This can occur when an attacker interferes with the training pipeline by inserting triggers into some of the training samples and trains the model to act maliciously only for samples that are stamped with trigger. Since the knowledge of such triggers is only privy to the attacker, detection of Trojan behavior is a challenge task.

A major reason for the lack of a realistic Trojan detection method has been the unavailability of a large-scale benchmark dataset, consisting of clean and Trojan models. Here we introduce Odysseus the largest public dataset that contains over 3,000 trained clean and Tojan models based on Pytorch.

Odysseus

While creating Odysseus, we focused on several factors such as mapping type, model architectures, fooling rate and validation accuracy of each model and also type of trigger. These models are trained on CIFAR10, Fashion-MNIST and MNIST datasets. For each dataset, clean and Trojan models are trained for 4 different architectures. Namely Resent18, VGG19, Densenet and GoogleNet for CIFAR10 and Fashion-MNIST and 4 custom designed architectures for MNIST. We also considered various source to target label mapping for the Trojan models.

alt tag

The overall statistics of clean and Trojan models per datasets is as follow:

alt tag

Trojan Detector

By analyzing the models in Odysseus, we found that Trojan attacks affect the intrinsic properties of NN models. Specifically they can affect

  • The classifier margin.
  • The non-linearity of the deciscion boundaries around clean data.

Based on these findings, we propose a new Trojan detector that sets a high baseline for the Odysseus dataset.

alt tag

The details of this study can be found here

Accessing Odysseus

You can download the whole dataset or part of it based on your need.

About

Official Implementation of IEEE TIFS paper Odyssey: Creation, Analysis and Detection of Trojan Models

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • Python 100.0%