Multi-Scale Cross-Attention Transformer Encoder for Event Calssification

A Transformer encoder model that incorporates different scale information via a multi-modal network with self and cross-attention layers. The package is based on (arXiv:2401.00452 [hep-ph] ). The current version of the package is very generic and spans three configurations as follows:

Model_1: Transformer model with a single input and self-attention heads.
Model_2: Transformer model with three inputs; each input is followed by transformer layers with self-attention heads, and the output is fed to transformer layers with cross-attention heads.
Model_3: Transformer model with two inputs; each input is followed by transformer layers with self-attention heads, and the output is fed to transformer layers with cross-attention heads.

$~~~~~~~~~~~$ Table of content

$~~~~~~~~~~~$ $~~~~~~~~~~~$ 1. Introduction

$~~~~~~~~~~~$ $~~~~~~~~~~~$ 2. Requirements

$~~~~~~~~~~~$ $~~~~~~~~~~~$ 4. Getting start

$~~~~~~~~~~~$ $~~~~~~~~~~~$ 3. Package structure

$~~~~~~~~~~~$ $~~~~~~~~~~~$ 6. Flow Chart of the networks

Introduction

Information about jet identification provides powerful insights into collision events and can %help separating help to separate different physics processes originating these. This information can be extracted from the elementary particles localized inside a jet. Recently, various methods have been used to exploit the substructure of a jet to probe new physics signatures using advanced Machine Learning (ML) techniques. Conversely, using the reconstructed kinematics from the final state jets for event classification spans the full phase space and exhibits large classification performance. Such high-level kinematics (i.e., encoding the global features of the final state particles), possibly together with the knowledge of the properties of (known or assumed) resonant intermediate particles, remains blind to the information encoded inside the final state jets. A possible way to extract information from both jet substructure and global jet kinematics is to concatenate the information extracted from a multi-modal network. However, such a simple concatenation leads to an imbalance of the extracted information, within which the kinematic information generally dominates. We present a novel method for incorporating different-scale information extracted from both global kinematics and substructure of jets via a transformer encoder with a cross-attention layer. The model initially extracts the most relevant information from each dataset individually using self-attention layers before incorporating these using a cross-attention layer. The method demonstrates a larger improvement in classification performance compared to the simple concatenation method.

Requirements

To run the package you need python3 with the following modules:

Numpy
TensorFlow
sklearn
matplotlib

Requirements can be easily installed by pip3 install module or the user can use the given enviroment.yml to create a conda enviroment.

Get start

To run the package, the user has to fill the file input.py for the used model. For example, if the user wants to use model_1, then only the corresponding lines for model_1 have to be filled, while all other inputs are ignored by the code. To run the code, type in the terminal: python3 run.py.

The network assumes the signal events are in one file and all the backgrounds are in one file in the numpy format ".npz," which can be easily obtained by the command numpy.savez_compressed().

A demo version is also provided, in which the user can run it to test the package. To run the demo version, type in the terminal: python3 run_demo.py.

Structure of the code

The package consists of the following files:

input.py Input file which has to be filled by the user to control the network structure
run.py The run file which takes as input the files in the source directory and the input.py
run_demo.py A demo version of the code in wich the user can run it for test.
data/ Data directory contains the signal and background files for the demo.
source/transformer.py Source code for the transformer network.
source/train.py Source code for the training and teseting loop for each model.
source/Analysis.py Example of Delphes analysis that the user can consider.
source/prepare_files.py Source code to prepae the input files to the network.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Multi-Scale Cross-Attention Transformer Encoder for Event Calssification

$~~~~~~~~~~~$ Table of content

Introduction

Requirements

Get start

Structure of the code

Flow chart example of model_1 , the user controls the structure of the model from the input file

Flow chart example of model_2, the user controls the structure of the model from the input file

Flow chart example of model_3, the user controls the structure of the model from the input file

About

Releases

Packages

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 58 Commits
data		data
source		source
README.md		README.md
enviroment.yml		enviroment.yml
input.py		input.py
run.py		run.py
run_demo.py		run_demo.py

AHamamd150/Multi-Scale-Transformer-encoder

Folders and files

Latest commit

History

Repository files navigation

Multi-Scale Cross-Attention Transformer Encoder for Event Calssification

$~~~~~~~~~~~$ Table of content

Introduction

Requirements

Get start

Structure of the code

Flow chart example of model_1 , the user controls the structure of the model from the input file

Flow chart example of model_2, the user controls the structure of the model from the input file

Flow chart example of model_3, the user controls the structure of the model from the input file

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages