ACENet: Adaptive Context Enhancement Network for RGB-T Video Object Detection

Zhengzheng Tu, Le Gu, Danyin Lin, and Zhicheng Zhao.

Introduction

This repository is the official implementation for "ACENet: Adaptive Context Enhancement Network for RGB-T Video Object Detection".

Abstract

RGB-thermal (RGB-T) video object detection (VOD) aims to leverage the complementary advantages of visible and thermal infrared sensors to achieve robust performance under various challenging conditions, such as low illumination and extreme illumination changes. However, existing multimodal VOD approaches face two critical challenges: accurate detection of objects at different scales and efficient fusion of temporal information from multimodal data. To address these issues, we propose an Adaptive Context Enhancement Network (ACENet) for RGB-T VOD. Firstly, we design an Adaptive Context Enhancement Module (ACEM) to adaptively enhance multi-scale context information. We introduce ACEM in the FPN section, where it can adaptively extract context information and incorporate it into the high-level feature maps. Secondly, we design a Multimodal Temporal Fusion Module (MTFM) to perform temporal and modal fusion using coordinate attention with atrous convolution at the early stage, significantly reducing the complexity of fusing temporal information from RGB and thermal data. Experimental results on the VT-VOD50 dataset show that our ACENet significantly outperforms other mainstream VOD methods. Our code is available at: https://github.com/bscs12/ACENet.

Getting Started

Prepare Environment

Clone repository

git clone https://github.com/bscs12/ACENet.git
cd ACENet

Create environment

conda create -n ACENet python=3.7
conda activate ACENet

Install PyTorch

pip install torch==1.7.0+cu101 torchvision==0.8.0+cu101 torchaudio===0.7.0 -f https://download.pytorch.org/whl/torch_stable.html

or

conda install pytorch==1.7.0 torchvision==0.8.0 torchaudio==0.7.0 cudatoolkit=10.1 -c pytorch

Install ACENet

pip install -U pip && pip install -r requirements.txt
pip install -v -e .  # or  python setup.py develop

Install APEX

git clone https://github.com/NVIDIA/apex
cd apex
pip install -v --disable-pip-version-check --no-cache-dir --global-option="--cpp_ext" --global-option="--cuda_ext" ./

Install pycocotools

pip install cython
pip install pycocotools

Prepare Checkpoint

You can download the pretrained checkpoint from [https://github.com/Megvii-BaseDetection/YOLOX].

Prepare Dataset

Download dataset

You can download the MMVOD2022 dataset from [https://pan.baidu.com/s/1S1onHrlVH8s6xF2uXlELYw?pwd=e2de] [Password: e2de].

Prepare your own dataset

Make sure the dataset folder structure like this:

datasets
    ├── MMVOD2022
    │   ├── VOD2022
    │   │   ├── JPEGImages
    │   │   │   ├── L1
    │   │   │   │   ├── RGB
    │   │   │   │   │   ├── xxx.jpg
    │   │   │   │   │   ├── xxx.jpg
    │   │   │   │   │   ├── ...
    │   │   │   │   ├── T
    │   │   │   │   │   ├── xxx.jpg
    │   │   │   │   │   ├── xxx.jpg
    │   │   │   │   │   ├── ...
    │   │   │   ├── L2
    │   │   │   │   ├── RGB
    │   │   │   │   │   ├── xxx.jpg
    │   │   │   │   │   ├── xxx.jpg
    │   │   │   │   │   ├── ...
    │   │   │   │   ├── T
    │   │   │   │   │   ├── xxx.jpg
    │   │   │   │   │   ├── xxx.jpg
    │   │   │   │   │   ├── ...
    │   │   │   ├── ...
    │   │   ├── Annotations
    │   │   │   ├── L1
    │   │   │   │   ├── RGB
    │   │   │   │   │   ├── xxx.xml
    │   │   │   │   │   ├── xxx.xml
    │   │   │   │   │   ├── ...
    │   │   │   ├── ...
    │   │   ├── ImageSets
    │   │   │   ├── Main
    │   │   │   │   ├── train.txt
    │   │   │   │   ├── test.txt

Modify the classes in ACENet/yolox/data/datasets/voc_classes.py

Train

Modify the parameters in ACENet/yolox/exp/yolox_base.py and ACENet/train.py

python train.py

Evaluation

python eval.py

Cite

If you find this work useful for your, please consider citing our paper. Thank you!

Name		Name	Last commit message	Last commit date
Latest commit History 2 Commits
assets		assets
build		build
datasets		datasets
demo		demo
docs		docs
exps		exps
tools		tools
weights		weights
yolox.egg-info		yolox.egg-info
yolox		yolox
ACENet.png		ACENet.png
README.md		README.md
demo.py		demo.py
eval.py		eval.py
requirements.txt		requirements.txt
setup.cfg		setup.cfg
setup.py		setup.py
train.py		train.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

ACENet: Adaptive Context Enhancement Network for RGB-T Video Object Detection

Introduction

Abstract

Getting Started

Prepare Environment

Prepare Checkpoint

Prepare Dataset

Train

Evaluation

Cite

About

Releases

Packages

Languages

tzz-ahu/ACENet

Folders and files

Latest commit

History

Repository files navigation

ACENet: Adaptive Context Enhancement Network for RGB-T Video Object Detection

Introduction

Abstract

Getting Started

Prepare Environment

Prepare Checkpoint

Prepare Dataset

Train

Evaluation

Cite

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages