End-to-end object detection by using query-selection encoder with hierarchical feature-aware attention

News

[1/24/2025] At present, our paper is under review. We are in the process of finalizing the code and documentation to ensure that it is fully functional and user-friendly. All of the code will be publicly available through this github repository soon.

Introduction

In this paper, we introduce a novel query-selection encoder (QSE) designed for end-to-end object detectors to improve training convergence speed and detection accuracy. QSE is composed of multiple encoder layers stacked on top of the backbone. A lightweight head network is added after each encoder layer to continuously optimize features in a cascading manner, providing more positive supervision for efficient training. Additionally, a hierarchical feature-aware attention (HFA) mechanism is devised in each encoder layer, including in-level feature attention and cross-level feature attention, to enhance the interaction between features from different levels. HFA can effectively suppress similar feature representations and highlight the discriminative ones thereby accelerating the feature selection process. Our method is highly versatile in accommodating both CNN-based and transformer-based detectors.

Model Zoo

Running

Install

We implement QSE using MMDetection V2.25.3 and MMCV V1.5.0. The source code of MMdetection has been included in this repo and you only need to build MMCV following official instructions. We test our models under python=3.7.11,pytorch=1.11.0,cuda=11.3. Other versions may not be compatible.

Data

The COCO dataset and LVIS dataset should be organized as:

QSE
└── data
    └── coco
       ├── annotations
       │      ├── instances_train2017.json
       │      └── instances_val2017.json
       ├── train2017
       └── val2017

Training

Testing

Cite our work

License

This project is released under the MIT license. Please see the LICENSE file for more information.

Name		Name	Last commit message	Last commit date
Latest commit History 3 Commits
LICENSE		LICENSE
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

End-to-end object detection by using query-selection encoder with hierarchical feature-aware attention

News

Introduction

Model Zoo

Running

Install

Data

Training

Testing

Cite our work

License

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Folders and files

Latest commit

History

Repository files navigation

End-to-end object detection by using query-selection encoder with hierarchical feature-aware attention

News

Introduction

Model Zoo

Running

Install

Data

Training

Testing

Cite our work

License

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Packages