HOI-Net

This is the PyTorch implementation of HOI-Net: Li N, Wang J, Luo Z, et al. High-Order-Interaction for weakly supervised Fine-Grained Visual Categorization[J]. Neurocomputing, 2021, 464: 27-36. The paper is available at https://www.sciencedirect.com/science/article/abs/pii/S0925231221013060

Abstract

Fine-Grained Visual Categorization (FGVC) is a challenging task due to the large intra-subcategory and small inter-subcategory variances. Recent studies tackle this task in a weakly supervised manner without using the part annotation from the experts. Of those, methods based on bilinear pooling are one of the main categories for computing the interaction between deep features and have shown high effectiveness. However, these methods mainly focus on the correlation within one specific layer but largely ignore the high interactions between multiple layers. In this study, we argue that considering the high interaction between the features from multiple layers can help to learn more distinguishing fine-grained features. To this end, we propose a High-Order-Interaction (HOI) method for FGVC. In our HOI, an efficient cross-layer trilinear pooling is introduced to calculate the third-order interaction between three different layers. Third-order interactions of different combinations are then fused to form the final representation. HOI can produce more discriminative representations and be readily integrated with the two popular techniques, attention mechanism, and triplet loss, to obtain superposed improvement. Extensive experiments conducted on four FGVC datasets show the great superiority of our method over bilinear-based methods and demonstrate that the proposed method achieves the state of the art.
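To make the idea of a third-order interaction between three layers concrete, here is a minimal sketch. It is not the code from this repository or the paper: the function names, the toy shapes, and the signed-square-root plus L2 normalization step (a common convention in bilinear pooling) are assumptions for illustration. It uses NumPy so that it runs without a GPU, and it assumes the three feature maps have already been projected to the same channel count and spatial size.

```python
import numpy as np


def full_trilinear(x, y, z):
    """Full third-order interaction: sum over positions p of the
    outer product x_p (x) y_p (x) z_p.

    x, y, z: arrays of shape (C, N), where N = H * W spatial positions.
    Returns an array of shape (C, C, C), which grows cubically with C.
    """
    return np.einsum("ip,jp,kp->ijk", x, y, z)


def hadamard_trilinear(x, y, z):
    """Cheaper variant (an assumption, not necessarily the paper's form):
    element-wise product of the three maps, sum-pooled over positions.
    Returns an array of shape (C,).
    """
    return (x * y * z).sum(axis=1)


def signed_sqrt_l2(v, eps=1e-12):
    """Signed square root followed by L2 normalization."""
    v = np.sign(v) * np.sqrt(np.abs(v))
    return v / (np.linalg.norm(v) + eps)


# Toy feature maps standing in for three different layers.
rng = np.random.default_rng(0)
C, H, W = 4, 3, 3
x, y, z = (rng.standard_normal((C, H * W)) for _ in range(3))

full = full_trilinear(x, y, z)       # shape (C, C, C)
cheap = hadamard_trilinear(x, y, z)  # shape (C,)
feature = signed_sqrt_l2(cheap)      # unit-length descriptor

# The cheap variant equals the diagonal of the full tensor.
diagonal = np.array([full[i, i, i] for i in range(C)])
print(np.allclose(diagonal, cheap))
```

The full tensor has C^3 entries, which is impractical for real channel counts, so an efficient formulation has to avoid materializing it. The element-wise variant above is one simple way to do that; descriptors from several layer combinations could then be concatenated with `np.concatenate`.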

Figure1

Figure 1: (a) The challenge of FGVC on the CUB-200-2011 dataset. Bird samples of the same subcategory may differ greatly, while bird samples of different subcategories may look very similar. (b) Effectiveness of the proposed High-Order-Interaction (HOI). Ordinary CNNs cannot find discriminative regions without part annotations and thus fail to recognize samples from similar subcategories. HOI can activate important parts and thus accurately distinguish samples from similar subcategories.

Figure2

Figure 2: The overall framework of our proposed High-Order-Interaction (HOI) method for fine-grained visual categorization. The model mainly consists of two levels: (1) Feature Attention Pyramid, and (2) High Order Interaction. In level 1, we extract features from multiple different layers and use the attention mechanism to improve discrimination. In level 2, several features obtained by using the cross-layer trilinear pooling are concatenated together to form the final feature representation of a given image. Finally, the cross-entropy and the triplet loss function are jointly used to optimize the model.
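The joint objective mentioned in the caption can be sketched as a weighted sum of cross-entropy and a triplet margin loss. This is only an illustration under stated assumptions: the `weight` and `margin` parameters, their default values, the Euclidean distance, and the function names are hypothetical and are not taken from the paper or from this repository.

```python
import numpy as np


def cross_entropy(logits, labels):
    """Mean cross-entropy. logits: (B, K) floats, labels: (B,) ints."""
    shifted = logits - logits.max(axis=1, keepdims=True)  # numerical stability
    log_probs = shifted - np.log(np.exp(shifted).sum(axis=1, keepdims=True))
    return -log_probs[np.arange(len(labels)), labels].mean()


def triplet_loss(anchor, positive, negative, margin=1.0):
    """Mean triplet margin loss with Euclidean distance.
    anchor, positive, negative: (B, D) embeddings.
    """
    d_pos = np.linalg.norm(anchor - positive, axis=1)
    d_neg = np.linalg.norm(anchor - negative, axis=1)
    return np.maximum(d_pos - d_neg + margin, 0.0).mean()


def joint_loss(logits, labels, anchor, positive, negative,
               weight=1.0, margin=1.0):
    """Cross-entropy plus a weighted triplet term (weight is hypothetical)."""
    return (cross_entropy(logits, labels)
            + weight * triplet_loss(anchor, positive, negative, margin))


# Toy batch: 2 samples, 4 classes, 3-dimensional embeddings.
rng = np.random.default_rng(0)
logits = rng.standard_normal((2, 4))
labels = np.array([0, 3])
anchor, positive, negative = (rng.standard_normal((2, 3)) for _ in range(3))
print(joint_loss(logits, labels, anchor, positive, negative))
```

In PyTorch, the same two terms are available as `torch.nn.CrossEntropyLoss` and `torch.nn.TripletMarginLoss`.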

Compatibility

The code was tested with PyTorch 1.9.0+cu111 and Python 3.6.9 on Ubuntu 18.04. CPU: 64 Intel(R) Xeon(R) Gold 5218 CPU @ 2.30GHz. GPU: 3090.
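To compare a local setup against the tested configuration, a small check like the following may help. It is a sketch, not part of this repository; the function name `describe_environment` is hypothetical, and PyTorch is imported lazily so the script still runs when it is not installed.

```python
import platform


def describe_environment():
    """Collect the Python, OS, and (if installed) PyTorch/CUDA versions."""
    info = {
        "python": platform.python_version(),
        "os": platform.platform(),
    }
    try:
        import torch  # optional: only reported when available
        info["torch"] = torch.__version__
        info["cuda_build"] = torch.version.cuda
        info["cuda_available"] = torch.cuda.is_available()
    except ImportError:
        info["torch"] = None
    return info


for key, value in describe_environment().items():
    print("{}: {}".format(key, value))
```

Other versions may work as well; only the configuration listed above is stated as tested.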

Preparing FGVC CUB-200-2011 Datasets

You can download the CUB-200-2011 dataset from http://www.vision.caltech.edu/visipedia/CUB-200-2011.html or https://pan.baidu.com/s/1JQxa3DYDrM329skC73kbzQ
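After extraction, the official archive contains `images.txt`, `image_class_labels.txt`, and `train_test_split.txt`, each with one `<image_id> <value>` pair per line. The sketch below lists the test images with zero-based labels from those files. It is an illustration only: the function names are hypothetical, and the format this repository expects in `cub_test_list.txt` is not documented here, so the output may need adapting.

```python
from pathlib import Path


def read_pairs(path):
    """Read '<key> <value>' lines into a dict, keeping file order."""
    pairs = {}
    with open(path, encoding="utf-8") as handle:
        for line in handle:
            line = line.strip()
            if line:
                key, value = line.split(" ", 1)
                pairs[key] = value
    return pairs


def list_cub_test_images(root):
    """Return [(relative_image_path, zero_based_label), ...] for test images.

    root: the extracted dataset directory that contains images.txt.
    """
    root = Path(root)
    names = read_pairs(root / "images.txt")
    labels = read_pairs(root / "image_class_labels.txt")
    is_train = read_pairs(root / "train_test_split.txt")
    return [
        (names[image_id], int(labels[image_id]) - 1)  # class ids start at 1
        for image_id in names
        if is_train[image_id] == "0"  # "1" marks a training image
    ]


# Example usage (the directory name is an assumption about where you extracted it):
# test_items = list_cub_test_images("CUB_200_2011")
# print(len(test_items))
```

Image paths in `images.txt` are relative to the `images/` folder inside the dataset directory.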

Running testing

python test.py
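Results on CUB-200-2011 are conventionally reported as top-1 accuracy. The sketch below shows how that metric is computed from per-class scores. It is not taken from `test.py`; the function names and the toy scores are made up for illustration.

```python
def argmax(scores):
    """Index of the largest score in a sequence."""
    return max(range(len(scores)), key=scores.__getitem__)


def top1_accuracy(predictions, labels):
    """Fraction of samples whose predicted class equals the true class."""
    if len(predictions) != len(labels):
        raise ValueError("predictions and labels must have the same length")
    if not labels:
        raise ValueError("at least one sample is required")
    correct = sum(int(p == t) for p, t in zip(predictions, labels))
    return correct / len(labels)


# Toy example: three samples, three classes.
scores = [
    [0.1, 0.7, 0.2],
    [0.9, 0.05, 0.05],
    [0.2, 0.3, 0.5],
]
labels = [1, 0, 1]
predictions = [argmax(row) for row in scores]
accuracy = top1_accuracy(predictions, labels)
print("top-1 accuracy: {:.4f}".format(accuracy))
```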

Results

You can download the HOI-Net(resnet50) and HOI-Net(resnet101) models from https://pan.baidu.com/s/1P0uqWbEN9u2MFyQ5mPWoLQ?pwd=qgt9 (code: qgt9) and https://pan.baidu.com/s/1NU8rk3gHCRuSVegrsZtTTA?pwd=qa57 (code: qa57). Our models achieve the following performance on the CUB-200-2011 dataset.


