This is the official repository for the paper "Fine-Grained Scene Image Classification with Modality-Agnostic Adapter", to appear in ICME 2024.
conda env create -f environment.yml
conda activate maa
pip install -r requirements.txt
git clone https://github.com/matt-peters/allennlp.git
git clone https://github.com/allenai/kb.git
cd allennlp
git checkout 2d7ba1cb108428aaffe2dce875648253b44cb5ba
pip install -e .
cd ..
cd kb
pip install -r requirements.txt
python -c "import nltk; nltk.download('wordnet')"
pip install https://github.com/explosion/spacy-models/releases/download/en_core_web_sm-2.0.0/en_core_web_sm-2.0.0.tar.gz
pip install -e .
cd ..
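After installation, the following sanity check (a minimal sketch; the kb package name and the en_core_web_sm model name follow from the install commands above) can confirm that the dependencies resolve:

# Minimal sketch: confirm the installed dependencies import cleanly.
import allennlp   # editable install from matt-peters/allennlp
import kb         # editable install from allenai/kb (package name assumed to be 'kb')
import spacy
from nltk.corpus import wordnet

nlp = spacy.load("en_core_web_sm")   # model installed from the tarball above
print("spaCy tokens:", [t.text for t in nlp("sanity check")])
print("WordNet synsets for 'bottle':", len(wordnet.synsets("bottle")))
print("allennlp and kb imported successfully")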
Images for the ConText and Activity datasets can be downloaded from the links above.
Annotation files and Google OCR results can be downloaded here.
datasets
├── activity
│   ├── images
│   ├── text.json
│   └── split_0.json
└── context
    ├── images
    ├── text.json
    └── split_0.json
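Before training, you can check that the datasets match the layout above (a minimal sketch; it only verifies the paths shown in the tree):

# Minimal sketch: verify the dataset layout matches the tree above.
from pathlib import Path

root = Path("datasets")
for dataset in ("activity", "context"):
    for item in ("images", "text.json", "split_0.json"):
        path = root / dataset / item
        print(path, "ok" if path.exists() else "MISSING")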
To train, run:
python main.py -c CONFIG_PATH
For example:
python main.py -c configs/train_context.toml
python main.py -c configs/train_activity.toml
You can also override config parameters on the command line like this:
python main.py -c CONFIG_PATH --cfgs OUTPUT_DIR outputs NUM_EPOCHS 50 BATCH_SIZE_PER_GPU 8
The parameters after --cfgs are config items in configs/*.toml.
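For reference, each config is a TOML file of such items, and the key/value pairs given after --cfgs override the values loaded from it. The sketch below is an assumed illustration of that mechanism (key names are taken from the example above; it is not the repository's actual loader):

# Minimal sketch: load a TOML config and apply --cfgs style KEY VALUE overrides.
# Illustrative only; not the repository's actual implementation.
import tomllib  # Python 3.11+; older interpreters can use the third-party 'toml' package

with open("configs/train_context.toml", "rb") as f:
    cfg = tomllib.load(f)

# e.g. --cfgs OUTPUT_DIR outputs NUM_EPOCHS 50 BATCH_SIZE_PER_GPU 8
overrides = ["OUTPUT_DIR", "outputs", "NUM_EPOCHS", "50", "BATCH_SIZE_PER_GPU", "8"]
for key, value in zip(overrides[::2], overrides[1::2]):
    old = cfg.get(key)
    # Preserve the original type where one exists (e.g. NUM_EPOCHS stays an int).
    cfg[key] = type(old)(value) if old is not None else value

print(cfg.get("OUTPUT_DIR"), cfg.get("NUM_EPOCHS"), cfg.get("BATCH_SIZE_PER_GPU"))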
To test, run:
python main.py -c TEST_CONFIG_PATH
Related repositories:
https://github.com/MCLAB-OCR/KnowledgeMiningWithSceneText
https://github.com/AndresPMD/Fine_Grained_Clf