Multi-Modal Domain Fusion for Multi-modal Aerial View Object Classification

In this work, we explore a methodology that uses information from both EO and SAR sensors to improve the performance of ATR (Automatic Target Recognition) systems by compensating for the shortcomings of each sensor. A novel Multi-Modal Domain Fusion (MDF) network is proposed to learn domain-invariant features from multi-modal data and use them to accurately classify aerial view objects. The proposed MDF network achieves top-10 performance in Track-1 (test on SAR data only) with an accuracy of 25.3% and top-5 performance in Track-2 (test on EO and SAR data) with an accuracy of 34.26% in the test phase on the PBVS MAVOC Challenge dataset.

Check out our paper here.

We advise using a conda environment to run the package. Run the following commands to create and activate an environment with all the necessary modules:

conda env create -f environment.yaml 
conda activate PBVS

To reproduce our results, first download the NTIRE/PBVS MAVOC challenge dataset, then follow these steps:

Place the test data, the test script (result_gen.py), the trained model file (.pth file) and the results.csv file in the working directory to avoid path issues. Then activate the environment with: conda activate <environment_name>
Replace the path defined in the img_folder1 variable in the result_gen.py file with the path of the EO test images.
Replace the path defined in the img_folder2 variable in the result_gen.py file with the path of the SAR test images.
Create an empty results.csv file containing only the headers "image_id" and "class_id" in the working directory.
Run result_gen.py with the command: python result_gen.py
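
The header-only results.csv described above can also be created with a few lines of Python. This is a minimal sketch, assuming result_gen.py expects a comma-separated file whose first row holds the two column names; the helper name create_results_csv is hypothetical and not part of this repository.

```python
import csv
from pathlib import Path


def create_results_csv(path="results.csv"):
    """Write a results file that contains only the header row.

    Hypothetical helper: the column names come from the steps above,
    and the comma-separated layout is an assumption.
    """
    path = Path(path)
    # newline="" lets the csv module control line endings itself.
    with path.open("w", newline="") as f:
        csv.writer(f).writerow(["image_id", "class_id"])
    return path


if __name__ == "__main__":
    create_results_csv()
```

Run it once from the working directory before executing result_gen.py. Note that it overwrites any existing results.csv.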

The results.csv file will be filled with the predictions, which you may use to calculate the test accuracy. That is all you need to reproduce our results!
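
If you have ground-truth labels for the test images, the accuracy can be computed directly from results.csv. The sketch below assumes a hypothetical labels.csv that uses the same "image_id" and "class_id" columns; neither that file nor the function names are provided by this repository.

```python
import csv


def load_labels(path):
    """Map image_id -> class_id from a CSV with those two header columns."""
    with open(path, newline="") as f:
        return {row["image_id"]: row["class_id"] for row in csv.DictReader(f)}


def accuracy(pred_path, truth_path):
    """Fraction of ground-truth images whose predicted class matches.

    Images missing from the predictions count as wrong.
    Class ids are compared as strings, exactly as written in the files.
    """
    preds = load_labels(pred_path)
    truth = load_labels(truth_path)
    if not truth:
        raise ValueError("ground-truth file has no rows")
    correct = sum(
        preds.get(image_id) == class_id for image_id, class_id in truth.items()
    )
    return correct / len(truth)


if __name__ == "__main__":
    # "labels.csv" is an assumed file name for the ground truth.
    print(f"accuracy: {accuracy('results.csv', 'labels.csv'):.4f}")
```

Because the comparison is string-based, "3" and "3.0" would not match; normalise the class ids first if the two files format them differently.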

The pretrained model checkpoint can be found here.

For Track-1 (SAR only), you will find the model checkpoints here and here.

For Track-2 (EO and SAR), you will find the model checkpoint here.

Citation

If you find this repo useful for your work, please cite our paper:

@misc{udupa2022multimodal,
      title={Multi-Modal Domain Fusion for Multi-modal Aerial View Object Classification}, 
      author={Sumanth Udupa and Aniruddh Sikdar and Suresh Sundaram},
      year={2022},
      eprint={2212.07039},
      archivePrefix={arXiv},
      primaryClass={cs.CV}
}

Sumanth181099/PBVS_MAVOC2022
