Dirichlet-Derived Multiple Topic Scene Classification Model Fusing Heterogeneous Features for High Resolution Remote Sensing Imagery
by Bei Zhao, Yanfei Zhong, Guisong Xia, and Liangpei Zhang
This is an official implementation of DMTM in our TGRS 2016 paper "Dirichlet-Derived Multiple Topic Scene Classification Model Fusing Heterogeneous Features for High Resolution Remote Sensing Imagery".
Abstract
Due to the complex arrangements of the ground objects in high spatial resolution (HSR) imagery scenes, HSR imagery scene classification is a challenging task, which is aimed at bridging the semantic gap between the low-level features and high-level semantic concepts. A combination of multiple complementary features for HSR imagery scene classification is considered to be a potential way to improve the performance. However, the different types of features have different characteristics, and how to fuse the different types of features is a classic problem. In this paper, a Dirichlet-derived multiple topic model (DMTM) is proposed to fuse heterogeneous features at a topic level for HSR imagery scene classification. An efficient algorithm based on a variational expectation-maximization framework is developed to infer the DMTM and estimate the parameters of the DMTM. The proposed DMTM scene classification method is able to incorporate different types of features with different characteristics, no matter whether these features are local or global, discrete or continuous. Meanwhile, the proposed DMTM can also reduce the dimension of the features representing the HSR images. In our experiments, three types of heterogeneous features—local spectral features, local structural features, and global textural features—were employed. The experimental results with three different HSR imagery datasets show that the three types of features are complimentary, and the proposed DMTM is able to reduce the dimension of the features representing the HSR images, fuse the different types of features efficiently, and improve the performance of the scene classification over that of other scene classification algorithms based on spatial pyramid matching (SPM), probabilistic latent semantic analysis (PLSA), and latent Dirichlet allocation (LDA).
- 2020/08/28, We release the code of DMTM.
- Combine the spectral, structural and texture information
- Incorporate different types of features with different characteristics at a topic level for HSR imagery scene classification.
If you use DMTM in your research, please cite the following paper:
@article{article,
author = {Zhao, Bei and Zhong, Yanfei and Xia, Guisong and Zhang, Liangpei},
year = {2016},
month = {04},
pages = {2108-2123},
title = {Dirichlet-Derived Multiple Topic Scene Classification Model Fusing Heterogeneous Features for High Resolution Remote},
volume = {54},
journal = {IEEE Transactions on Geoscience and Remote Sensing},
doi = {10.1109/TGRS.2015.2496185}
}
System Requirements: linux
Compiler Environment:
cd DMTM/code/topic_model/code
run makeThe “HOME” path in the makefile should be changed to the path of DMTM/code/topic_model/code
The project should be organized as:
DMTM
├── code
│ ├── feat_coding /feature coding
│ ├── libsvm-3.20 //tools of SVM classifier
│ ├── scenefeature //feature extraction
│ ├── topic_model //LDA for scene classification
│ ├── util
│ ├── vlfeat-0.9.17 //tools of K-means
│ ├── demo_cal_MeanStd_KM.m
│ ├── demo_cal_SIFT_KM.m
│ ├── demo_cal_Texture_KM.m
│ ├── DMTM_main.m
├── data // dataset
│ ├── Google dataset of SIRI-WHU (http://rsidea.whu.edu.cn/resource_sharing.htm)
run 'DMTM_main.m'Meaning of Variables:
"12class_tif": a folder for storing image data image_data_dir: the path of "12class_tif" , the folder name in the path cannot have spaces. out_meanstd_feature_dir, out_sift_feature_dir, out_siti_feature_dir, : The output path of final feature
*The folder name in the path cannot have spaces.
*Texture feature extraction (siti) needs to be carried out in the c++ 2010 environment.
cd DMTM/code/topic_model
run './mflda dmtm_google.txt' (dmtm_google.txt is the configuration file)The copyright belongs to Intelligent Data Extraction, Analysis and Applications of Remote Sensing(RSIDEA) academic research group,State Key Laboratory of Information Engineering in Surveying, Mapping, and Remote Sensing (LIESMARS),Wuhan University. This program is for academic use only. For commercial use, please contact Professor.Zhong(zhongyanfei@whu.edu.cn). The homepage of RSIDEA is: http://rsidea.whu.edu.cn/index.html