This is the code for the paper [Short Text Hashing Improved by Integrating Multi-Granularity Topics and Tags. CICLing 2015]
Switch branches/tags
Nothing to show
Clone or download
Fetching latest commit…
Cannot retrieve the latest commit at this time.
Permalink
Failed to load latest commit information.
Code
Data/SearchSnippets
LICENSE
README.md

README.md

Jiaming Xu
jacoxu@msn.com
Mar. 3, 2015

Short Text Hashing Improved by Integrating Multi-Granularity Topics and Tags

Usage:
The main function: ./Code/main.m, please run this file;

Notices:
1. Give a example on SearchSnippets dataset: ./Data/SearchSnippets/SearchSnippets.mat
2. The max memory will be cost 5GB RAM;
3. The max disk space will be cost 100MB;
4. Topic models are pre-trained by A Java Implementation of Latent Dirichlet Allocation (LDA)
website: http://jgibblda.sourceforge.net/
5. The result figures will be saved in the folder: ./fig/
6. This is a demo package includes the all details about porposed methods.
7. Please feel free to send me emails if you have any problems in using this package.

Some of the code is modified from

  • Self-taught hashing for fast similarity search. sigir, 2010
  • Composite hashing with multiple information sources. sigir, 2011

This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License.