long-gong/datasets-E2H


E2H: Euclidean Datasets to Hamming Datasets

E2H is the preprocessing tool used in our paper: Long Gong, Huayi Wang, Mitsunori Ogihara, and Jun Xu. 2020. IDEC: indexable distance estimating codes for approximate nearest neighbor search. Proc. VLDB Endow. 13, 9 (May 2020), 1483–1497. DOI: https://doi.org/10.14778/3397230.3397243. It converts Euclidean datasets to Hamming datasets.

Install Dependencies

./install_deps.sh

Usage

make <dataset>
./<dataset> m 

<dataset>: one of audio|glove|mnist|enron|sift1m|gist1m|sift1b|gist80m
m: dimension of the Hamming data (suggested value: the original dimension rounded to a multiple of 64)
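
The suggested value for m can be computed with a little integer arithmetic. The sketch below is only an illustration: `round_to_64` is a hypothetical helper that is not part of E2H, and it assumes "rounding" means rounding to the nearest multiple of 64 (ties rounded up), with a minimum of 64. If rounding up is intended instead, replace `dim + 32` with `dim + 63`.

```shell
#!/usr/bin/env bash
# Hypothetical helper (not part of E2H): round a dimension to the nearest
# multiple of 64. Assumption: ties round up, and the result is at least 64.
round_to_64() {
    local dim=$1
    # Integer division truncates, so adding half of 64 rounds to nearest.
    local m=$(( (dim + 32) / 64 * 64 ))
    if [ "$m" -lt 64 ]; then
        m=64
    fi
    echo "$m"
}

round_to_64 128   # prints 128
round_to_64 100   # prints 128
round_to_64 784   # prints 768
```

The printed value would then be passed as m following the usage pattern above, for example `./<dataset> "$(round_to_64 784)"` for a dataset whose original dimension is 784.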