Given and integer
Authors | Name | Codename |
---|---|---|
Jaud, Wirth and Choudhury | mgv |
|
McGregor and Vu | mgvo |
|
Saha and Getoor | sg |
|
Yu and Yuan | yy |
|
Norouzi-Far et al. | ajsaao |
|
Badanidiyuru et al. | bmkk |
g++ -std=c++2a streaming-maximum-cover/*.cpp -o smc -O3
The dataset must be of the following format (one set per line):
<int> <int> ... <int>
<int> <int> ... <int>
.
.
.
<int> <int> ... <int>
An example (chess.dat) is given in the dataset folder. The dataset must also be located next to a file named dataset_infos.txt. It must include a summary of the dataset. Each line in dataset_infos.txt is of the format:
<dataset name> <size (Bytes)> <m> <n> <maximum set size>
An example is given in the dataset folder.
./smc <algo> <path> <dataset> <k> <eps> <inde>
Parameter | Type | Description |
---|---|---|
<path> |
string | path the dataset folder (must end with "/") |
<dataset> |
string | filename of the dataset (must be inside <path> ) |
<k> |
int | number of sets to select |
<eps> |
float | precision parameter mgv , mgvo , bmkk and ajsaao ) |
<inde> |
string | codename of the independence factor mgv and mgvo ) |
The possible values for <inde>
are
Codename | Description |
---|---|
fullsamp |
To execute the full sampling algorithm |
full |
|
opt |
|
pairwise |
For more detail regarding the independence factor
The output consists of one line
<n>,<m>,<k>,<eps>,<algo>-<inde>,<|I|>,<|C|>,<space>,<subsampling time (ms)>,<total time (ms)>,<dataset name>