Skip to content

LiUSemWeb/HeFQUIN-VocabMappingsExperiments

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

12 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

HeFQUIN-VocabMappingsExperiments

This repository contains artifacts and resources for conducting experiments related to the paper ‘Considering Vocabulary Mappings in Query Plans for Federations of RDF Data Sources’ (download camera-ready version).

The complete resources can be downloaded from Zenodo.

Federations

We provide three federations for the evaluation:

  • Fed0: all federation members use the global vocabulary.
  • Fed1: all federation members use the same local vocabulary, which is different from the global vocabulary.
  • Fed2: each federation member uses a different local vocabulary.

For detailed instructions and configuration files to set up each federation, navigate to the federations directory (e.g., Fed0).

Datasets

To construct the aforementioned federations, we use the dataset generator of Lehigh University Benchmark (LUBM). We generate data for ten universities and split them into ten separate datasets (one per university). An instruction for generating datasets can be found in the folder datasets.

  • For Fed1, we directly use the generated datasets and construct vocabulary mappings between the global vocabulary and local vocabulary that are used in the generated datasets.

  • For Fed0, to create datasets with global vocabulary, the script 'construct_globaldatasets.sh' can be found under datasets:

     sh construct_globaldatasets.sh
    
  • For each federation member of Fed2, we change the IRIs of the vocabulary terms used in the dataset of each member by appending a member-specific suffix (ranging from 0 to 9) to each such IRI. So this federation is structurally the same as Fed1, but with a different vocabulary for each federation member.

    To create datasets with local vocabularies, find the script 'rewrite_to_localschema.sh' under the folder datasets and run the following command:

     sh rewrite_to_localschema.sh
    

Alternatively, you can download generated datasets directly from the Zenodo in the directory datasets.

Queries

We designed seven benchmark queries expressed in terms of the global vocabulary. These queries differ regarding the types of vocabulary mapping rules that are relevant to them, and can be used for all three federations. table_queries

You can find all the queries in the queries folder.

Experiments

To execute experiments after setting up a federation, queries can be invoked from the test driver HeFQUIN engine using a shell script. We provide an example script under the federations directory.

sh run_experiment.sh

You can find the latest implementation of the HeFQUIN engine in the Github repository. For the evaluation in this paper, we use compilable JAR files for the HeFQUIN engine, which can be located in the experiments directory.

Results

This repository also contains an analysis of the experiment results, including query execution time and query planning time, in the resultsAnalysis directory. If you are interested in the original log files, all these row data can be downloaded from the experiments directory on Zenodo.

About

Artifacts of the experiments for our paper about considering vocabulary mappings in federated query processing.

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages