wimuQ

Over the last years, the Web of Data has grown significantly. Various interfaces such as LOD Stats, LOD Laudromat, SPARQL endpoints provide access to the hundered of thousands of RDF datasets, representing billions of facts. These datasets are available in different formats (e.g., raw data dumps, HDT files) or directly accessible via SPARQL endpoints. Querying such large amount of distributed data is particularly challenging. In addition, many of these datasets are available as raw data dumps or HDT files and cannot be directly queried using the SPARQL query language. To tackle these problems, we present WimuQ, an approach to execute SPARQL queries over large amount of heterogeneous RDF data sources. At present, WimuQ is able to execute both federated and non-federated SPARQL queries over a total of 668,166 datasets from LOD Stats and LOD Laudromat as well as 559 active SPARQL endpoints. These data sources represent a total of 221.7 billion triples from more than 5 terabytes of information from datasets retrieved using the service 'Where is My URI' (WIMU). Our evaluation on state-of-the-art real-data benchmarks shows that WimuQ brings at least three times more results than previous approaches.

Experiments: nohup java -Xmx10G -jar wimuT.jar queries.txt <TYPE> &

(wimuT.jar)[https://doi.org/10.6084/m9.figshare.7117052]

Where <TYPE> can be:

wimut -> To execute only wimuT
squin -> To execute only SQUIN
lodalot -> To execute only SPARQLaLOT
all -> To execute wimuT + SQUIN + SPARQLatLOT

Measuring Memmory, CPU and Disk consumption:

python prodimem.py <PID> 60 > prodimem.log 2>&1 &

Stable version of all Source code, experiments and web version: (StableVersionV1)[https://doi.org/10.6084/m9.figshare.7370945]

A prototype is available here (here)[https://w3id.org/wimuq/]

About the code: The main class is 'src/org/wimu/datasetselection/parallelv1/MainParallelv1.java'

(Paper accepted at K-CAP 2019)[https://doi.org/10.1145/3360901.3364436].

@inproceedings{valdestilhas2019more,
  title={More Complete Resultset Retrieval from Large Heterogeneous RDF Sources},
  author={Valdestilhas, Andr{\'e} and Soru, Tommaso and Saleem, Muhammad},
  booktitle={Proceedings of the 10th International Conference on Knowledge Capture},
  pages={223--230},
  year={2019}
}

Name		Name	Last commit message	Last commit date
Latest commit History 86 Commits
WebContent		WebContent
dist/lib		dist/lib
src/org/wimu/datasetselection/parallelv1		src/org/wimu/datasetselection/parallelv1
56_queries_SPARQL-A-LOT.txt		56_queries_SPARQL-A-LOT.txt
Dataset_code.tsv		Dataset_code.tsv
Endpoints_numtriples_lodcloud.csv		Endpoints_numtriples_lodcloud.csv
ExampleClusterKnnPropertyOccurrences.txt		ExampleClusterKnnPropertyOccurrences.txt
ExampleRicardo.java		ExampleRicardo.java
Feasible_350_queries.txt		Feasible_350_queries.txt
FedBench.txt		FedBench.txt
FedBench_25_queries.txt		FedBench_25_queries.txt
LICENSE		LICENSE
LargeRDFBench_40_queries.txt		LargeRDFBench_40_queries.txt
Property_code.tsv		Property_code.tsv
README.md		README.md
SQUIN-0.1.4.war		SQUIN-0.1.4.war
TestQueriesCluster.txt		TestQueriesCluster.txt
WimuTDesktop_src.zip		WimuTDesktop_src.zip
build.properties		build.properties
build.xml		build.xml
convertHDTLaundromat.sh		convertHDTLaundromat.sh
dense_ClusterKMeans.csv		dense_ClusterKMeans.csv
dense_ClusterKMeans.tsv		dense_ClusterKMeans.tsv
idQuery662.txt		idQuery662.txt
prodimem.py		prodimem.py
queries.txt		queries.txt
queriesLocation.txt		queriesLocation.txt
sparqlAlotQ1.txt		sparqlAlotQ1.txt
sparse_ClusterKMeans.csv		sparse_ClusterKMeans.csv
sparse_ClusterKMeans.tsv		sparse_ClusterKMeans.tsv
squinSPARQLExample.txt		squinSPARQLExample.txt
top5Datasets.txt		top5Datasets.txt
top5DatasetsResults.txt		top5DatasetsResults.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

wimuQ

About

Releases

Packages

Languages

License

firmao/wimuT

Folders and files

Latest commit

History

Repository files navigation

wimuQ

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages