Create Mapping Statistics
Daniel Fleischhacker edited this page May 23, 2014 · 5 revisions
This guide describes how to generate the mapping statistics displayed at http://mappings.dbpedia.org/server/statistics/
- Update extraction framework to newest version from GitHub
- Make sure the newest versions of all modules are compiled and installed locally (install-run)
- Download most recent ontology from mapping wiki
cd core
../run download-ontology
- Commit new ontology version
- Download most recent mappings from mapping wiki
cd ../core
../run download-mappings
- Commit new mappings
- Download most current Wikipedia dumps
cd ../dump
- Choose one of the download.*.properties based on set of relevant Wikipedia language versions
- Adapt download path in property file
../run download config=download.*.properties
- Start extraction limited to data required for mapping statistics
cd ../dump
- Adapt "base-dir" in extraction.stats.properties to the download directory
- Adapt the "source" parameter: the default is .xml, NOT .xml.bz2, even though it is stated differently
../run stats-extraction extraction.stats.properties
- Start statistics extraction
cd ../server
- Adapt base dir in pom.xml for launcher "stats" to download directory used in previous step
../run stats
- In case you want to run the statistics server on a different system than the one you created the statistics on, copy the mappingstats_* files from folder server/main/src/statistics/ on the generation server to the same folder on the hosting server
- Start statistics server
cd ../server
- Adapt server URI in pom.xml for launcher "server"
- If you want to prefer IPv4 on a machine that also supports IPv6, define the environment variable _JAVA_OPTIONS first:
export _JAVA_OPTIONS='-Djava.net.preferIPv4Stack=true'
../run server
- The server is now available at the defined URI
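
Several of the steps above ask you to adapt a parameter such as "base-dir" or "source" in a properties file by hand. The following is a minimal sketch of how that edit could be scripted. It assumes the file uses plain key=value lines; the function name set_property is hypothetical and not part of the extraction framework.

```shell
#!/bin/sh
# Hypothetical helper (NOT part of the extraction framework):
# set KEY=VALUE in a Java-style .properties file.
# Assumes plain "key=value" lines; an existing assignment is replaced
# in place, otherwise the assignment is appended at the end of the file.
set_property() {
    file=$1
    key=$2
    value=$3
    tmp=$(mktemp) || return 1
    awk -v k="$key" -v v="$value" '
        BEGIN { found = 0 }
        {
            line = $0
            sub(/^[ \t]*/, "", line)   # ignore leading whitespace
            # The line assigns KEY if it starts with KEY followed by "="
            if (index(line, k) == 1 && substr(line, length(k) + 1) ~ /^[ \t]*=/) {
                print k "=" v
                found = 1
            } else {
                print $0               # keep comments and other keys untouched
            }
        }
        END { if (!found) print k "=" v }
    ' "$file" > "$tmp" && mv "$tmp" "$file"
}
```

With this function defined, the "base-dir" adaptation could look like `set_property extraction.stats.properties base-dir /path/to/downloads`, where the path is a placeholder for your own download directory. Commented-out lines and keys that merely share a prefix with the given key are left unchanged.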