Reading the data from OPIEC - an Open Information Extraction corpus
-
Updated
Jun 12, 2019 - Java
Reading the data from OPIEC - an Open Information Extraction corpus
Wikipedia-based Explicit Semantic Analysis, as described by Gabrilovich and Markovitch
A simple utility to index wikipedia dumps using Lucene.
Java tool to Wikimedia dumps into Java Article pojos for test or fake data.
WikimediaDumpExtractor extracts pages from Wikimedia/Wikipedia database backup dumps.
Index and Search wikiDump
Imports a Wikipedia xml dump into a Postgres database
Map/Reduce jobs for extracting data from the English language Wikipedia dump
Add a description, image, and links to the wikipedia-dump topic page so that developers can more easily learn about it.
To associate your repository with the wikipedia-dump topic, visit your repo's landing page and select "manage topics."