Builds Wikipedia corpora in I5 (a TEI-based format)
Updated Jun 21, 2022 - Java
A desktop application that searches through a set of Wikipedia articles using Apache Lucene.
Reads data from OPIEC, an Open Information Extraction corpus.