An in-memory, work-in-progress java api for wiktionary.org.
Right now the only feature is that you can check if a (german) word is a noun (based on a dump of de.wiktionary.org).
-
Add the artifact to your classpath (right now
git clone ...
,mvn install
, add maven dep etc.) -
Get the dump from http://dumps.wikimedia.org/dewiktionary/latest/dewiktionary-latest-pages-articles.xml.bz2
-
Unpack the dump (
bzip2 -d dewiktionary-latest-pages-articles.xml.bz2
) -
Create a new WiktionaryLoader instance, passing the unpacked file:
final WiktionaryLoader loader = new WiktionaryLoader(new File("dewiktionary-latest-pages-articles.xml"));
-
From now on you can use the loaded wiktionary dump to check words:
System.out.println("Is noun" + loader.isNoun("Januar"));