This repository has been archived by the owner on Mar 21, 2023. It is now read-only.

juanmirocks/geniatagger

My fork of the original GENIA Tagger

The main purpose of this repository is to serve as a reference for my Java port. The original source code (v3.0.1) is available under the tag original.

For info about the program, see the original README.

A Few Improvements

  • The geniatagger binary built from the original source can only be run from its own directory, because it refers to some of its files through static, local paths. You can use the simple bash script apply-geniatagger to work around this.
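
The general idea behind such a wrapper can be sketched as follows. This is a minimal illustration, not the actual contents of apply-geniatagger: the function name run_from_dir and the variable GENIATAGGER_HOME are hypothetical names chosen for the example.

```bash
#!/usr/bin/env bash
# Hypothetical sketch of a "run from its own directory" wrapper.
# This is NOT the actual apply-geniatagger script.

# Run a command from inside a given directory.
# The cd happens in a subshell, so the caller's working directory is unchanged.
run_from_dir() {
  local dir="$1"
  shift
  ( cd "$dir" && "$@" )
}

# Example call (GENIATAGGER_HOME is an assumed variable pointing at the
# directory that contains the geniatagger binary and its data files):
#   run_from_dir "$GENIATAGGER_HOME" ./geniatagger < input.txt > output.txt
```

Redirections such as `< input.txt` are opened by the calling shell before the directory change, so relative paths used there still resolve against your current directory. File names passed as command arguments, by contrast, would be resolved inside the tagger's directory, so absolute paths are safer for those.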

Known Issues

  • On every run, the program has to load some large dictionary files, which can take quite a long time (~15 s on a modern machine). The program should be rewritten to be more library-oriented so that it can keep the loaded dictionaries in memory. If you run on the JVM, you can use my Java port.