Modernized version of Eric Brill's Part Of Speech tagger.
Switch branches/tags
Nothing to show
Clone or download
Fetching latest commit…
Cannot retrieve the latest commit at this time.
Failed to load latest commit information.

#taggerXML# Modernized version of Eric Brill's Part Of Speech tagger. The original code is converted to C++. The program can handle XML input. This program can only tag. For training the tagger with a tagged corpus of your own choice you can use Eric Brill's original software.

You can find a copy of Eric Brill's tagger here:


This file contains linguistic resources for tagging English text as well.


  • Linux:
    1. Download (e.g. git pull) taggerXML, parsesgml, letterfunc and hashmap. If you are going to use the Makefile that comes with taggerXML, locate each of these packages in separate subdirectories under the same directory, and call these subdirectories taggerXML, parsesgml, letterfunc and hashmap, respectively.
    2. Change directory to the 'taggerXML/src' directory.
    3. Run 'make' or 'make taggerXML'. To get rid of object files, run
    4. 'make clean'.

You can use to do all of this automatically.


For running the taggerXML, see Eric Brill's original documentation and

Online availability

taggerXML is demonstrated at CST's website ( and an integrated webservice in the CLARIN-DK infrastructure (

Contact info

For questions and remarks about the program, please feel free to contact us.

Our postal address is:

Center for Sprogteknologi
University of Copenhagen
Njalsgade 140
2300 Copenhagen S.

On the internet, you can visit us at Here you can also try the tagger for Danish and English.