Skip to content

ewoij/generic-uima-pipeline

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

5 Commits
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Generic UIMA pipeline

Simple UIMA pipeline wrap as a console application annotating Sentences, Items and Events. The last ones are easily customisable by updating the current list of terms.

Generated annotations:

  • Sentences
  • Items
    • Annotation
      • genericpipeline.Item
        • property: 'value'(String)
    • Annotator
      • UIMA RUTA MARKTABLE
      • Resources
        • word table taken from: ruta_resources/item_table.csv
          • The first column contains a list of unique items to be tagged in the text.
          • The second column value is assigned to the property 'value'
  • Events
    • Annotation
      • genericpipeline.Event
    • Annotator:
      • UIMA RUTA MARKFAST
      • Resources
        • word list taken from: ruta_resources/event_list.txt

Input

  • Directory containing text files

Output

  • XMI Files

Usage

  1. Download and extract the application: https://github.com/ewoij/generic-uima-pipeline/releases/tag/v1.0.0
  2. Update if necessary the following files with your own domain terms:
    • ruta_resources/item_table.csv
    • ruta_resources/event_list.txt
  3. Run from the current directory
    java -Xms4000m -jar generic-uima-pipeline-1.0.0-standalone.jar <input folder containing text files> <output folder>
    

Resources

About

No description, website, or topics provided.

Resources

License

Stars

Watchers

Forks

Packages

No packages published

Languages