inab/docker-textmining-tools

docker-text-mining-tools

This is the collection of inab components for text-mining tasks:

  • ocfmypdf: Converts scanned PDFs to readable PDFs.
  • grobid: PDF parser; detects structures such as sections, paragraphs, and titles.
  • dnorm-gate-wrapper: Disease tagger.
  • linnaeus-gate-wrapper: Species tagger.
  • metamap-gate-wrapper: MetaMap (UMLS) tagger.