Skip to content
@trec-kba

TREC KBA & StreamCorpus

common data interchange format for document processing pipelines that apply natural language processing tools to large streams of text

Popular repositories

  1. streamcorpus streamcorpus Public

    common data interchange format for document processing pipelines that apply natural language processing tools to large streams of text

    Scala 34 19

  2. many-stop-words many-stop-words Public

    stop word lists in several languages

    Python 21 22

  3. streamcorpus-pipeline streamcorpus-pipeline Public

    framework for making streamcorpus data

    HTML 11 4

  4. kba-corpus kba-corpus Public

    Tools for working with TREC KBA Corpora

    Python 5 4

  5. kba-tools kba-tools Public

    Tools for working with TREC KBA entities, training data, and run submissions

    Python 5 2

  6. kba-stanford-corenlp kba-stanford-corenlp Public

    Wrappers for generating one-word-per-line output representing all the goodies from Stanford CoreNLP, so we can include it in the KBA stream corpus.

    Java 4

Repositories

Showing 10 of 14 repositories

Top languages

Loading…

Most used topics

Loading…