Utilities for manipulating text corpora
corputils
.project
.pydevproject
README.md
cooccurrence_count.py
corpus2words.sh
create_config.py
dp2dot.py
dpgrep.py
example_config.yml
find_contentful_head_of_nouns.py
kyototycoon.py
kyototycoon_ext.lua
parallel_count.py
pkl2sm.py
print_cooccurrences.py
split_texts.py
trim_sentence.py
utils.py


corputils

Utilities for manipulating text corpora

Simple usage:

./print_cooccurrences.py bnc.xml | ./cooccurrence_count.py -o output

Run ./print_cooccurrences.py -h for a help message.
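
To give a rough idea of what the counting stage of such a pipeline does, here is a minimal sketch in Python. It is not the code in cooccurrence_count.py: the function name count_cooccurrences and the input format (one whitespace-separated pair of words per line) are assumptions made for illustration only, and the real scripts may use a different format and different options.

```python
from collections import Counter


def count_cooccurrences(lines):
    """Count how often each (target, context) pair appears.

    ASSUMPTION: each input line holds exactly two whitespace-separated
    tokens. This format is hypothetical and is not taken from the
    actual output of print_cooccurrences.py.
    """
    counts = Counter()
    for line in lines:
        fields = line.split()
        if len(fields) != 2:
            continue  # skip blank or malformed lines
        counts[tuple(fields)] += 1
    return counts


# Small in-memory sample standing in for the piped input.
sample = ["dog bark", "dog bark", "cat purr", ""]
counts = count_cooccurrences(sample)

# Print pairs from most to least frequent, tab-separated.
for (target, context), n in counts.most_common():
    print(f"{target}\t{context}\t{n}")
```

In a real pipeline the lines would come from standard input rather than a list, so passing sys.stdin to the same function would be the natural next step.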