Distributed, out-of-core corpus building, querying, and modeling
Switch branches/tags
Nothing to show
Clone or download
Fetching latest commit…
Cannot retrieve the latest commit at this time.
Permalink
Type Name Latest commit message Commit time
Failed to load latest commit information.
docs
etc
redicorpus
test
.coveragerc
.gitignore
.travis.yml
LICENSE
README.md
requirements.txt
setup.py

README.md

Redicorpus -- the distributed, out of core, real-time solution for building and querying linguistic data

Build Status codecov.io Documentation Status

In development -- unstable

Description

Redicorpus builds linguistic corpora in real-time to give you temporal resultion on the order of a single day, instead of years or decades.

Its database and computing tasks are distributed in parallel, which makes it fault-tolerant and easy to scale out.

Frequently used intermediate data are computed in advance, which reduces the latency for common queries.