PolyglotDB is a Python package for storing and querying large speech corpora. It constructs various kinds of database, and has a consistent Python API for interacting with the various underlying databases. The online documentation is available at http://polyglotdb.readthedocs.io/en/latest/.
This package is intended for developers and those experienced with scripting in Python. If you would like to use a graphical interface for querying and interacting with PolyglotDB databases, please see Speech Corpus Tools (http://speech-corpus-tools.readthedocs.io/en/latest/).
To install or develop:
- Download or clone the repository
- Install Python requirements (
pip install -r requirements.txt -U
) - Install Neo4j and set it up (see http://speech-corpus-tools.readthedocs.io/en/latest/tutorial/tutorial.html#installation-tutorial)
- Install via
python setup.py install