This repository contains the Almond preprocessor and tokenizer. It is derived from SEMPRE, a semantic parser developed by prof. Percy Liang and his students, but it has diverged significantly.
Download dependencies:
./pull-dependencies.sh
Build the core:
JAVAHOME=<path to java> ant
JAVAHOME should be set to the path to your Java installation, eg. /usr/lib/jvm/openjdk-1.8.0.
Java 1.8 is required. A working C compiler must also be installed.
./run.sh
The tokenizer listens on port 8888 by default.
An example systemd unit service is provided as almond-tokenizer.service.
The service supports socket activation as well.