Skip to content
roberto zanoli edited this page Feb 12, 2015 · 1 revision

Table of Contents

BIUTEE Pipeline

BIUTEE runs in three main phases:

  1. Preprocessing (LAP)
  2. Training
  3. Testing
Given two XML files in RTE format, one for train and one for test, each needs to be preprocessed separately before training/testing. This converts the original text into an annotated dependency tree, and stores it in an internal format. This is useful, since one may want to experiment with different configurations of BIUTEE without running the entire LAP over and over again.

BIUTEE can then train a model from the preprocessed training data. This model is then used by BIUTEE during inference (the testing phase). It is important to note that BIUTEE can be tested in a certain configuration only when using a model that was trained under that exact configuration (same knowledge resources, optimization, etc).

BIUTEE also has an additional running mode - the visual tracing tool.

Running BIUTEE from the Shell

In Linux/Unix, BIUTEE can be run from the command line using run_biutee.sh from $BIUTEE/workdir. Running with -h reveals the modes and options:

 run_biutee.sh [OPTIONS] MODE
 
 Available modes:  lap_train
                   train
                   lap_train,train
                   lap_test
                   test
                   lap_test,test
                   visual
 
 -i <filename>     raw input file (rte pairs format)
 -s <filename>     serialized input file (saves/loads the preprocessed pairs after LAP)
 -m <dir>          models directory
 -o <filename>     results file (only in test modes)
 -r <list>         list of resources, separated by commas (no spaces)
 -t <int>          number of threads
 -e <port>         easyfirst parser's port
 -f                optimize F1 (instead of accuracy)
 -g                gap mode

For first-time use, we recommend running lap_train,train followed by lap_test,test. See how to upgrade BIUTEE and BIUTEE's knowledge resources for more information on -r.

Pre-trained Models

BIUTEE comes along with a few pre-trained models. They are all located under $BIUTEE/workdir/models/. The following table describes how each one was trained.

Model Training Data Resources
basic RTE3 Dev Set WORDNET,CATVAR,SYNTACTIC
basic_bap RTE3 Dev Set WORDNET,CATVAR,SYNTACTIC,REDIS_BAP

More Options

Detailed Guide on Running BIUTEE

Running BIUTEE's visual tracing tool

Clone this wiki locally