Skip to content


Switch branches/tags

Latest commit


Git stats


Failed to load latest commit information.
Latest commit message
Commit time

Bayesian Hypothesis Assessment

A Bayesian hypothesis assessment framework, tailored to comparing pairs of experiments. This package is preliminary implementation statistical analyses described in this paper. This can also be seen as a complimentary to
testSignificanceNLP package which contains popular Frequentist tests.

We focus on the setup where the performances of two algorithms, applied on a certain dataset, are provided based on an evaluation measure. The comparison is done by studying the posterior probabilities (rather than binary commonly-used binary decisions in other tests.)

For further discussion of the approaches implemented here (and their comparison to other techniques), refer to the paper or the extended manual.

Getting started


For running this code, you will need Python (version >=3.6). You have two options for installation:

  • If you don't want to modify any code, you can just install the package and use it in commandline:
 > pip install HyBayes
  • Alternatively, if you'd like to make modifications to the code (e.g., the underlying model) you can clone this project. Before running the code, make sure that you have all the requirements by running the following line to install all the necessary dependencies:
 > pip install -r requirements.txt

In case you have issue with installing GraphViz, consider using conda installer:

 > conda install -c anaconda graphviz --yes

If you are using MacOS, brew command can used:

 > brew install graphviz

Preparing configuration file

To analyze your data, you need to prepare the a configuration file that specifies the information needed for the analysis (e.g., the address to your observation files, the type of plots and files you want to be stored).

You can find more details on this in the extended manual. Additionally, there are examples of complete configurations in configs folder, after running

Running the Analysis

Let's assume that you have followed the above instructions and:

  • (1) you have included the performances observations in two separate files,
  • (2) and you have prepared a config file.

To run the analysis, you pass the name of the config file as argument:

 > python -m HyBayes --config my_config_file.ini --verbose

When the flag --verbose is on, the details of the analysis will be printed in standard output. This flag does not affect the log file that is stored along side other outputs.

And here is the general usage template which can be accessed using --help flag at any time:

 > python -m HyBayes --help
usage: [-h] [-c CONFIG] [-v] [--make_configs] [--make_data]

Run Bayesian Statistics Tailored towardsanalysing the experiment results
specially in NLP area.Email for comments.

optional arguments:
  -h, --help            show this help message and exit
  -c CONFIG, --config CONFIG
                        address of Config file
  -v, --verbose         prints the report of the steps
  --make_configs        if on, example configuration files will be made.
  --make_data           if on, example data files will be made.


To demonstrate everything with a quick example, run the following command to generate toy data:

 > python -m HyBayes --make_data

You can see that some artificial data is made in directory artificial_data. Moreover either use the config files provided in the repository [], or run:

 > python -m HyBayes --make_configs

And then execute the analysis on any of the newly made config files. For example:

 > python -m HyBayes --config configs/config_metric.ini --verbose

Further Reading

If you want to learn more about this package, please refer to the extended manual. If you want to learn about the concepts discusssed here, refer to the paper below.


This code is published under Apache 2.0 license.


If you used this package in your research feel free to cite the following paper:

  title={Not All Claims are Created Equal:Choosing the Right Statistical Approach to Assess Hypotheses},
  author={{Sadeqi Azer}, Erfan and Khashabi, Daniel and Sabharwal, Ashish and Roth, Dan},
  booktitle={Annual Meeting of the Association for Computational Linguistics (ACL)},


Bayesian Assessment of Hypotheses







No packages published