Skip to content
Scalable genomic data analysis.
Python Scala C++ Jupyter Notebook Makefile HTML Other
Branch: master
Clone or download
Type Name Latest commit message Commit time
Failed to load latest commit information.
.github Improve github issue template (#5694) Mar 26, 2019
admin-pod [admin-pod] add admin pod (#7819) Jan 7, 2020
apiserver use nodeSelector, tolerate preemptibles (#7784) Jan 6, 2020
auth Fix user creation (#7813) Jan 7, 2020
batch [batch] shorten worker timeout (#7972) Jan 27, 2020
benchmark [hail] add pc_relate_big benchmark (#7975) Jan 27, 2020
blog use nodeSelector, tolerate preemptibles (#7784) Jan 6, 2020
ci [ci] add zulip notification of failed deploy state (#7889) Jan 17, 2020
datasets [hail] datasets (#6579) Jul 9, 2019
dbuf [dbuf] fix dbuf makefile (#7858) Jan 14, 2020
docker [batch] insertion progress bars (#7876) Jan 18, 2020
etc/pdf-split-shuffle-partition pdf-split-shuffle-partition (#5291) Feb 12, 2019
gateway use nodeSelector, tolerate preemptibles (#7784) Jan 6, 2020
gear [batch] log scheduler queries timing (#7953) Jan 23, 2020
hail [hail][bugfix] Fix divide by zero error in `hl.concordance` (#7976) Jan 27, 2020
image-fetcher removed some unused, out of date files (#7204) Oct 7, 2019
internal-gateway use nodeSelector, tolerate preemptibles (#7784) Jan 6, 2020
letsencrypt use node selector instead of tolerations/node affinity (#7636) Dec 16, 2019
monitoring use nodeSelector, tolerate preemptibles (#7784) Jan 6, 2020
notebook [build] use createDatabase2 (#7705) Jan 7, 2020
pipeline/test [pipeline] wait, open options (#7672) Dec 6, 2019
project Fix init(idempotent=True) / HailContext.getOrCreate (#5872) Apr 20, 2019
router-resolver [router-resolver] use async_get_userinfo (#7859) Jan 13, 2020
router use nodeSelector, tolerate preemptibles (#7784) Jan 6, 2020
scheduler use nodeSelector, tolerate preemptibles (#7784) Jan 6, 2020
scorecard Removing old users (#7804) Jan 6, 2020
site use nodeSelector, tolerate preemptibles (#7784) Jan 6, 2020
ukbb-rg use nodeSelector, tolerate preemptibles (#7784) Jan 6, 2020
vdc [batch2] autoscale front-end deployment (#7287) Oct 15, 2019
web_common [batch] search help for batches, batch jobs (#7777) Dec 27, 2019
.dockerignore rename ci2 => ci (#6193) May 29, 2019
.gitignore [git] Ignore all .iml files (#7526) Nov 15, 2019
.zenodo.json Add .zenodo.json file (#5949) Apr 25, 2019
AUTHORS Added LICENSE and AUTHORS. Apr 21, 2016
LICENSE Fix intellij replace errors Jun 16, 2016
Makefile [batch] rename batch2 => batch (#7664) Dec 6, 2019 Fix Zuckerberg typo (#7382) Oct 25, 2019
build.yaml [batch] improve cancel (#7933) Jan 23, 2020 add project dependencies (#5464) Feb 28, 2019
pylintrc [batch2] delist jobs after 3m or half run duration (#7605) Nov 24, 2019
setup.cfg hailctl deploy dev steps option (#6915) Aug 22, 2019



Hail is an open-source, general-purpose, Python-based data analysis tool with additional data types and methods for working with genomic data.

Hail is built to scale and has first-class support for multi-dimensional structured data, like the genomic data in a genome-wide association study (GWAS).

Hail is exposed as a Python library, using primitives for distributed queries and linear algebra implemented in Scala, Spark, and increasingly C++.

See the documentation for more info on using Hail.


Hail has been widely adopted in academia and industry, including as the analysis platform for the genome aggregation database and UK Biobank rapid GWAS. Learn more about Hail-powered science.


If you'd like to discuss or contribute to the development of methods or infrastructure, please:

Hail uses a continuous deployment approach to software development, which means we frequently add new features. We update users about changes to Hail via the Discussion Forum. We recommend creating an account on the Discussion Forum so that you can subscribe to these updates as well.


Hail is maintained by a team in the Neale lab at the Stanley Center for Psychiatric Research of the Broad Institute of MIT and Harvard and the Analytic and Translational Genetics Unit of Massachusetts General Hospital.

Contact the Hail team at

Citing Hail

If you use Hail for published work, please cite the software. You can get a citation for the version of Hail you installed by executing:

import hail as hl

Which will look like:

Hail Team. Hail 0.2.13-81ab564db2b4.

The Hail team has several sources of funding at the Broad Institute:

  • The Stanley Center for Psychiatric Research, which together with Neale Lab has provided an incredibly supportive and stimulating home.
  • Principal Investigators Benjamin Neale and Daniel MacArthur, whose scientific leadership has been essential for solving the right problems.
  • Jeremy Wertheimer, whose strategic advice and generous philanthropy have been essential for growing the impact of Hail.

We are grateful for generous support from:

  • The National Institute of Diabetes and Digestive and Kidney Diseases
  • The National Institute of Mental Health
  • The National Human Genome Research Institute
  • The Chan Zuckerberg Initiative

We would like to thank Zulip for supporting open-source by providing free hosting, and YourKit, LLC for generously providing free licenses for YourKit Java Profiler for open-source development.

You can’t perform that action at this time.