BetaScan

Welcome! BetaScan implements the β statistic to detect ancient balancing selection, as described in Siewert & Voight, 2017 and Siewert & Voight, 2020. For in-depth instructions, please read the BetaScan wiki.

Update: 11/8/22: The newest version of BetaScan now uses python3! If you want to continue using the old python2 code, you can use BetaScan_python2.py. Otherwise, use BetaScan.py

Basic Usage

To run BetaScan on an input file named SNPFreqs.txt with default parameters:

python BetaScan.py -i SNPFreqs.txt

If you have a folded site frequency spectrum, you must include the -fold flag to calculate β^(1)*. If your data includes substitutions with an outgroup, you can use the -B2 flag, which calculates β⁽²⁾. However, if you use -B2 you must include an estimate of speciation time. See the Usage page of the wiki for details on how you can estimate.

If you also want to standardize β by its variance, you can do so using the -std flag. This flag must be accompanied by an estimate of the mutation rate using the -theta flag. Once again, see the Usage page of the wiki for details on how you can estimate the mutation rate.

Questions? Comments?

Any feedback or questions are very welcome. You can e-mail Katie at ksiewert@hsph.harvard.edu or post a github issue. We know that programs written by other people can be difficult to use, so we’ve tried our best to make this program simple and intuitive. That being said, bioinformatics is bioinformatics, and issues will arise, so don’t hesitate to contact us!

References

The original Beta statistics are described in Detecting Long-Term Balancing Selection Using Allele Frequency Correlation, MBE 2017.

Recent updates to BetaScan, including the β⁽²⁾ statistic and standardization are now published in BetaScan2: Standardized statistics to detect balancing selection utilizing substitution data, GBE 2020.

1000 Genomes Beta Scores

If you would like the β⁽¹⁾ scores for each population in the 1000 Genomes dataset, they are available here. If you just want to look at the top 1% highest scoring haplotypes in each population, that data is also available here. These scores are based on hg19.

β⁽²⁾ Scores are available for the YRI, CEU and CHB populations are available here. These scores are also based on hg19.

Name		Name	Last commit message	Last commit date
Latest commit History 122 Commits
AFRsubset_fortutorial.txt		AFRsubset_fortutorial.txt
BetaScan.py		BetaScan.py
BetaScan_python2.py		BetaScan_python2.py
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

BetaScan

Basic Usage

Questions? Comments?

References

1000 Genomes Beta Scores

About

Releases

Packages

Contributors 2

Languages

ksiewert/BetaScan

Folders and files

Latest commit

History

Repository files navigation

BetaScan

Basic Usage

Questions? Comments?

References

1000 Genomes Beta Scores

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Languages

Packages