A Python CLI and module for calculating per-base sequencing complexity.
NOTE: This project is pre-alpha, all of the badge links are broken and are just placeholders at the moment. Development is ongoing. But feel free to clone the repository and play with the code for yourself!
The Next Generation Sequencing Complexity Index (NGS-CI) is a Python command-line tool designed for bioinformatics applications. It creates a complexity score on a scale of 0-100 that represents the theoretical percent of possible complexity of reads aligned to a given base.
The principle goal of the library is to generate a single per-base metric to summarize sequencing complexity as a percentage of the total possible unique reads aligned to a given base.
OS X and Linux release:
pip install ngsci
Development installation:
git clone https://github.com/MatthewRalston/ngsci.git
cd ngsci
pip install -r requirements.txt
pip install -r requirements-dev.txt
PYTHONPATH=$(pwd):$PYTHONPATH
CLI Usage
./bin/ngsci --help
./bin/ngsci -vv input.bam > output.txt
Check out the Readthedocs documentation, with examples and descriptions of the module usage.
pytest --cov=ngsci
Created by Matthew Ralston - Scientist, Programmer, Musician - Email
Distributed under the GPL v3.0 license. See LICENSE.txt
for the copy distributed with this project. Open source software is not for everyone, but for those of us starting out and trying to put the ecosystem ahead of ego, we march into the information age with this ethos.
- Fork it (https://github.com/MatthewRalston/ngs-ci/fork)
- Create your feature branch (
git checkout -b feature/fooBar
) - Commit your changes (
git commit -am 'Add some fooBar'
) - Push to the branch (
git push origin feature/fooBar
) - Create a new Pull Request