GitHub

GIsaid Subsampling Toolkit

Just a set of scripts organized as a toolkit for conducting sub-sample analyses based on the Augur tool, focused on Brazilian states.

Install

Requirements: python 3.9+

git clone https://github.com/dezordiPhD/gist.git
git add gist
conda env create -f env/gist_ubuntu.yml
conda activate gist
pip install .

## check installation
gist --help

## clone ncov nextstrain repository
git clone https://github.com/nextstrain/ncov.git

Mac users should install ncbi+blast

Run

get-states

This mode get subsampling data based on specific lineages on specific brazilian states and other countries. The input json file should be configure as the template present on templates/get_by_states.json

gist get-states --ncov_dir ncov --sequences <gisaid_genomes.tar.xz> --metadata <gisaid_metadata.tar.xz> --threads <number_of_threads> templates/get_by_states.json

get-genomes

Get gisaid genomes based on blast analysis. The input json file should be configured as the template present on templates/get_similar_genomes.json

gist get-genomes --input <query_genomes.fasta> --sequences <gisaid_genomes.fasta> --metadata <gisaid_metadata.tsv> templates/get_similar_genomes.json

get-algn

Perform a mafft add --keeplength alignment and - if passed mask_pos - mask alignment positions

gist get-algn --input sequences.fa --reference reference.fa --threads 8 --mask_pos templates/mask_pos.tsv

Name		Name	Last commit message	Last commit date
Latest commit History 24 Commits
env		env
gist		gist
templates		templates
.gitignore		.gitignore
README.md		README.md
setup.py		setup.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

GIsaid Subsampling Toolkit

Install

Run

get-states

get-genomes

get-algn

About

Releases

Packages

Languages

dezordi/gist

Folders and files

Latest commit

History

Repository files navigation

GIsaid Subsampling Toolkit

Install

Run

get-states

get-genomes

get-algn

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages