Genome Annotator

This is a simple example package. You can use Github-flavored Markdown to write your content.

Start Here :-)

So installation here is a little robust due to the nature of how many programs are used. In order for everything to work smoothly, there are 3 main considerations:
First is the virtual environment you will be in while executing the pipeline. You want to make sure you are in a virtual environment so all the dependencies can play nice.
Second is the programs you install. You will need access to a lot of different programs, which should be housed within your virtual environment.
Third is the databases needed to annotate your data. Some programs have built in data permutations, but many require outside databases in order to compare your data to external data.

The next thing to consider is how to run the program. There are 3 snakemake files: Download.smk, Filter.smk, and Annotate.smk. Each can be assess with the command: $ genotator [download | filter | annotate] ...params. So you choose either the download, filter, or annotate command. Note that everything is chained together such as download --> filter --> annotate, so if you run filter, you're also running download. If you run annotate, you're also running download and filter. See below for detailed instructions on how to run the program.

Installation

Part 1: Create an isolated virtual environment using conda. See conda documentation for more details on conda.

conda create -y --name gen-annotator python=3.10
source activate /path/to/environment/gen-annotator
conda install -y -c conda-forge mamba

Download all of the dependencies

Part 2: Install all the dependencies. Sorry there are so many 😧

Start by downloading mamba, which is how I prefer to install packages when I can (over conda). Almost always mamba is a lot faster at resolving dependencies and the whole installation process. See mamba documentation for more information.

conda install -y -c conda-forge mamba

mamba install -y -c conda-forge -c bioconda python=3.10 \
        sqlite prodigal idba mcl muscle=3.8.1551 famsa hmmer diamond \
        blast megahit spades bowtie2 bwa graphviz "samtools>=1.9" \
        trimal iqtree trnascan-se fasttree vmatch r-base r-tidyverse \
        r-optparse r-stringi r-magrittr bioconductor-qvalue meme ghostscript
mamba install -y -c bioconda fastani
conda install -c bioconda dbcan
pip install checkm-genome

# Grab the latest anvio version
curl -L https://github.com/merenlab/anvio/releases/download/v8/anvio-8.tar.gz --output anvio-8.tar.gz
pip install anvio-8.tar.gz
# pip install fastani (deprecated)

Below is where we setup some databases through the Anvi'o suite.

- anvi-setup-pfams
  https://github.com/merenlab/anvio/blob/master/anvio/pfam.py#L55
- OR, I can download it directly for them into the lib

- anvi-setup-ncbi-cogs (Run_COGs)

- anvi-setup-kegg-kofams (Run_Kegg)

Download NCBI's datasets

conda install -y -c conda-forge ncbi-datasets-cli

Now for some miscellaneous installs

Note that RAST may be finnicky

pip install rast

##Common Issues

CazYME:

If the genome is too small and nothing was annotated, it gives an error!

Probably have some sort of: $ genome-annotator --setup-dbs
Line

Workflow for users:

Install the conda package ($ conda install -c bioconda genome-annotator)
Run the database configuration script ($ annotator --init-dbs {all,cazyme,cogs,kegg,pfam,rast,tigr

#User Guide Note: Any command starting with $ is used to denote being in a terminal (on the command line), and the $ marks the prompt.

Part 1. Setting up the directories

Clone the repository: $ git clonehttps://github.com/ddeemerpurdue/genome-annotator.git
Enter the new directory: $ cd genome-annotator
Create the fresh conda environment: SEE INSTALLATION
Download and initialize databases: SEE INSTALLATION Link to Installation
Add genotator.py to path: export $PATH=$PATH:/path/to/genome-annotator/bin/
Pass

Name		Name	Last commit message	Last commit date
Latest commit History 7 Commits
bin		bin
genotator		genotator
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
bld.bat		bld.bat
build.sh		build.sh
meta.yaml		meta.yaml
pyproject.toml		pyproject.toml
setup.py		setup.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Genome Annotator

Start Here :-)

Installation

Download all of the dependencies

Download NCBI's datasets

Now for some miscellaneous installs

CazYME:

Workflow for users:

Part 1. Setting up the directories

About

Uh oh!

Releases

Packages

Uh oh!

Languages

License

wintermutant/genotator

Folders and files

Latest commit

History

Repository files navigation

Genome Annotator

Start Here :-)

Installation

Download all of the dependencies

Download NCBI's datasets

Now for some miscellaneous installs

CazYME:

Workflow for users:

Part 1. Setting up the directories

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Languages

Packages