Gene Co-Occurrence Across Phylogeny

Correlogy v0.1a

Gene Co-Occurrence Across Phylogeny. See developer doc:

How to run

Choose BASH or an IDE (e.g. Spyder):
  1. BASH: Must set 'use_IDE' variable to 'False' in

    • minimal example: python3 -i input_gene_list
    • full example: some text
    • usage: correlog [-h] -i INPUT [-o OUTPUT] [--evalue EVALUE] [--entrez ENTREZ] [-s SKIPBLAST]
    • get help, list all flag defaults: python3 -h
  2. IDE: Must set 'use_IDE' variable to 'True' in

    • Hardcode the flag settings in --> parse_input(), in the arg_dict dictionary
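
As a rough illustration of the two run modes above, the sketch below builds an argparse parser whose usage line matches the one shown (-i required, the other flags optional) and bypasses it when use_IDE is True. The parse_input() and arg_dict names come from this README; build_parser(), the dest names, the help strings, and every default value are assumptions for illustration, not the project's actual code.

```python
import argparse


def build_parser():
    """Build a parser whose usage mirrors the README's usage line."""
    parser = argparse.ArgumentParser(prog="correlog")
    # Flag names are taken from the usage line; defaults below are placeholders.
    parser.add_argument("-i", dest="input", required=True,
                        help="input gene list")
    parser.add_argument("-o", dest="output", default="output",
                        help="output directory (assumed default)")
    parser.add_argument("--evalue", type=float, default=1e-5,
                        help="BLAST E-value cutoff (assumed default)")
    parser.add_argument("--entrez", default=None,
                        help="Entrez query restricting the search (assumed)")
    parser.add_argument("-s", dest="skipblast", default="False",
                        help="skip the BLAST query (assumed default)")
    return parser


def parse_input(argv=None, use_IDE=False, arg_dict=None):
    """Return the settings as a dict, from the command line or from arg_dict."""
    if use_IDE:
        # IDE mode: ignore the command line and use the hardcoded settings.
        return dict(arg_dict or {})
    # BASH mode: argv=None makes argparse read sys.argv[1:].
    return vars(build_parser().parse_args(argv))
```

In IDE mode the call would look like parse_input(use_IDE=True, arg_dict={"input": "input_gene_list"}); in BASH mode the same dictionary is filled from the flags instead.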

Known bugs

  1. NCBI BLAST queries can sometimes time out, leaving an empty XML file in 01_BLAST_results. The workaround is to use the '-s' flag to skip the BLAST query and put your own BLAST results in XML format (one file per query) into that folder.

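  2. --> create_pa() flattens pa_df via 'pa_df_rows = pd.unique(merged_df[pa_df_columns].values.ravel()).tolist()'. This returns some hits as "NaN". The cause is not confirmed, but it is probably related to the flattening: any missing cell in merged_df survives ravel(), and pd.unique() keeps NaN as a value.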
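
A small reproduction of the second bug, assuming the NaN entries come from empty cells in merged_df (for example, queries with different numbers of hits). The column names and species are made up for illustration; only the flattening expression is taken from this README. Dropping missing values before taking the unique set is one possible fix, not necessarily the one the project will adopt.

```python
import pandas as pd

# Hypothetical merged_df: query B has one hit fewer than query A,
# so its last cell is missing.
merged_df = pd.DataFrame({
    "query_A_hits": ["Homo sapiens", "Mus musculus", "Danio rerio"],
    "query_B_hits": ["Homo sapiens", "Danio rerio", float("nan")],
})
pa_df_columns = ["query_A_hits", "query_B_hits"]

# Flattening as in create_pa(): the missing cell is kept as a "hit".
flat = merged_df[pa_df_columns].values.ravel()
rows_with_nan = pd.unique(flat).tolist()

# Possible fix: drop missing values before collecting the unique hits.
pa_df_rows = pd.Series(flat).dropna().unique().tolist()
```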

To do

in --> create_folders()

  1. validate input file exists (otherwise throw error)
  2. validate output directory exists (otherwise throw error)
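
The two checks could be sketched as below. validate_paths() is a hypothetical helper, not an existing function in this project, and its argument names are assumptions; it would be called at the start of create_folders().

```python
from pathlib import Path


def validate_paths(input_file, output_dir):
    """Raise an error if the input file or the output directory is missing."""
    input_path = Path(input_file)
    output_path = Path(output_dir)

    # 1. The input gene list must exist and be a regular file.
    if not input_path.is_file():
        raise FileNotFoundError(f"Input file not found: {input_path}")

    # 2. The output directory must already exist.
    if not output_path.is_dir():
        raise NotADirectoryError(f"Output directory not found: {output_path}")

    return input_path, output_path
```

Raising built-in exceptions keeps the failure visible both from BASH (traceback and non-zero exit) and from an IDE console.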