Skip to content

zhangy72/PseudoDomain

Repository files navigation

Version

This software was last updated 9/17/2014. Questions, comments, suggestions, etc?

Send emails to nick.zhangyuan@gmail.com

Installation

  1. Clone the repository:

git clone git@github.com:zhangy72/PseudoDomain.git

  1. g++ compiler is required in your Unix/Linux system. To install component bin files of PseudoDomain, run the Makeme file using the following command:
    make

  2. Make sure all .sh files and hmmframe are executable in your environment. If not, you can use the command:

chmod 755 *.sh
chmod 755 hmmframe
  1. Python is required in your Unix/Linux system and in the path.

Run PseudoDomain

To run PseudoDomain pipeline, enter the folder of PseudoDomain and use the following command:

./PseudoDomain.sh -m <Pfam HMM file> -f <fasta file> -o <output file> [other options]

Options:
  -h:  show this message
  -c:  domain coverage threshold (default: 0.5)

The hmm file should be in HMMER3.0's hmm file format. It is recommended you download the Pfam-A.hmm file from Pfam (http://pfam.xfam.org/). The nucleotide sequence file should be in fasta format.

Output

The output file specifies the following fields of the identified processed pseudogene:

[input sequence name] [protein domains separated by comma] [beginning position] [ending position] [stop codon number] [frameshift number]

Reference PseudoDomain

PseudoDomain can be referenced as:

Yuan Zhang and Yanni Sun, PseudoDomain: identification of processed pseudogenes based on protein domain classification. ACM Conference on Bioinformatics, Computational Biology and Biomedicine (ACM BCB), 2012.

License

Copyright (C) 2014, Yuan Zhang and Yanni Sun.

You may redistribute this software under the terms of GNU GENERAL PUBLIC LICENSE.

About

Identification of processed pseudogenes based on protein domain classification

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published