GitHub - yanshen43/MCAT: de novo motif finding tool

MCAT is a pipeline for de novo motif finding in FASTA files.
Run it from the command line as follows:

usage: orange_pipeline_refine.py [-h] [-w W] [--nmotifs NMOTIFS] [--iter ITER] [-c C]
                                 [-s S] [-d] [-ff] [-v V]
                                 positive_seq negative_seq

positional arguments:
  positive_seq       the fasta file for the positive sequences
  negative_seq       the fasta file for the negative sequences

optional arguments:
  -h, --help         show this help message and exit
  -w W               motif width
  --nmotifs NMOTIFS  max number of distinct motifs to look for
  --iter ITER        max number of iterations (for DECOD tool)
  -c C               config filename
  -s S               statfile directory
  -d                 turn debug mode on
  -ff                force (re)filtering
  -v V               number of results to include in visual output
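
The options above can also be assembled programmatically. The following is a minimal sketch, not part of the distribution: the helper name `build_mcat_command`, the interpreter name `python2.7`, and the FASTA file names are assumptions chosen for illustration. Only the script name and the flags come from the usage text above.

```python
def build_mcat_command(positive_fasta, negative_fasta, width=None, nmotifs=None,
                       max_iter=None, config=None, debug=False,
                       python_exe="python2.7"):
    """Assemble an argument list for orange_pipeline_refine.py.

    Hypothetical helper: it only builds the command and does not run it.
    The interpreter name (python_exe) is an assumption about the local setup.
    """
    cmd = [python_exe, "orange_pipeline_refine.py"]
    if width is not None:
        cmd += ["-w", str(width)]            # motif width
    if nmotifs is not None:
        cmd += ["--nmotifs", str(nmotifs)]   # max number of distinct motifs
    if max_iter is not None:
        cmd += ["--iter", str(max_iter)]     # max iterations (DECOD)
    if config is not None:
        cmd += ["-c", config]                # config filename
    if debug:
        cmd.append("-d")                     # debug mode
    # Positional arguments go last: positive set first, then negative set.
    cmd += [positive_fasta, negative_fasta]
    return cmd


if __name__ == "__main__":
    # Placeholder file names; substitute your own FASTA files.
    cmd = build_mcat_command("positive.fa", "negative.fa", width=8, nmotifs=3)
    print(" ".join(cmd))
    # To launch the pipeline from the repository root, one could then call:
    # import subprocess; subprocess.check_call(cmd)
```

Building the command as a list rather than a single string avoids shell quoting problems when file paths contain spaces.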

This pipeline uses six publicly available third-party tools:
DECOD
MEME
CMF
BioProspector
Weeder
XXmotif

This distribution includes the source for all of these tools, as well as 64-bit and 32-bit executables compiled on Linux. If you are using Windows or macOS, you might need to recompile some of the tools on your platform and replace the relevant executables.
All of these sources were downloaded from their respective websites and are unaltered, with the exception of two small file path changes in Weeder to account for our pipeline not providing files the way Weeder expected.

Requirements:
Java 1.8
Python 2.7 with SciPy 0.19.0, NumPy 1.12.1, and Matplotlib 1.2.0
Perl 5.16.3
Ghostscript 9.07
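
Before running the pipeline, it can help to confirm that the external programs listed above are on the `PATH`. The sketch below is not part of the distribution and makes several assumptions: the executable names `java`, `perl`, and `gs` are common defaults that may differ on your system, the helper name `find_missing_tools` is hypothetical, and the script uses `shutil.which`, so it should be run under Python 3.3 or later rather than the Python 2.7 interpreter the pipeline itself requires. It checks only for presence, not for the versions listed above.

```python
import shutil

# Display name -> executable name. The executable names are assumptions
# (typical defaults); adjust them for your installation.
REQUIRED_TOOLS = {"Java": "java", "Perl": "perl", "Ghostscript": "gs"}


def find_missing_tools(tools=REQUIRED_TOOLS, which=shutil.which):
    """Return the sorted display names of tools not found on the PATH.

    The `which` parameter exists so the lookup can be swapped out in tests.
    """
    return sorted(name for name, exe in tools.items() if which(exe) is None)


if __name__ == "__main__":
    missing = find_missing_tools()
    if missing:
        print("Not found on PATH: " + ", ".join(missing))
    else:
        print("All external tools were found on PATH.")
```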

