Source code for MULCCH: Multi-task consensus clustering

Please see our publication in Bioinformatics: Z Niu, D Chasman, AJ Eisfeld, Y Kawaoka, S Roy. 2016. Multi-task consensus clustering of genome-wide transcriptomes from related biological conditions. Bioinformatics 32 (10), 1509-1517. https://doi.org/10.1093/bioinformatics/btw007

This code is designed to be run AFTER producing the original Arboretum results for several random initializations on the input data.

For more information about Arboretum, please see Roy et al, Genome Research 2013 (http://genome.cshlp.org/content/23/6/1039.full).

There are 2 steps to do the consensus clustering on the results from separate inits of Arboretum.

compute eigen vector for similarity matrix. => Eigen This code is included in EigenVector. We included executable program(Eigen) and an example of usage (runEigen.sh).

Usage1 :./Eigen -s similarity file -d dimension -n [0|1] normalize -t threshold -b binary -o outfile Usage2 :./Eigen -l file list -d dimension -n [0|1] normalize -t threshold -b binary -o outfile

Input Arguments: -s similarity matrix, e.g.similarityMatrix.txt -l file list of multiple cluster assignments -d dimension -n 1 or 0 normalized laplacian or not -t threshold, if less than threshold, set to be 0, if threshold=0 weighted graph is equal to fully connected graph. -b binary,1 or 0. if 1, set the value that is greater than threshold to be 1. -o name of outfile

*NOTE: The genes in output files are placed in the same order as that of similarity matrix (usage1) or the first input cluster assignment(usage2).

use arboretum to cluster eigen vector. => Arboretum We included executable program(incAncClust_SpcFast) with example of usage (runArboretum.sh) in directory of Arboretum.

Usage: ./incAncClust_SpcFast viruses_order mapping k tree initial_clusters random[rseed|none] output_dir mode[learn|generate|visualize] src_virus init_type[uniform|branch_length] p_diagonal_nonleaf

Input Arguments: viruses_order: the order of multiple viruses mapping: viruses mapping k: number of clusters tree: tree structure of viruses describing their relationship initial_clusters: initial cluster assignments

Name		Name	Last commit message	Last commit date
Latest commit History 5 Commits
Arboretum		Arboretum
EigenVector		EigenVector
input		input
Eigen		Eigen
README.md		README.md
contributors.txt		contributors.txt
runEigen.sh		runEigen.sh
similarityMatrix.zip		similarityMatrix.zip

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Source code for MULCCH: Multi-task consensus clustering

There are 2 steps to do the consensus clustering on the results from separate inits of Arboretum.

About

Releases

Packages

Languages

Roy-lab/MULCCH

Folders and files

Latest commit

History

Repository files navigation

Source code for MULCCH: Multi-task consensus clustering

There are 2 steps to do the consensus clustering on the results from separate inits of Arboretum.

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages