Structure / Pathway Discovery

MDI Biological Labs. Barnraising workshop - (Maine) May 2016

This project is a collaborative effort between Lisa (Titus group - UC Davis), Harriet (Titus group - UC Davis), Dave Harris (U. Florida), Yuan (Princeton), and Oliver Muellerklein (me - Wayne Getz's group at UC Berkeley). We are working on a way to model structure in pathway emergence from a dataset of

We have a set of organisms. The data have rows of genes and columns of conditions / specifications / etc. per gene. Background: there may be pathways of gene expression we could discover that relate to a species / organism liking burgers, and another pathway of genes that relates to liking mushrooms. The genes in the pathways for liking burgers and liking mushrooms can overlap, so each set of genes / pathway can exist independently.

1. Summary / Overview

1.1 Group To Do

We will create a set of response classes in the simulated data to train on, e.g. "likes burgers", "likes mushrooms".

Note: there are no classes in the test (real) data, so building the model will involve training and testing validity on simulated data only. I.e. can we recover the simulated response classes from the simulated data?
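One possible way to score whether the simulated response classes are recovered is cluster purity: for each predicted group, count the most common true class and divide the total by the number of genes. This is only a sketch; the function name `cluster_purity` and the choice of purity as the measure are assumptions, not something the project has settled on, and the measure assumes one class per gene, so it does not capture overlapping pathways.

```python
from collections import Counter, defaultdict


def cluster_purity(true_classes, predicted_clusters):
    """Fraction of items whose cluster's majority class matches their true class.

    Hypothetical helper: assumes each item has exactly one true class.
    """
    if len(true_classes) != len(predicted_clusters):
        raise ValueError("inputs must have the same length")
    if len(true_classes) == 0:
        raise ValueError("inputs must not be empty")

    # Count true classes inside each predicted cluster.
    members = defaultdict(Counter)
    for true_label, cluster_id in zip(true_classes, predicted_clusters):
        members[cluster_id][true_label] += 1

    # Each cluster contributes the size of its majority class.
    matched = sum(counts.most_common(1)[0][1] for counts in members.values())
    return matched / len(true_classes)


truth = ["likes burgers", "likes burgers", "likes mushrooms", "likes mushrooms"]
perfect = cluster_purity(truth, [0, 0, 1, 1])
partial = cluster_purity(truth, [0, 0, 0, 1])
```

A purity of 1.0 means every predicted group contains a single simulated class. Cluster ids do not need to match class names, which suits the unsupervised setting.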

2. Data

We are going to use simulated data that represents rows of genes with columns of condition expression. The simulated data are generated from a number of parameters describing the gene-properties-transcript relationships / complex.

We set 10 target labels (roughly 10 multi-class predicted probabilities).

Steps for creating simulation data:

Create two Beta probability distributions.
Multiply the two Beta draws.
Draw from a Binomial (Bernoulli) distribution using the product above as the success probability.
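The three steps above can be sketched with NumPy as follows. This is a minimal illustration, not the project's actual generator: the function name `simulate_expression`, the Beta shape parameters, the matrix sizes, and the choice to draw one Beta value per gene and one per condition are all assumptions.

```python
import numpy as np


def simulate_expression(n_genes=200, n_conditions=30, a=2.0, b=5.0, seed=0):
    """Simulate a binary gene-by-condition matrix (hypothetical parameters)."""
    rng = np.random.default_rng(seed)

    # Step 1: two Beta distributions (assumed: one per gene, one per condition).
    gene_p = rng.beta(a, b, size=(n_genes, 1))
    cond_p = rng.beta(a, b, size=(1, n_conditions))

    # Step 2: multiply the two Beta draws; the product stays within [0, 1].
    prob = gene_p * cond_p

    # Step 3: Bernoulli draw (Binomial with n=1) per matrix cell.
    observed = rng.binomial(1, prob)
    return prob, observed


prob, observed = simulate_expression(n_genes=50, n_conditions=8, seed=1)
```

Because the product of two values in [0, 1] is itself in [0, 1], it can be used directly as a Bernoulli success probability.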

3. Model Approaches

We want to examine dimensionality reduction methods: PCA, clustering or other dimensionality reduction methods (e.g. kernel PCA, spectral clustering), autoencoders, and multi-class classification with soft predictions (i.e. probabilities per class instead of discrete predictions).
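As a rough illustration of two of these ideas, the sketch below projects a gene-by-condition matrix onto its leading principal components using an SVD, and converts raw class scores into per-class probabilities with a softmax. The function names `pca_project` and `softmax` are hypothetical, the input is random placeholder data, and the softmax is only one common way to obtain soft predictions.

```python
import numpy as np


def pca_project(X, n_components=2):
    """Project rows of X onto the top principal components via SVD."""
    X = np.asarray(X, dtype=float)
    centered = X - X.mean(axis=0)  # PCA requires column-centered data
    U, S, Vt = np.linalg.svd(centered, full_matrices=False)
    scores = U[:, :n_components] * S[:n_components]
    explained = (S ** 2) / np.sum(S ** 2)  # variance ratio per component
    return scores, explained[:n_components]


def softmax(scores):
    """Turn rows of raw class scores into probabilities that sum to 1."""
    scores = np.asarray(scores, dtype=float)
    shifted = scores - scores.max(axis=1, keepdims=True)  # numerical stability
    exp_scores = np.exp(shifted)
    return exp_scores / exp_scores.sum(axis=1, keepdims=True)


rng = np.random.default_rng(0)
X = rng.normal(size=(50, 10))  # placeholder: 50 genes, 10 conditions
scores, explained = pca_project(X, n_components=2)
probs = softmax([[1.0, 2.0, 3.0], [0.0, 0.0, 0.0]])
```

With 10 target labels, each row passed to `softmax` would hold 10 scores and the output row would give one probability per class.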

Background

Titus - structure gene discovery pitch

Overview

a matrix with rows of genes and columns of some conditions
there may exist some hidden layer of interactions among these conditions
these hidden layers would correspond to pathways of conditional relationships (relationships among the features)

Goals:

find the deep / hidden layer with a deep neural network and / or XGBoost
visualize clustering of the features via t-SNE
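A t-SNE embedding of the features could be sketched with scikit-learn as below. The matrix is transposed so that each condition (column) becomes one point in the embedding. The function name `embed_features`, the random placeholder data, and the perplexity value are assumptions; scikit-learn requires the perplexity to be smaller than the number of points being embedded.

```python
import numpy as np
from sklearn.manifold import TSNE


def embed_features(X, perplexity=5.0, seed=0):
    """Embed the columns (features) of X in 2-D with t-SNE (hypothetical helper)."""
    X = np.asarray(X, dtype=float)
    features = X.T  # one row per condition / feature
    if perplexity >= features.shape[0]:
        raise ValueError("perplexity must be smaller than the number of features")
    tsne = TSNE(n_components=2, perplexity=perplexity, init="pca", random_state=seed)
    return tsne.fit_transform(features)


rng = np.random.default_rng(0)
X = rng.normal(size=(100, 20))  # placeholder: 100 genes, 20 conditions
embedding = embed_features(X)
```

The resulting two columns can be passed to any scatter-plot routine. Distances between well-separated groups in a t-SNE plot should be read with caution, since the method mainly preserves local neighborhoods.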
