# Merged PTM and Expression SCLC
This notebook investigates the cluster of up-regulated PTMs and genes in small cell lung cancer (SCLC) cell lines. This cluster was isolated in a previous notebook and saved as a tab-separated file.

In [1]:
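# Import the Network class and the widget class from clustergrammer_widget
from clustergrammer_widget import *
# Create a Network object that renders through the notebook widget
net = Network(clustergrammer_widget)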

In [2]:
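# Load the tab-separated matrix saved by the previous notebook
net.load_file('histology_clusters/merge_sclc.txt')
# Keep a DataFrame copy of the matrix for inspection
merge_sclc = net.export_df()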

# Cluster of Up-regulated PTMs and Genes in SCLC Cell Lines
Below we visualize the cluster. It is composed of roughly four large sub-clusters that are generally up-regulated in SCLC cell lines and down-regulated in non-small cell lung cancer (NSCLC) cell lines. The overall cluster contains a mixture of phosphorylation, expression, methylation, and acetylation data.

In [3]:
net.cluster(views=[])
net.widget()
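
The mix of data types described for this cluster can also be tallied directly from the row labels. The sketch below is only illustrative: the example labels and the `<gene>_<type>` naming scheme are assumptions made for this example, not necessarily the format used in `merge_sclc.txt`, and `data_type` is a hypothetical helper.

```python
from collections import Counter

# Hypothetical row labels. The '<gene>_<type>' naming scheme is an assumption
# for illustration and may not match the labels in merge_sclc.txt.
example_rows = [
    'NKX2-1_exp',
    'NKX2-1_Rme_R121',
    'MARCKSL1_phospho',
    'SOX2_exp',
    'HIST1H4A_AcK',
]

def data_type(row_label):
    """Return the measurement type stored after the first underscore."""
    return row_label.split('_')[1]

# Count how many rows belong to each measurement type
type_counts = Counter(data_type(label) for label in example_rows)
print(type_counts)
```

If the real row labels are plain strings that follow a comparable scheme, the same tally could be run over `merge_sclc.index` instead of `example_rows`.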

## Genes/Proteins of Interest
Here we will highlight a few of the interesting genes/proteins found in this cluster of up-regulated PTM and gene expression data in SCLC cell lines. 

### NKX2-1 and SOX2 Cluster

<img src="img/NKX2-1_cluster.png" width="700">

NKX2-1 is a transcription factor that has a role in lung development and has been used as a biomarker in lung cancer ([Yang et al. 2012](https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3494024/)). NKX2-1 is known to have high expression in SCLC and low expression in NSCLC cell lines. NKX2-1 appears twice in the above cluster, once for expression and once for arginine methylation (R121). The expression and methylation data were obtained from independent data yet cluster next to each other, which is reasonable and might imply that increased NKX2-1 methylation is a result of increased protein level. In this dataset, NKX2-1 levels are generally high in SCLC cell lines and low in the majority of NSCLC cell lines.
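
Two rows that cluster next to each other, such as the NKX2-1 expression and methylation rows, should have similar profiles across cell lines. A minimal sketch of checking this with a Pearson correlation follows. The values are made up for illustration and are not taken from the dataset, and `pearson` is a hypothetical helper, not part of Clustergrammer.

```python
from math import sqrt

def pearson(xs, ys):
    """Pearson correlation between two equal-length numeric sequences."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    return cov / sqrt(var_x * var_y)

# Made-up values for six cell lines (three SCLC, then three NSCLC).
# These are NOT measurements from merge_sclc.txt.
toy_expression = [2.1, 1.8, 2.4, -1.2, -0.9, -1.5]
toy_methylation = [1.7, 1.5, 2.0, -0.8, -1.1, -1.3]

r = pearson(toy_expression, toy_methylation)
print(round(r, 3))
```

A correlation close to 1 would be consistent with, but would not prove, the idea that methylation tracks protein level.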


NKX2-1 clusters with several other lung-associated genes, including STFA3 ([Schict et al. 2014](https://www.ncbi.nlm.nih.gov/pubmed/24743970)); SOX2, which is known to be involved in lung cancer ([Karachaliou et al. 2013](https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4367598/)) and is also thought to interact with NKX2-1 ([Ferri et al. 2013](https://www.ncbi.nlm.nih.gov/pubmed/23444355)); GPNMB, which is thought to be involved in lung cancer ([Oyewumi et al. 2016](https://www.ncbi.nlm.nih.gov/pubmed/26883195)); and ID4.

MARCKSL1 is thought to play a role in cytoskeletal regulation ([GeneCards](http://www.genecards.org/cgi-bin/carddisp.pl?gene=MARCKSL1)), and its phosphorylation falls into this cluster.

# Gene Ontology Biological Process 2015
Here we enrich for biological processes from the Gene Ontology resource. This will help us understand the broad biological processes occurring in this cluster of up-regulated PTMs and genes.

In [4]:
net.enrichrgram('GO_Biological_Process_2015')
net.cluster(views=[])
net.widget()

We see enrichment for mRNA processing, splicing, and gene expression. We also see similar results with KEGG 2016 enrichment, which can be run in this notebook with the [Enrichrgram button](http://clustergrammer.readthedocs.io/biology_specific_features.html#enrichment-analysis). Genes with these associations are mainly distributed in the top and bottom sub-clusters. Note that the largest, middle sub-cluster, which has highly up-regulated values, contains relatively few genes with these associations. We can investigate the biological processes in this sub-cluster by clicking the [dendrogram crop button](http://clustergrammer.readthedocs.io/interacting_with_viz.html#interactive-dendrogram) and re-running the enrichment analysis using the front-end [Enrichrgram button](http://clustergrammer.readthedocs.io/biology_specific_features.html#enrichment-analysis).

Doing so reveals that this sub-cluster is also enriched for gene expression and mRNA processing, as well as for several neuronal processes, including neuron projection, axon guidance, and neuron projection morphology. This agrees with the neuronal characteristics of SCLC cell lines (see [Onganer et al. 2005](https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2361510/)).
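
Enrichment here means that genes annotated with a term appear in the cluster more often than expected by chance. The sketch below illustrates that idea with a hypergeometric tail probability. It is a generic over-representation calculation, not Enrichr's scoring code, and every count in it is hypothetical.

```python
from math import comb

def enrichment_pvalue(n_background, n_term, n_cluster, n_overlap):
    """Probability of seeing at least n_overlap annotated genes when
    n_cluster genes are drawn at random from n_background genes,
    n_term of which carry the annotation."""
    max_overlap = min(n_term, n_cluster)
    favorable = sum(
        comb(n_term, k) * comb(n_background - n_term, n_cluster - k)
        for k in range(n_overlap, max_overlap + 1)
    )
    return favorable / comb(n_background, n_cluster)

# Hypothetical counts chosen only for illustration:
# 20000 background genes, 300 annotated with the term,
# 150 genes in the sub-cluster, 12 of them annotated.
p = enrichment_pvalue(n_background=20000, n_term=300, n_cluster=150, n_overlap=12)
print(p)
```

With these made-up counts, only about two annotated genes would be expected by chance (150 × 300 / 20000 = 2.25), so an overlap of 12 gives a small tail probability.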

# MGI Mammalian Phenotype Level 4
The MGI Mammalian Phenotype Ontology is a collection of phenotypes identified after gene knockdown. It can be used as a less biased method to associate genes with high-level phenotypes.

In [5]:
net.enrichrgram('MGI_Mammalian_Phenotype_Level_4')
net.cluster(views=[])
net.widget()

We see that these genes are associated with perinatal lethality (prenatal, preweaning, and postnatal lethality) and neuronal abnormalities (neuron morphology, brain morphology, spinal cord morphology, and nervous system abnormalities). This less biased screen agrees with the neuronal enrichment we saw above for the middle sub-cluster using Gene Ontology Biological Process. 