Welcome to the Computational Journal Club at Drexel University College of Medicine.
Date | Title | Link | Presenter |
---|---|---|---|
TBD | TBD | TBD | |
TBD | Privacy-preserving generative deep neural networks support clinical data sharing | BioArxiv | Jessica Eager |
TBD | TBD | TBD | |
TBD | DeepGO: predicting protein functions from sequence and interactions using a deep ontology-aware classifier | iSCB | Angela Tomita |
Date | Paper | Link | Presenter |
---|---|---|---|
03/29/2019 | Deep learning enables accurate clustering and batch effect removal in single-cell RNA-seq analysis | BioRxiv | Robert Link |
03/01/2019 | Identification and localization of Sea Lion anatomical features using transfer learning | Will Dampier | |
02/08/19 | GOATOOLS: A Python library for Gene Ontology analyses | Scientific Reports | DV Klopfenstein |
02/01/19 | Predicting the mutations generated by repair of Cas9-induced double-strand breaks | Nature Biotechniques | Will Dampier |
01/25/19 | dna2vec: Consistent vector representations of variable-length k-mers; Continuous Distributed Representation of Biological Sequences for Deep Proteomics and Genomics | arxiv arxiv | Angela Tomita |
01/18/19 | Predicting mRNA abundance directly from genomic sequence using deep convolutional neural networks | BioRxiv | Robert Link |
01/11/19 | Synergizing CRISPR/Cas9 Off-Target Predictions for Ensemble Insights and Practical Applications | Pubmed | Cheng-Han Chung |
11/16/18 | Generating and designing DNA with deep generative models | Arxiv | Will Dampier |
Generating and designing DNA with deep generative models [Arxiv]
Nathan Killoran, Leo J. Lee, Andrew Delong, David Duvenaud, Brendan J. Frey
An exploration of a GANN architecture for creating novel DNA segments. The paper explores the biological nature of model. It has abiity to replicate complementarity. It has smooth transitions in sequence-space when interpolating in Z-space. They then explore applications for this in designing DNA probes with idealized binding properties.
Presented by Will Dampier. Slides
Shixiong Zhang, Xiangtao Li, Qiuzhen Lin and Ka-Chun Wong
A basic benchmark of prediction ability among classification algorithms and combinations of them. The ensemble model using CFD, MIT, MIT Website score, Cropit and CCTop altogether had the highest AUC-ROC and AUC-PRC.
Presented by Cheng-Han Chung. Slides
Felicity Allen, Luca Crepaldi, Clara Alsinet, Alexander J. Strong, Vitalii Kleshchevnikov, Pietro De Angeli, Petra Páleníková, Anton Khodak, Vladimir Kiselev, Michael Kosicki, Andrew R. Bassett, Heather Harding, Yaron Galanty, Francisco Muñoz-Martínez, Emmanouil Metzakopian, Stephen P. Jackson & Leopold Parts.
This paper discusses how the gRNA and target influence the type and distribution of Cas9 editing outcomes. They use a clever plasmid system to measure the editing consequences at scale. The authors find that these editing profiles are repoducible within the same gRNA, diverse across different gRNAs, and are predictable when the target sequence is known.
Presented by Will Dampier. Slides
Predicting mRNA abundance directly from genomic sequence using deep convolutional neural networks. BioArxiv
Vikram Agarwal & Jay Shendure
A convolutional neural network (CNN) designed to predict mRNA steady-state abundance from promoter sequence and mRNA half-life proxy. It explains 59% of the variation in expression for humans and 71% of the expression for mice, more than doubling the performance accuracy of previous models attempting to explain transcription.
Presented by Robert Link. Slides
Patrick Ng
Ehsaneddin Asgari, Mohammad R. K. Mofrad
Embeddings are popular in the deep learning community as inputs to models, and these two specific tools are designed for processing and representing dna/protein sequences. The vector arithmetic for dna2vec representations is analogous to nucleotide concatenation using (Spearman = 0.831), and ProtVec protein classification accuracy is 93%.
Presented by Angela Tomita. Slides
GOATOOLS: A Python library for Gene Ontology analyses. Scientific Reports
D. V. Klopfenstein, Liangsheng Zhang, Brent S. Pedersen, Fidel Ramírez, Alex Warwick Vesztrocy, Aurélien Naldi, Christopher J. Mungall, Jeffrey M. Yunes, Olga Botvinnik, Mark Weigel, Will Dampier, Christophe Dessimoz, Patrick Flick, Haibao Tang
Gene Ontology (GO) is used to describe gene products in a computationally acessible manner. This paper describes a Python library and a set of scripts used to query the GO, run enrichment analyses on sets of genes, and describes a novel method for grouping GO terms.
Presented by DV Klopfenstein. Slides
Deep learning enables accurate clustering and batch effect removal in single-cell RNA-seq analysis. BioRxiv
Xiangjie Li, Yafei Lyu, Jihwan Park, Jingxiao Zhang, Dwight Stambolian, Katalin Susztak, Gang Hu, Mingyao Li
A commonly used method to observe expression patterns within single-cell RNA-seq is called clustering. A complication to single-cell RNA-Seq analysis, refered to as batch effect, can cause incorrect results if not properly accounted for. This paper describes a novel method called deep embedding algorithm for single-cell (DESC) that utilizes deep learning to improve upon previous clustering methods and simultaneously remove batch effect.
Presented by Robert Link Slides
Copyright 2018-2019, Dampier Lab. All rights reserved.