Learning the Language of the Genome Using RNNs

Epigenetics is the study of how the genome is regulated by external mechanisms. Biological experiments have shown that subsequences of the human genome are regulated by specific proteins. This project explores how an RNN architecture can learn sequential patterns in genomic sequences. A robust method for modeling the genome could offer insight into genetic patterns related to health and disease.

This project was completed for Stanford's Deep Learning in NLP course in Spring 2016.

Please see our poster and final report for details. We designed a bidirectional 2-layer RNN for multitask learning using GRUs. You can find code for the TensorFlow model we used here.
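
For readers who want a feel for the architecture before opening the code, here is a minimal NumPy sketch of the forward pass of a bidirectional 2-layer GRU with one sigmoid output per task. It is not the TensorFlow model from this project. The one-hot DNA encoding, the hidden size, the number of tasks, the weight initialization, and the way the final states are pooled are all assumptions made for illustration, and every function name below is hypothetical. Training is omitted.

```python
import numpy as np

BASES = "ACGT"

def one_hot(seq):
    """Encode a DNA string as a (len(seq), 4) matrix; unknown bases (e.g. N) stay all-zero."""
    x = np.zeros((len(seq), len(BASES)))
    for i, base in enumerate(seq.upper()):
        j = BASES.find(base)
        if j >= 0:
            x[i, j] = 1.0
    return x

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def init_gru(n_in, n_hid, rng):
    """Random parameters for one GRU direction: update (z), reset (r), candidate (h)."""
    def w(rows, cols):
        return rng.normal(0.0, 0.1, size=(rows, cols))
    return {g: (w(n_in, n_hid), w(n_hid, n_hid), np.zeros(n_hid)) for g in "zrh"}

def gru_run(params, xs):
    """Run a GRU over xs of shape (T, n_in); return hidden states of shape (T, n_hid)."""
    Wz, Uz, bz = params["z"]
    Wr, Ur, br = params["r"]
    Wh, Uh, bh = params["h"]
    h = np.zeros(bz.shape[0])
    states = []
    for x in xs:
        z = sigmoid(x @ Wz + h @ Uz + bz)              # update gate
        r = sigmoid(x @ Wr + h @ Ur + br)              # reset gate
        h_tilde = np.tanh(x @ Wh + (r * h) @ Uh + bh)  # candidate state
        h = (1.0 - z) * h + z * h_tilde                # one common GRU convention
        states.append(h)
    return np.stack(states)

def bi_layer(fwd, bwd, xs):
    """Concatenate forward states with backward states, realigned to input order."""
    hf = gru_run(fwd, xs)
    hb = gru_run(bwd, xs[::-1])[::-1]
    return np.concatenate([hf, hb], axis=1)

def build_model(n_hid, n_tasks, seed=0):
    """Two stacked bidirectional layers plus a shared multitask output layer."""
    rng = np.random.default_rng(seed)
    layers, n_in = [], len(BASES)
    for _ in range(2):
        layers.append((init_gru(n_in, n_hid, rng), init_gru(n_in, n_hid, rng)))
        n_in = 2 * n_hid  # the next layer sees both directions
    w_out = rng.normal(0.0, 0.1, size=(2 * n_hid, n_tasks))
    return layers, w_out, np.zeros(n_tasks)

def predict(model, seq):
    """Return one probability per task for a DNA sequence."""
    layers, w_out, b_out = model
    hs = one_hot(seq)
    for fwd, bwd in layers:
        hs = bi_layer(fwd, bwd, hs)
    n_hid = hs.shape[1] // 2
    # Assumed pooling: last forward state plus the backward state at position 0.
    summary = np.concatenate([hs[-1, :n_hid], hs[0, n_hid:]])
    return sigmoid(summary @ w_out + b_out)
```

Calling `predict(build_model(n_hid=8, n_tasks=3), "ACGTNACGT")` returns a length-3 vector of values between 0 and 1, one per hypothetical task. With untrained random weights these values carry no biological meaning. Sharing the recurrent layers across tasks and keeping only the output layer task-specific is what makes the setup multitask.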