Skip to content

Commit

Permalink
Updating Logo and Description in README (#183)
Browse files Browse the repository at this point in the history
* update and move logo to docs

* change logo link, add citation, and modify intro in README

* fix logo path

* add link to website
  • Loading branch information
gwaybio committed Sep 13, 2019
1 parent 8eb29ee commit 8b6219c
Show file tree
Hide file tree
Showing 3 changed files with 14 additions and 4 deletions.
18 changes: 14 additions & 4 deletions README.md
Original file line number Diff line number Diff line change
@@ -1,4 +1,4 @@
![logo](https://raw.githubusercontent.com/greenelab/BioBombe/master/logo.png)
![logo](https://raw.githubusercontent.com/greenelab/BioBombe/master/docs/logo.png)

# Sequential Compression of Gene Expression Data Across Latent Space Dimensions

Expand All @@ -10,13 +10,23 @@

The repository stores data and data processing modules to sequentially compress gene expression data.

Named after the [mechanical device developed by cryptologists in World War II](https://en.wikipedia.org/wiki/Bombe) to decipher secret messages sent by [Enigma machines](https://en.wikipedia.org/wiki/Enigma_machine), BioBombe is used to enhance biological signatures in gene expression data.
Inspired by the number-crunching knobs of [Alan Turing's](https://en.wikipedia.org/wiki/Alan_Turing) device, BioBombe sequentially compresses gene expression input across latent dimensionalities and deciphers the the biological signals embedded within compressed gene expression features.
Named after the [mechanical device](https://en.wikipedia.org/wiki/Bombe) developed by [Alan Turing](https://en.wikipedia.org/wiki/Alan_Turing) and other cryptologists in World War II to decipher secret messages sent by [Enigma machines](https://en.wikipedia.org/wiki/Enigma_machine), BioBombe represents an approach used to decipher hidden messages embedded in gene expression data.
We use the BioBombe approach to study different biological representations learned across compression algorithms and various latent dimensionalities.

In this repository, we sequentially compress three different gene expression data sets (TCGA, GTEx, and TARGET) across 28 different latent dimensions (_k_) using five different algorithms (PCA, ICA, NMF, DAE, and VAE).
In this repository, we compress three different gene expression data sets (TCGA, GTEx, and TARGET) across 28 different latent dimensions (_k_) using five different algorithms (PCA, ICA, NMF, DAE, and VAE).
We evaluate each algorithm and dimension using a variety of metrics.
Our goal is to construct reproducible gene expression signatures with unsupervised learning.

Links to access data and archived results can be found here: https://greenelab.github.io/BioBombe/

## Citation

> Sequential compression across latent space dimensions enhances gene expression signatures
Way, G.P., Zietz, M., Himmelstein, D.S., Greene, C.S.
biorXiv preprint (2019) doi:10.1101/573782

## Approach

Our approach is outlined below:

![overview](https://raw.githubusercontent.com/greenelab/BioBombe/master/compression-overview.png)
Expand Down
Binary file added docs/logo.png
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Binary file removed logo.png
Binary file not shown.

0 comments on commit 8b6219c

Please sign in to comment.