GASTON is an interpretable deep learning model for learning a topographic map of a tissue slice from spatially resolved transcriptomics (SRT) data. Specifically, GASTON models gene expression topography by learning the isodepth, a 1-D coordinate that smoothly varies across a tissue slice.
The isodepth and topographic map learned by GASTON have many downstream applications including
- identifying spatial domains,
- inferring continuous gradients and discontinuous jumps in gene expression,
- deriving maps of spatial variation in cell type organization,
- analyzing continuous gradients in the tumor microenvironment
GASTON is on pypi: https://pypi.org/project/gaston-spatial/.
pip install gaston-spatial
You can also directly install the conda environment from the environment.yml
file:
cd GASTON
conda env create -f environment.yml
Then install GASTON using pip.
conda activate gaston-package
pip install -e .
Installation should take <10 minutes.
We will add GASTON to bioconda soon!
Check out our readthedocs, which includes tutorials for four analyses:
- mouse cerebellum (Slide-SeqV2)
- human colorectal tumor (10x Visium)
- human breast cancer tumor (10x Xenium)
- mouse cortex (MERFISH).
Training GASTON by following the tutorials should take <1 hour.
GASTON requires an NxG gene expression matrix (e.g. UMI counts) and an Nx2 spatial location matrix, which can be provided or read from Space Ranger output.
The data to run the tutorials is located in docs/notebooks/tutorials/
. Note that due to Github size constraints, you have to download the counts matrices for both analyses from Google Drive.
- torch (=2.0.0)
- matplotlib (=3.8.0)
- pandas (=2.1.1)
- scikit-learn (=1.3.1)
- numpy (=1.23.4)
- jupyterlab (=4.0.6)
- seaborn (=0.12.2)
- tqdm (=4.66.1)
- scipy (=1.11.2)
- scanpy (=1.9.5)
- squidpy (=1.3.1)
See full list in environment.yml
file. GASTON can be run on CPU or GPU.
The GASTON manuscript is available on bioRxiv. If you use GASTON for your work, please cite our paper.
@article {Chitra2023,
author = {Uthsav Chitra and Brian J. Arnold and Hirak Sarkar and Cong Ma and Sereno Lopez-Darwin and Kohei Sanno and Benjamin J. Raphael},
title = {Mapping the topography of spatial gene expression with interpretable deep learning},
elocation-id = {2023.10.10.561757},
year = {2023},
doi = {10.1101/2023.10.10.561757},
publisher = {Cold Spring Harbor Laboratory},
URL = {https://www.biorxiv.org/content/early/2023/10/13/2023.10.10.561757},
eprint = {https://www.biorxiv.org/content/early/2023/10/13/2023.10.10.561757.full.pdf},
journal = {bioRxiv}
}
For questions or comments, please file a Github issue and/or email Uthsav Chitra (uchitra@princeton.edu).