graph-ISS

Graph-based Decoding for In Situ Sequencing (ISS).

This repository contains the primary source code of the graph-based image analysis pipeline for processing in situ sequencing data, together with IPython notebooks for reproducing the publication analysis results and figures [1]. The image decoding pipeline consists of three Python 3 library packages for 2D and 3D data processing and an Anduril2 [2] pipeline implementing the decoding workflow.

If Anduril2 is not available for your operating system (e.g. Windows machines) or you are not comfortable with the Scala programming language, an alternative implementation of the decoding workflow is available as a Jupyter notebook.

[1] Partel, G. et al. Identification of spatial compartments in tissue from in situ sequencing data. BioRxiv, https://doi.org/10.1101/765842, (2019).

[2] Cervera, A. et al. Anduril 2: upgraded large-scale data integration framework. Bioinformatics, (2019).

Install Requirements

Anduril2 Pipeline

Anduril 2 is a workflow platform for high-throughput analysis of biomedical data. Workflows are constructed using Scala 2.11 and executed in parallel locally or on Linux clusters using a workflow engine optimized for iterative development. Documentation and installation instructions are available at: http://anduril.org.

Anduril2 OS Requirements

Linux operating systems

Jupyter pipeline

A Jupyter installation is required inside the same Python environment ("pgm_pipeline") used for running the graph-ISS Python modules (see Python Library Requirements). The Jupyter pipeline uses IPython Parallel (https://ipyparallel.readthedocs.io/) to process multiple image tiles in parallel. To install IPython Parallel, run the following command in the "pgm_pipeline" environment:

conda install ipyparallel

Once installed, start N engines to process multiple tiles in parallel, one tile per engine. As a rule of thumb, reserve as many cores per engine as there are imaging rounds, so that each engine can also benefit from the Python modules' parallelization across imaging rounds. For example, on a workstation with 12 cores and a dataset with 4 imaging rounds, start 3 engines, each processing one tile with 4 threads. Run the following command to start the engines:

ipcluster start -n <N>

and wait until you see this message printed before starting to execute the Jupyter pipeline:

[IPClusterStart] Engines appear to have started successfully
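
The rule of thumb for choosing N can be sketched in a few lines of Python. This is a minimal illustration, not part of graph-ISS: the function names engines_for and ipcluster_command are hypothetical, and the fallback to a single engine when there are fewer cores than imaging rounds is an assumption.

```python
def engines_for(total_cores, imaging_rounds):
    """Return how many engines fit when each engine gets one core per imaging round."""
    if total_cores < 1 or imaging_rounds < 1:
        raise ValueError("total_cores and imaging_rounds must be positive")
    # Assumption: always start at least one engine, even with fewer cores than rounds.
    return max(1, total_cores // imaging_rounds)


def ipcluster_command(total_cores, imaging_rounds):
    """Build the ipcluster command line for the computed engine count."""
    return "ipcluster start -n {}".format(engines_for(total_cores, imaging_rounds))


if __name__ == "__main__":
    # The workstation from the example above: 12 cores, 4 imaging rounds.
    print(ipcluster_command(12, 4))
```

For the 12-core, 4-round example this yields 3 engines, matching the text above.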

Bio-Format Command Line Tools

Bio-Formats command line tools are required for dividing whole slide images into smaller tiles for faster computation. They can be downloaded from https://www.openmicroscopy.org/bio-formats.
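
To illustrate what tiling means here, the sketch below computes the pixel boxes that cover an image with fixed-size tiles. It is only an illustration of the idea: the function name tile_boxes, the row-by-row order and the clipping of edge tiles are assumptions, and the tile size actually used by the graph-ISS pipelines is not stated in this README.

```python
def tile_boxes(width, height, tile_size):
    """Return (x, y, w, h) boxes covering a width x height image, row by row."""
    if min(width, height, tile_size) < 1:
        raise ValueError("width, height and tile_size must be positive")
    boxes = []
    for y in range(0, height, tile_size):
        for x in range(0, width, tile_size):
            # Tiles on the right and bottom edges are clipped to the image bounds.
            boxes.append((x, y, min(tile_size, width - x), min(tile_size, height - y)))
    return boxes


if __name__ == "__main__":
    # Hypothetical 5000 x 4000 pixel slide cut into 2000-pixel tiles.
    for box in tile_boxes(5000, 4000, 2000):
        print(box)
```

Each box could then be handed to a cropping tool; check the Bio-Formats documentation for the cropping and tiling options available in your version.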

Python Library Requirements

Create a conda environment named "pgm_pipeline":

$ conda create --name pgm_pipeline

Activate environment:

$ conda activate pgm_pipeline

Install the following python packages:

joblib==0.13.2
keras==2.2.4
networkx==2.3
numpy==1.13.1
pandas==0.23.4
scikit-image==0.13.0
scikit-learn==0.21.3
scipy==0.19.1
pytables==3.4.2
tqdm==4.32.2
R==4.0
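
After installation, the pinned versions can be compared against what is actually installed. The following is a minimal sketch, assuming the pins are available as "name==version" text; the function names are hypothetical. Note that some names in the list are conda package names that may differ from the Python distribution name (for example, pytables is published on PyPI as tables), and R is not a Python distribution, so such entries would be reported as missing by this check.

```python
try:
    from importlib import metadata  # standard library on Python 3.8+
except ImportError:
    import importlib_metadata as metadata  # backport package for older interpreters


def parse_pins(text):
    """Parse 'name==version' lines into a dict, skipping blank lines."""
    pins = {}
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue
        name, _, version = line.partition("==")
        pins[name.strip()] = version.strip()
    return pins


def installed_version(name):
    """Return the installed version of a distribution, or None if it is missing."""
    try:
        return metadata.version(name)
    except metadata.PackageNotFoundError:
        return None


def find_mismatches(pins, lookup=installed_version):
    """Return {name: (wanted, found)} for every pin that is not satisfied."""
    mismatches = {}
    for name, wanted in pins.items():
        found = lookup(name)
        if found != wanted:
            mismatches[name] = (wanted, found)
    return mismatches


if __name__ == "__main__":
    pins = parse_pins("joblib==0.13.2\nnetworkx==2.3\ntqdm==4.32.2\n")
    for name, (wanted, found) in find_mismatches(pins).items():
        print("{}: wanted {}, found {}".format(name, wanted, found))
```

The lookup parameter exists only so the comparison can be exercised without touching the real environment.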

Install SimpleElastix (https://simpleelastix.readthedocs.io) inside the conda environment, following the installation instructions in its documentation.

To deactivate the conda environment:

$ conda deactivate

Analysis Notebook Install Requirements

The following python packages are required for running the notebooks:

joblib==0.13.2
matplotlib==2.2.2
networkx==2.3
nimfa==1.3.4
numpy==1.13.1
opencv-python==3.4.1.15
pandas==0.23.4
scikit-image==0.13.0
scikit-learn==0.21.3
scipy==0.19.1
seaborn==0.9.0
SpatialDE==1.1.3
tqdm==4.32.2
umap-learn==0.3.9
scanpy[leiden]==1.5.1

Data Download

Example ISS data [3] for testing the Anduril decoding pipeline, together with decoding results for reproducing the publication analyses, can be downloaded from: https://doi.org/10.5281/zenodo.3928110

[3] Ke, R. et al. In situ sequencing for RNA analysis in preserved tissue and cells. Nature Methods, 10(9), 857, (2013).

Anduril Pipeline Example Usage

An example of the 2D Anduril decoding pipeline is available for testing (ISS_Anduril_Pipeline_Example.scala). To run the test example, replace the <GRAPH-ISS-FOLDER> and <DATA-FOLDER> strings in the Scala file with the local paths of the graph-iss folder and the downloaded data folder, respectively.

To launch the execution, run the command:
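
The placeholder replacement can also be scripted rather than done by hand. This is a minimal sketch: the function name fill_placeholders, the sample line and the paths are hypothetical and only illustrate the substitution, not the actual content of the Scala file.

```python
def fill_placeholders(scala_source, graph_iss_folder, data_folder):
    """Return the script text with both placeholder strings replaced by local paths."""
    replacements = {
        "<GRAPH-ISS-FOLDER>": graph_iss_folder,
        "<DATA-FOLDER>": data_folder,
    }
    for placeholder, path in replacements.items():
        scala_source = scala_source.replace(placeholder, path)
    return scala_source


if __name__ == "__main__":
    # Hypothetical line; the real script content is not reproduced here.
    sample = 'val data = "<DATA-FOLDER>/tiles"'
    print(fill_placeholders(sample, "/home/user/graph-iss", "/home/user/iss_data"))
```

To apply it to the real file, read ISS_Anduril_Pipeline_Example.scala as text, pass it through the function, and write the result back.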

$ ./ISS_Anduril_Pipeline_Example.scala
