Skip to content

Profiling Integrative Communities of Aggregated Single-cell Omics data

Notifications You must be signed in to change notification settings

mjoppich/PICASO

Repository files navigation

PICASO: Profiling Integrative Communities of Aggregated Single-cell Omics data

Documentation Status

alt text

Abstract

Various single-cell modalities covering transcriptomics, epigenetic and spatio-temporal changes in health and disease phenotypes are used in an exploratory way to understand biological systems at single-cell resolution. However, the vast amount of such single-cell data is not systematically linked to existing biomedical data. Networks have previously been used to represent harmonized biomedical data. Integrating various resources of biomedical data in networks has recently received increasing attention. These aggregated networks can provide additional insight into the biology of complex human diseases at cell-type level, however, lack inclusion of single cell expression data. Here, we present the PICASO framework, which incorporates single-cell gene expression data as an additional layer to represent associations between cell types, disease phenotypes, drugs and genes. The PICASO network includes several standardized biomedical databases such as STRING, Uniprot, GeneOntology, Reactome, OmniPath and OpenTargets. Using multiple cell type-specific instances of the framework, each annotated and scored with their respective expression data, comparisons between disease states can be made by computing respective sub-networks and comparing the expression scores between conditions. Ultimately, these group-specific networks will allow the identification of relevant genes, processes and potentially druggable targets, as well as the comparison of different measured groups and thus the identification of group-specific communities and interactions.

If you find this work helpful in your analyses, please cite:

PICASO: Profiling Integrative Communities of Aggregated Single-cell Omics data
Markus Joppich, Rafael Kramann, Sikander Hayat
bioRxiv 2024.08.28.610120; doi: https://doi.org/10.1101/2024.08.28.610120

Collaborators

Collaboration with @HayatLab

Setup

Creating environment

conda create --name regnetworks -c conda-forge python=3.11
conda activate regnetworks

# To run in jupyter
conda install -c conda-forge python-igraph ipykernel
python -m ipykernel install --user --name=regnetworks

pip install scanpy networkx scikit-learn natsort seaborn matplotlib leidenalg markov_clustering

# For the AIDescriptor class
pip install huggingface_hub llama-cpp-python 'transformers[tf-cpu]'

Using PICASO

At the moment users have to add the path to the PICASO-repository to their python environment. This can be achieved by extending the python path:

import sys
sys.path.insert(0, <path-to-PICASO-folder>)

After doing so you can use the PICASO framework by importing all functions from the kgraph-module:

from PICASO.kgraph import *

Please also consider our notebooks for usage examples of the PICASO framework:

  • Creating the base PICASO network: notebook

  • KPMP snRNA-seq data preparation: notebook

  • KPMP PICASO data preparation: notebook

  • KPMP PICASO analysis: notebook

  • Myocardial Infarction PICASO analysis: notebook