MarsGT: Multi-omics data analysis for rare population inference using single-cell graph transformer

Description

MarsGT: Multi-omics analysis for rare population inference using single-cell Graph Transformer, which is an end-to-end deep learning model for rare cell population identification from scMulti-omics data.

Installation

System Requirements

Python Version >=3.8.0
Hardware Architecture: x86_64
Operating System: GNU/Linux or Windows or MacOS

Dependencies, MarsGT has the following dependencies:

anndata==0.8.0
dill==0.3.4
matplotlib==3.5.1
numpy==1.22.3
pandas==1.4.2
scipy==1.9.1
seaborn==0.11.2
scikit-learn==1.1.2
torch==1.12.0
torch-geometric==2.1.0.post1
torchmetrics==0.9.3
xlwt==1.3.0
tqdm==4.64.0
scanpy==1.9.1
leidenalg==0.8.10
ipywidgets==8.0.6

Installation Steps

The installation process involves some optional and necessary steps. Here's the detailed breakdown:

Recommended Step: Create a new environment, you should use python 3.8.
```
conda create --name marsgt python=3.8
conda activate marsgt
```

Necessary Step: You need to install either the CPU or GPU version of PyTorch as per your preference, We recommend using the GPU version, which has a faster running speed compared to the CPU version:

CPU Version

For Linux system (torch-1.12.0+ torch_cluster-1.6.0+ torch_scatter-2.0.9+ torch_sparse-0.6.14):

pip install https://download.pytorch.org/whl/cpu/torch-1.12.0%2Bcpu-cp38-cp38-linux_x86_64.whl
pip install https://data.pyg.org/whl/torch-1.12.0%2Bcpu/torch_cluster-1.6.0%2Bpt112cpu-cp38-cp38-linux_x86_64.whl
pip install https://data.pyg.org/whl/torch-1.12.0%2Bcpu/torch_scatter-2.0.9-cp38-cp38-linux_x86_64.whl
pip install https://data.pyg.org/whl/torch-1.12.0%2Bcpu/torch_sparse-0.6.14-cp38-cp38-linux_x86_64.whl

For Windows system (torch-1.12.0+ torch_cluster-1.6.0+ torch_scatter-2.0.9+ torch_sparse-0.6.14):

pip install https://download.pytorch.org/whl/cpu/torch-1.12.0%2Bcpu-cp38-cp38-win_amd64.whl
pip install https://data.pyg.org/whl/torch-1.12.0%2Bcpu/torch_scatter-2.0.9-cp38-cp38-win_amd64.whl
pip install https://data.pyg.org/whl/torch-1.12.0%2Bcpu/torch_sparse-0.6.14-cp38-cp38-win_amd64.whl
pip install https://data.pyg.org/whl/torch-1.12.0%2Bcpu/torch_cluster-1.6.0%2Bpt112cpu-cp38-cp38-win_amd64.whl

For MacOS system (torch-1.12.0+ torch_cluster-1.6.0+ torch_scatter-2.0.9+ torch_sparse-0.6.14):

conda install pytorch==1.12.0
pip install https://data.pyg.org/whl/torch-1.12.0%2Bcpu/torch_scatter-2.0.9-cp38-cp38-macosx_10_15_x86_64.whl
pip install https://data.pyg.org/whl/torch-1.12.0%2Bcpu/torch_cluster-1.6.0-cp38-cp38-macosx_10_15_x86_64.whl
pip install https://data.pyg.org/whl/torch-1.12.0%2Bcpu/torch_sparse-0.6.14-cp38-cp38-macosx_10_15_x86_64.whl

GPU Version

Please visit the official PyTorch website at PyTorch to select and download the CUDA-enabled version of PyTorch that best matches your system configuration.

For linux system(You need to select the version that is compatible with your system's graphics card. For example: torch-1.12.0+ torch_cluster-1.6.0+ torch_scatter-2.1.0+ torch_sparse-0.6.16):

pip install https://download.pytorch.org/whl/cu102/torch-1.12.0%2Bcu102-cp38-cp38-linux_x86_64.whl
pip install https://data.pyg.org/whl/torch-1.12.0%2Bcu102/torch_scatter-2.1.0%2Bpt112cu102-cp38-cp38-linux_x86_64.whl
pip install https://data.pyg.org/whl/torch-1.12.0%2Bcu102/torch_sparse-0.6.16%2Bpt112cu102-cp38-cp38-linux_x86_64.whl
pip install https://data.pyg.org/whl/torch-1.12.0%2Bcu102/torch_cluster-1.6.0%2Bpt112cu102-cp38-cp38-linux_x86_64.whl

For Windows system(You need to select the version that is compatible with your system's graphics card. For example: torch-1.12.0+ torch_cluster-1.6.0+ torch_scatter-2.1.0+ torch_sparse-0.6.16):

pip install https://download.pytorch.org/whl/cu116/torch-1.12.0%2Bcu116-cp38-cp38-win_amd64.whl
pip install https://data.pyg.org/whl/torch-1.12.0%2Bcu116/torch_scatter-2.1.0%2Bpt112cu116-cp38-cp38-win_amd64.whl
pip install https://data.pyg.org/whl/torch-1.12.0%2Bcu116/torch_sparse-0.6.15%2Bpt112cu116-cp38-cp38-win_amd64.whl
pip install https://data.pyg.org/whl/torch-1.12.0%2Bcu116/torch_cluster-1.6.0%2Bpt112cu116-cp38-cp38-win_amd64.whl

For MacOS system(According to the official PyTorch documentation, CUDA is not available on MacOS, please use the default package):

Necessary Step: You can directly install MarsGT using the pip command:
```
pip install --upgrade MarsGT
```

Data Download

You can choose to download them directly from your browser by the following URL:

You also can download them from cmd command：

curl -o Tutorial_example.zip https://zenodo.org/api/files/7ca78984-0e31-48cf-8b48-9544099d57bb/Tutorial_example.zip
curl -o Mouse_retina.zip https://zenodo.org/api/files/7ca78984-0e31-48cf-8b48-9544099d57bb/Mouse_retina.zip
curl -o B_lymphoma.zip https://zenodo.org/api/files/7ca78984-0e31-48cf-8b48-9544099d57bb/B_lymphoma.zip
curl -o PBMCs.zip https://zenodo.org/api/files/7ca78984-0e31-48cf-8b48-9544099d57bb/PBMCs.zip

TF Binding Sites Data

We retrieved the genome browser track file from JASPAR, which stores all known TF binding sites for each TF. A p-value score was provided in JASPAR. Download Links：

Human (hg38 genome): hg38_lisa_500.qsave
Mouse (mm10 genome): mm10.qsave

MarsGT Model Tutorial

We have curated tutorials to assist you in operating the MarsGT model. You can locate these tutorials in the marsgt/Tutorial directory of the project. Additionally, we have provided an Example Dataset to aid you in testing and acquainting yourself with the MarsGT functionality:

Example Dataset ↗

Note-1

The execution of this code approximately takes 2 hours and requires about 250GB of memory.

Note-2

More epochs will lead to better performance, but will also require a longer duration. Please balance this according to your needs to set parameters 'epochs' and 'num_epochs'.

Note-3

For the usage of MAESTRO in this context, please refer to the process in Tutorial/Tutorial_server_version/Case1/MAESTRO_Gene_Peak_Calculation.R.

Beyond the aforementioned resources, we offer two versions of the tutorial designed to reproduce the results of the paper:

Local Version ↗: This version is specifically created for running the model locally on your computer. Utilize this version if you are aiming to reproduce the results documented in the paper.
Server Version ↗: This version is crafted for running the model on a server. Opt for this version if your goal is to reproduce both the model running process and the subsequent results.

To begin the tutorial, select the version that suits your needs or choose the example dataset. Subsequently, follow the directives provided in the README file within the respective directory or refer to the Jupyter notebook for the example dataset.

Name		Name	Last commit message	Last commit date
Latest commit History 296 Commits
Data		Data
MarsGT		MarsGT
Tutorial		Tutorial
images		images
README.md		README.md
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

MarsGT: Multi-omics data analysis for rare population inference using single-cell graph transformer

Description

Installation

System Requirements

Dependencies, MarsGT has the following dependencies:

Installation Steps

Data Download

You can choose to download them directly from your browser by the following URL:

You also can download them from cmd command：

TF Binding Sites Data

MarsGT Model Tutorial

Note-1

Note-2

Note-3

About

Releases

Packages

Languages

OSU-BMBL/marsgt

Folders and files

Latest commit

History

Repository files navigation

MarsGT: Multi-omics data analysis for rare population inference using single-cell graph transformer

Description

Installation

System Requirements

Dependencies, MarsGT has the following dependencies:

Installation Steps

Data Download

You can choose to download them directly from your browser by the following URL:

You also can download them from cmd command：

TF Binding Sites Data

MarsGT Model Tutorial

Note-1

Note-2

Note-3

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages