The code for the paper "Comprehensive benchmarking of batch integration methods for spatial transcriptomics using a large-scale cancer atlas"
This benchmark uses data from MOSAIC Window, a 60-patient subset of the larger MOSAIC dataset featuring spatial omics data across four cancer indications: Bladder, Ovarian, Glioblastoma, and Mesothelioma. To access the data, researchers must request access through the European Genome-phenome Archive (EGA) portal. Once approved, data can be downloaded using the pyega3 download client. The required files are the Space Ranger count outputs for Visium data. Deconvolution outputs (cell type proportions for each spot) are already provided in data/deconvolution_outputs and are necessary to run the experiments. Additional preprocessing steps for each integration method are specified in the experiment configuration files.
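The deconvolution outputs are per-spot cell-type proportion tables. As a quick sanity check before running experiments, each spot's proportions should sum to one. The snippet below is an illustrative sketch only: the column names and file layout are assumptions, not the repository's actual schema in data/deconvolution_outputs.

```python
import csv
import io

# Hypothetical deconvolution output: one row per Visium spot, one column per
# cell type. Real files in data/deconvolution_outputs may use other columns.
raw = """spot_id,tumor,fibroblast,immune
AAACAAGTATCTCCCA-1,0.62,0.25,0.13
AAACAATCTACTAGCA-1,0.10,0.55,0.35
"""

reader = csv.DictReader(io.StringIO(raw))
proportions = {
    row["spot_id"]: {k: float(v) for k, v in row.items() if k != "spot_id"}
    for row in reader
}

# Sanity check: per-spot cell-type proportions should sum to ~1.
for spot, props in proportions.items():
    assert abs(sum(props.values()) - 1.0) < 1e-6, spot

print(len(proportions))  # number of spots parsed
```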
For detailed information on configuring data paths and environment variables, see src/st_benchmark/datasets/README.md.
```
mkdir st-benchmark-project && cd st-benchmark-project
git clone git@github.com:owkin/st-benchmark.git
cd st-benchmark
```

Note: This project uses a hybrid package management approach:
- UV manages Python packages and dependencies
- Conda/Mamba manages R packages and system dependencies
Note: This setup expects you to be using zsh.
```
make setup-environment
source ~/.zshrc
```

This will:
- Install Miniforge if not present
- Clean up old conda installations
- Create a mamba alias for easier package management
- Set path variables in your ~/.zshrc
```
conda create -n st-benchmark "python==3.11"
conda activate st-benchmark
make external-repos
make install-uv
make lock
make install-python
source ./.venv/bin/activate
make install-R
```

(`make install-R` will take a rather long time.)
Note: Make sure to activate both environments before running experiments:
```
conda activate st-benchmark
source ./.venv/bin/activate
```

or prepend your command with `uv run ...` to run with the locked Python environment.
Use uv to add a library to the project:

```
uv add numpy
```

Use uv to update the lock file:

```
uv lock
```

Experiments are run through the Hydra framework.
Make sure to `conda activate st-benchmark` and then `source ./.venv/bin/activate` before running your experiments.
An experiment call then looks like:

```
uv run python run.py experiment=st/integration_pca cohort=Bladder experiment_type=all_at_once batch_type=patient data=mosaic_window_dev
```

Experiment configs live in the experiment_configs folder as .yaml files. Experiments are highly configurable and can be specified through the command line.
- `experiment`: Determines which of 12 integration methods is used.
- `batch_type`: We investigate three types of batch effect: patient, center, and indication.
- `cohort`: Each batch effect type has different cohorts. For the patient batch effect, we have five cohorts: Glioblastoma, Lymphoma, Bladder, Mesothelioma, and Ovarian.
- `experiment_type`: Determines how train-test splitting is performed. `all_at_once` refers to all-at-once integration; `iid` and `ood` perform the split before integration, and the output metrics are averaged over 5-fold CV.
- `data`: You can use either the `full` dataset or a `dev` dataset with a small number of randomly subsampled spots.
You can also run sweeps with the following syntax:
```
uv run python run.py --multirun experiment=st/integration_pca batch_type=patient cohort=Bladder,Mesothelioma,Glioblastoma,Ovarian,Lymphoma experiment_type=all_at_once data=mosaic_window_full
```

Pre-configured experiment scripts that replicate the paper's results are available in the runners/ folder. For more details, refer to runners/README.md.
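Hydra's --multirun expands comma-separated values into one run per combination. A plain-Python sketch of that expansion (illustrative only, not Hydra's actual implementation):

```python
from itertools import product

# Sweep values from the command above: one run per cohort.
sweep = {
    "experiment": ["st/integration_pca"],
    "batch_type": ["patient"],
    "cohort": ["Bladder", "Mesothelioma", "Glioblastoma", "Ovarian", "Lymphoma"],
    "experiment_type": ["all_at_once"],
    "data": ["mosaic_window_full"],
}

# Cartesian product over all swept options, rendered as override strings.
runs = [
    [f"{key}={value}" for key, value in zip(sweep, combo)]
    for combo in product(*sweep.values())
]

print(len(runs))  # 5 runs, one per cohort
```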
Each such call to run.py will create a folder within st_benchmark/outputs with embedding plots and a metric .csv file, organized by date and time of the run.
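Because output folders are named by date and then time, the most recent run can be located with a plain lexicographic sort. A small sketch, using a temporary directory in place of st_benchmark/outputs:

```python
import tempfile
from pathlib import Path

# Stand-in for st_benchmark/outputs: date folders containing time folders.
outputs = Path(tempfile.mkdtemp())
for run in ["2025-09-24/15-33-14", "2025-09-24/09-02-51", "2025-09-23/18-45-00"]:
    (outputs / run).mkdir(parents=True)

# YYYY-MM-DD/HH-MM-SS names sort chronologically as plain strings,
# so the lexicographic maximum is the latest run.
latest = max(p for date_dir in outputs.iterdir() for p in date_dir.iterdir())
print(latest.relative_to(outputs).as_posix())  # 2025-09-24/15-33-14
```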
Many options for various sweeps are given in st_benchmark/runners, but feel free to make your own.
Experiments are composed using Hydra's config composition system. The main_config.yaml selects an experiment config via the defaults list:
```yaml
defaults:
  - experiment: st/integration_combat
  - _self_
```

Each experiment config (e.g., experiment/st/integration_pca.yaml) then composes multiple sub-configs:
```yaml
defaults:
  - /default@experiment.default: default  # Packages into 'experiment' namespace
  - /experiment_type: all_at_once         # Global config for experiment type
  - /data: mosaic_window_full             # Packages into 'data' namespace
  - /transform: st/pca                    # Packages into 'transform' namespace
  - _self_                                # Includes current config values
```

This allows modular composition: swap experiment_type, data, or transform configs independently to create new experiment variants.
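Conceptually, Hydra composes the defaults list top to bottom, with later entries (and `_self_`, the config's own values) overriding earlier ones. A minimal dict-merge sketch of that behavior, with toy stand-in configs rather than the repository's real ones (not Hydra's actual resolver):

```python
def deep_merge(base: dict, override: dict) -> dict:
    """Recursively merge override into base; later values win."""
    merged = dict(base)
    for key, value in override.items():
        if isinstance(value, dict) and isinstance(merged.get(key), dict):
            merged[key] = deep_merge(merged[key], value)
        else:
            merged[key] = value
    return merged

# Toy stand-ins for the sub-configs named in the defaults list above.
experiment_type = {"experiment_type": "all_at_once"}
data = {"data": {"name": "mosaic_window_full", "subsample": None}}
transform = {"transform": {"name": "st/pca", "n_components": 50}}
self_values = {"data": {"subsample": 1000}}  # the config's own overrides

# Merge in defaults-list order; _self_ is applied last.
config = {}
for cfg in (experiment_type, data, transform, self_values):
    config = deep_merge(config, cfg)

print(config["data"])  # {'name': 'mosaic_window_full', 'subsample': 1000}
```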
To generate comparison plots and LaTeX tables from sweep experiments, you can run, for example:

```
uv run python figures/run_comparison.py --metrics_dir=outputs/2025-09-24/15-33-14
```

The resulting figures and LaTeX tables will be placed in st_benchmark/figures in a subdirectory with the same date and time.
Both scripts generate:
- Comparison plots showing performance gaps across methods and metrics
- Summary tables with statistical analysis
- LaTeX tables for publication
- CSV files with detailed results
The bash script figures/generate_figures.sh contains
