Figure 1. Architecture overview of HyperST framework.
Create a new conda environment and activate it:
conda env create -f environment.yml
conda activate HyperST- Request access from HEST Database
- Execute download notebook: dataset_download_hest1k.ipynb
⚠️ Critical Fix: Ifhest.pyfails to download correctly in dataset_download_hest1k, replace the faulty version in:~/.cache/huggingface/modules/datasets_modules/datasets/MahmoodLab--hest/with the corrected version hest.py provided in this repository.
hest1k_datasets/
└── {DATASET_NAME}/
├── wsis/ # Whole-slide H&E images
├── st/ # Spatial transcriptomics (.h5ad)
└── processed_data/
├── selected_gene_list.txt # HMHVG genes
├── selected_hvg_gene_list.txt # HVG genes
└── all_slide_lst.txt # Slide IDs
Before training, raw data must be preprocessed.
bash ./scripts/data_proprocess/kidney.sh please also follow their respective installation requirements in GitHub/HuggingFace to access the weight of UNI .
bash ./scripts/split/kidney.sh Before running: Insert your HuggingFace token in scripts
bash ./scripts/train/hyperst/train_HVG.sh
bash ./scripts/train/hyperst/train_HMHVG.shexperiments/
└── hyperst/
└── $EXPERIMENT_NAME/
└── $DATASET_NAME/
├── sample_split_flod_0/
│ ├── checkpoints/ # Model checkpoints
│ └──samples/ # Gene predictions for test samples
└── ... (other folds)