While recent multimodal large language models (MLLMs) have advanced automated ECG interpretation, they still face two key limitations: (1) insufficient multimodal synergy between ECG time-series signals and ECG images, and (2) limited explainability in linking diagnoses to granular waveform evidence. We introduce GEM, the first MLLM to unify ECG time series, 12-lead ECG images, and text for grounded, clinician-aligned ECG interpretation. GEM enables feature-grounded analysis, evidence-driven reasoning, and a clinician-like diagnostic process through three core innovations: a dual-encoder framework that extracts complementary time-series and image features, cross-modal alignment for effective multimodal understanding, and knowledge-guided instruction generation that produces high-granularity grounding data (ECG-Grounding) linking diagnoses to measurable parameters (e.g., QRS/PR intervals). We also propose the Grounded ECG Understanding task, a clinically motivated benchmark designed to comprehensively assess an MLLM's capability in grounded ECG understanding. Experimental results on both existing benchmarks and our proposed benchmark show that GEM significantly improves predictive performance (CSN +7.4%), explainability (+22.7%), and grounding (+25.3%), making it a promising approach for real-world clinical applications.
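The dual-encoder front-end described above can be sketched in a few lines. Everything here is an illustrative placeholder, not GEM's actual architecture: the dimensions are made up, and random projections stand in for the learned ECG time-series and image encoders; only the overall pattern (encode each modality, project into a shared space, align, and concatenate) reflects the idea.

```python
import numpy as np

rng = np.random.default_rng(0)

# Hypothetical feature dimensions (not GEM's real sizes).
D_TS, D_IMG, D_SHARED = 256, 512, 128

# Random projections standing in for the learned time-series
# and image encoders' output heads.
W_ts = rng.normal(size=(D_TS, D_SHARED))
W_img = rng.normal(size=(D_IMG, D_SHARED))

def encode_and_align(ts_feat: np.ndarray, img_feat: np.ndarray) -> np.ndarray:
    """Project both modalities into a shared space, L2-normalize so
    cross-modal similarity becomes a cosine score, and concatenate
    the aligned tokens for the downstream language model."""
    ts_tok = ts_feat @ W_ts                                    # (T, D_SHARED)
    img_tok = img_feat @ W_img                                 # (P, D_SHARED)
    ts_tok /= np.linalg.norm(ts_tok, axis=-1, keepdims=True)
    img_tok /= np.linalg.norm(img_tok, axis=-1, keepdims=True)
    return np.concatenate([ts_tok, img_tok], axis=0)           # (T+P, D_SHARED)

tokens = encode_and_align(rng.normal(size=(10, D_TS)),
                          rng.normal(size=(49, D_IMG)))
print(tokens.shape)  # (59, 128)
```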
- [Sep 2025] GEM has been accepted to NeurIPS 2025! More updates coming soon.
- [Jul 2025] The full version of MIMIC-IV-ECG with beat-level features and GPT-4o interpretations has been released — check it out here!
- [Mar 2025] GEM-7B and ECG-Grounding-30k are now available.
We will continue to release more ECG-Grounding data and associated beat-level features progressively.
Stay tuned for updates!
Project Page: 📖 Page
Paper: 📄 Arxiv
Model: 🤗 GEM
Data: 🤗 ECG-Grounding
git clone https://github.com/lanxiang1017/GEM.git
bash GEM/setup.sh
Please download required data:
ECG:
Images:
- ECG-Grounding-Images (mimic_gen)
- ECG-Instruct
- ECG-Bench
After downloading all of them, organize the data as follows under ./data:
├── ecg_timeseries
└── champan-shaoxing
└── code15
└── cpsc2018
└── ptbxl
└── georgia
└── mimic-iv
├── ecg_images
└── cod15_v4
└── csn_aug_all_layout_papersize
└── csn_ori_layout_papersize
└── csn_part_noise_layout_papersize
└── gen_images
└── mimic_gen
└── mimic
└── mimic_v4
└── ptb-xl
├── ecg_bench
└── images
└── ecg-grounding-test-mimiciv.json
└── ecg-grounding-test-ptbxl.json
├── ecg_jsons
└── ECG_Grounding_30k.json
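A quick way to verify the layout above before training is to spot-check a few of the expected paths. This is a convenience sketch, not part of the repo; the entries are taken verbatim from the tree, and `DATA_ROOT` is an assumption you should adjust if your data lives elsewhere.

```python
from pathlib import Path

# Root of the layout shown above; adjust if needed.
DATA_ROOT = Path("./data")

# A representative sample of the expected entries (names copied
# verbatim from the directory tree).
EXPECTED = [
    "ecg_timeseries/ptbxl",
    "ecg_images/mimic",
    "ecg_bench/images",
    "ecg_jsons/ECG_Grounding_30k.json",
]

# Report anything that is not where the layout says it should be.
missing = [p for p in EXPECTED if not (DATA_ROOT / p).exists()]
for p in missing:
    print(f"missing: {DATA_ROOT / p}")
```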
Pretrained ECG Encoder:
- ECG-CoCa: place it in GEM/ecg_coca/open_clip/checkpoint
Pretrained MLLMs:
For training from scratch:
- step 1. specify paths in GEM/scripts/train_gem.sh
- step 2. run: bash GEM/scripts/train_gem.sh
For ECG-Grounding:
- step 1. generate interpretations: GEM/evaluation/gem_bench/bench_ecggrounding.sh
- step 2. process interpretations: GEM/gem_evaluation/process_gem_outputs.ipynb
- step 3. generate GPT evaluation reports: GEM/gem_evaluation/generate_gpt_eval.py
- step 4. process evaluation reports and get scores: GEM/gem_evaluation/process_grounding_scores.ipynb
For ECG-Bench:
- step 1. generate results: GEM/evaluation/gem_bench/bench_ecgbench.sh
- step 2. evaluate results: GEM/evaluation/evaluate_ecgbench.py
- step 3. evaluate reports: GEM/evaluation/eval_report.py
Note
- You need to specify the result paths in all evaluation scripts (for ECG-Bench, you also need to specify the path to the question files in evaluate_ecgbench.py).
- If you download our trained GEM-7B model from HuggingFace, you must set the path to ECG-CoCa in the config.json file (under "mm_ecg_tower") before using it.
- bench_ecggrounding.sh is designed to use multiple GPUs to generate interpretations simultaneously, reducing generation time. To use it, you must split the test file (ecg-grounding-test-mimiciv.json) into multiple chunks. If you prefer a simpler setup, you can use bench_ecgbench.sh instead; the core generation functions are the same. Example usage:
bash bench_ecgbench.sh -m PATH_TO_GEM -d ecg-grounding-test-mimiciv.
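Splitting the test file into per-GPU chunks, as the multi-GPU note requires, can be done with a short script like the one below. This is a minimal sketch, not repo code: the function name, chunk-file naming scheme, and output directory are all our own choices, and the only assumption about the test file is that it is a JSON array of records.

```python
import json
import math
from pathlib import Path

def split_into_chunks(src: str, n_chunks: int, out_dir: str = ".") -> list:
    """Split a JSON array of test records into n_chunks shard files
    (one per GPU worker). Returns the paths of the shards written."""
    records = json.loads(Path(src).read_text())
    size = math.ceil(len(records) / n_chunks)  # records per shard
    paths = []
    for i in range(n_chunks):
        shard = records[i * size:(i + 1) * size]
        out = Path(out_dir) / f"{Path(src).stem}_chunk{i}.json"
        out.write_text(json.dumps(shard))
        paths.append(out)
    return paths
```

Each worker invocation of bench_ecggrounding.sh would then be pointed at one of the resulting chunk files.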
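Setting the "mm_ecg_tower" entry mentioned in the note above can be done by hand or with a small script like this. The key name "mm_ecg_tower" and the checkpoint location come from this README; the demo writes a stand-in config.json to a temporary directory so it is runnable anywhere, whereas in practice you would edit the config.json shipped with the downloaded GEM-7B model instead.

```python
import json
import tempfile
from pathlib import Path

# Demo setup: create a stand-in config.json in a temp directory.
# In practice, point config_path at GEM-7B's own config.json.
tmp = Path(tempfile.mkdtemp())
config_path = tmp / "config.json"
config_path.write_text(json.dumps({"model_type": "gem"}))  # minimal stand-in

# Set the ECG-CoCa path under "mm_ecg_tower" and write the config back.
cfg = json.loads(config_path.read_text())
cfg["mm_ecg_tower"] = "GEM/ecg_coca/open_clip/checkpoint"
config_path.write_text(json.dumps(cfg, indent=2))

print(json.loads(config_path.read_text())["mm_ecg_tower"])
```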
If you find GEM helpful for your research and applications, please cite our paper:
@misc{lan2025gemempoweringmllmgrounded,
title={GEM: Empowering MLLM for Grounded ECG Understanding with Time Series and Images},
author={Xiang Lan and Feng Wu and Kai He and Qinghao Zhao and Shenda Hong and Mengling Feng},
year={2025},
eprint={2503.06073},
archivePrefix={arXiv},
primaryClass={cs.CL},
url={https://arxiv.org/abs/2503.06073},
}
We thank the authors of PULSE and ECG-Chat for their publicly released models, datasets, and training code.
