More details can be found in Project page.
- [2026.5] π€© Our training dataset GUI-C2-4K released on HuggingFace.
- [2026.5] π€© Code for difficulty scoring released.
We open-source our 3B model training dataset first, as we believe it offers substantial value to future research in this field. If you find our dataset, difficulty design, and score calculation helpful to your work, please consider citing our paper.
Our dataset is sourced from:
First, you need to conduct an 8-click test (using 8 rollouts as an example) on the data sources and base model you intend to use. Retain the raw output coordinates of the eight clicks, and pre-filter out samples where all 8 clicks are correct or incorrect.
cd /share/home/junlong_li/GUI-C2-main
python build_train_set_diff.py --input filter_rawoutput.json --output train_set_diff.json
We sincerely thank the authors of GUI-G1, OS-Atlas, SeeClick and UI-Bert for their open-source contributions.
@misc{li2026guic2coarsetofineguigrounding,
title={GUI-C$^2$: Coarse-to-Fine GUI Grounding via Difficulty-Aware Reinforcement Learning},
author={Junlong Li and Chao Hao and Lap-Pui Chau and Yi Wang},
year={2026},
eprint={2605.30884},
archivePrefix={arXiv},
primaryClass={cs.CV},
url={https://arxiv.org/abs/2605.30884},
}