GitHub - Dog-Yang/ConInfer: CVPR 2026 (Findings)-ConInfer: Context-Aware Inference for Training-Free Open-Vocabulary Remote Sensing Segmentation

This repository contains the official implementation of the CVPR 2026 Findings paper:
"ConInfer: Context-Aware Inference for Training-Free Open-Vocabulary Remote Sensing Segmentation"

📄 Overview

Abstract

Training-free open-vocabulary remote sensing segmentation (OVRSS), empowered by vision-language models, has emerged as a promising paradigm for achieving category-agnostic semantic understanding in remote sensing imagery. Existing approaches mainly focus on enhancing feature representations or mitigating modality discrepancies to improve patch-level prediction accuracy. However, such independent prediction schemes are fundamentally misaligned with the intrinsic characteristics of remote sensing data. In real-world applications, remote sensing scenes are typically large-scale and exhibit strong spatial as well as semantic correlations, making isolated patch-wise predictions insufficient for accurate segmentation. To address this limitation, we propose \textbf{ConInfer}, a context-aware inference framework for OVRSS that performs joint prediction across multiple spatial units while explicitly modeling their inter-unit semantic dependencies. By incorporating global contextual cues, our method significantly enhances segmentation consistency, robustness, and generalization in complex remote sensing environments. Extensive experiments on multiple benchmark datasets demonstrate that our approach consistently surpasses state-of-the-art per-pixel VLM-based baselines such as SegEarth-OV, achieving average improvements of 2.80% and 6.13% on open-vocabulary semantic segmentation and object extraction tasks, respectively. All source codes will be publicly released upon the paper’s acceptance.

🚀 Installation

Clone the repository and install required packages.

git clone https://github.com/Dog-Yang/ConInfer.git
cd ConInfer
conda create -n ConInfer python=3.11
pip install -r requirements.txt

📂 Datasets

We include the following dataset configurations in this repo:

Semantic Segmentation: OpenEarthMap, LoveDA, iSAID, Potsdam, Vaihingen, UAVid^img, UDD5, VDD
Building Extraction: WHU^Aerial, WHU^Sat.Ⅱ, Inria, xBD^pre
Road Extraction: CHN6-CUG, DeepGlobe, Massachusetts, SpaceNet
Water Extraction: WBS-SI

Please refer to dataset_prepare.md for dataset preparation.

🦖 evaluation

Multi-GPU:

bash ./dist_test.sh ./config/cfg_DATASET.py

Results will be saved in results.xlsx.

🙏 Acknowledgments

Thanks to @likyoo for their awesome SegEarth.
Thanks to @facebookresearch for their awesome DINOV3.

Name		Name	Last commit message	Last commit date
Latest commit History 12 Commits
BLIP		BLIP
SimFeatUp		SimFeatUp
configs_ConInfer		configs_ConInfer
configs_baseline		configs_baseline
configs_segov		configs_segov
dinov3		dinov3
gem		gem
open_clip		open_clip
prompts		prompts
simfeatup_dev		simfeatup_dev
tools/dataset_converters		tools/dataset_converters
.gitignore		.gitignore
ConInfer_segmentor.py		ConInfer_segmentor.py
README.md		README.md
custom_datasets.py		custom_datasets.py
dataset_prepare.md		dataset_prepare.md
dist_test.sh		dist_test.sh
eval.py		eval.py
frame.jpg		frame.jpg
gmm.py		gmm.py
requirements.txt		requirements.txt
utils.py		utils.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

📄 Overview

Abstract

🚀 Installation

📂 Datasets

🦖 evaluation

🙏 Acknowledgments

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

📄 Overview

Abstract

🚀 Installation

📂 Datasets

🦖 evaluation

🙏 Acknowledgments

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages