Connecting Domains and Contrasting Samples: A Ladder for Domain Generalization (DCCL)


Official implementation of "Connecting Domains and Contrasting Samples: A Ladder for Domain Generalization" (KDD 2025).

📋 Overview

DCCL (Domain-Connecting Contrastive Learning) is a domain generalization approach that combines multiple complementary loss components to learn representations that remain robust across domains. The algorithm integrates the following components (a sketch of the contrastive piece follows the list):

  • Cross-entropy loss for standard classification
  • Contrastive loss between aggressively augmented views, for learning domain-invariant representations
  • Layer-wise contrastive loss, for aligning intermediate features with the pre-trained model
  • Generative alignment regularization, for generatively aligning features with pre-trained knowledge
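For concreteness, here is a minimal sketch of the sample-contrastive component (second bullet): an NT-Xent-style loss over two augmented views. The function name and exact formulation are assumptions for illustration; the repository's actual implementation lives in DCCL/domainbed/algorithms/algorithms.py.

import torch
import torch.nn.functional as F

# Illustrative NT-Xent-style loss between two augmented views; not the
# repository's exact implementation.
def nt_xent(z1, z2, temperature=0.1):
    # z1, z2: (N, D) projections of two augmented views of the same batch.
    z1, z2 = F.normalize(z1, dim=1), F.normalize(z2, dim=1)
    z = torch.cat([z1, z2], dim=0)            # (2N, D)
    sim = z @ z.t() / temperature             # pairwise cosine similarities
    sim.fill_diagonal_(float("-inf"))         # never match a sample to itself
    n = z1.size(0)
    # The positive for row i is the other view of the same image.
    targets = torch.cat([torch.arange(n) + n, torch.arange(n)]).to(z.device)
    return F.cross_entropy(sim, targets)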

🏗️ Code Structure

data/
DCCL/
├── train_all.py                    # Main training script
├── config.yaml                     # Configuration file
├── domainbed/
│   ├── algorithms/
│   │   └── algorithms.py           # 🔥 Core DCCL algorithm implementation
│   ├── lib/
│   │   └── cl_hparams.py           # 🔥 Core hyperparameter settings
│   ├── datasets/                   # Dataset loaders
│   ├── networks.py                 # Network architectures
│   ├── trainer.py                  # Training loop
│   └── ...
└── ...

Key Files:

  • DCCL/domainbed/algorithms/algorithms.py: Contains the main DCCL algorithm with detailed comments explaining each loss component
  • DCCL/domainbed/lib/cl_hparams.py: Core hyperparameter configurations for different datasets

🚀 Algorithm Flow

The DCCL algorithm follows this training procedure (a sketch of one training step follows the list):

  1. Data Preparation: Load original and augmented image pairs
  2. Feature Extraction: Extract features using trainable and frozen pre-trained networks
  3. Multi-Loss Computation:
    • Classification loss (always active)
    • Contrastive loss (controlled by --l)
    • Domain alignment loss (controlled by --l_d)
    • Layer-wise contrastive loss (controlled by --l_layer)
  4. Optimization: Multi-component loss backpropagation with different learning rates
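The following is a hedged sketch of how one such training step might be wired together. It reuses the nt_xent sketch from the Overview; the weight names mirror the --l, --l_d, and --l_layer flags documented below, and featurizer, classifier, and frozen are placeholder modules, not the repository's API.

# Hypothetical single training step illustrating the flow above; see
# domainbed/trainer.py and domainbed/algorithms/algorithms.py for the
# real implementation.
def train_step(featurizer, classifier, frozen, x, x_aug, y, opt, hp):
    z, z_aug = featurizer(x), featurizer(x_aug)   # trainable features of both views
    with torch.no_grad():
        z_pre = frozen(x)                         # frozen pre-trained features

    loss = F.cross_entropy(classifier(z), y)                  # classification (always active)
    loss = loss + hp["l"] * nt_xent(z, z_aug, hp["t"])        # contrastive (--l)
    loss = loss + hp["l_d"] * nt_xent(z, z_pre, hp["t_pre"])  # alignment (--l_d)
    # A layer-wise term weighted by hp["l_layer"] (--l_layer) would add
    # analogous losses on intermediate features; omitted for brevity.

    opt.zero_grad()
    loss.backward()
    opt.step()
    return loss.item()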

⚙️ Core Hyperparameters

The main tuning parameters are located in DCCL/domainbed/lib/cl_hparams.py; a toy sketch of how such per-dataset defaults can be organized appears after the list.

Essential Parameters (main tuning focus):

  • --l: Weight for contrastive loss (default: 1.0)
  • --l_d: Weight for domain alignment loss (default: 0.05)
  • --l_layer: Weight for layer-wise contrastive loss (default: 1.0)
  • --t: Temperature for contrastive loss (default: 0.1)
  • --t_pre: Temperature for pre-trained feature loss (default: 0.2)
  • --n_layer: Number of layers in projection head (default: 1)
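As a rough guess at the shape (not the contents) of cl_hparams.py, per-dataset defaults like these are commonly kept in a dict keyed by dataset name. The values below are just the defaults listed above, and the commented-out PACS override is invented for illustration.

# Illustrative layout only; consult DCCL/domainbed/lib/cl_hparams.py for the
# actual per-dataset values.
DEFAULTS = {"l": 1.0, "l_d": 0.05, "l_layer": 1.0, "t": 0.1, "t_pre": 0.2, "n_layer": 1}

PER_DATASET = {
    "OfficeHome": {},        # no overrides: fall back to DEFAULTS
    # "PACS": {"t": 0.07},   # hypothetical override, for illustration only
}

def get_hparams(dataset):
    hp = dict(DEFAULTS)
    hp.update(PER_DATASET.get(dataset, {}))
    return hp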

🛠️ Installation

Environment Requirements

Python: 3.6+
PyTorch: 1.9.0 (the release matching Torchvision 0.10.0)
Torchvision: 0.10.0
CUDA: 10.2
cuDNN: 7.6.5 (reported by PyTorch as 7605)
NumPy: 1.19.5
Pillow (PIL): 7.2.0
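To verify that the installed versions roughly match the table above, a quick check such as the following can be run:

import torch, torchvision, numpy, PIL

print("PyTorch:", torch.__version__)
print("Torchvision:", torchvision.__version__)
print("CUDA available:", torch.cuda.is_available())
print("cuDNN:", torch.backends.cudnn.version())  # e.g. 7605 for cuDNN 7.6.5
print("NumPy:", numpy.__version__)
print("Pillow (PIL):", PIL.__version__)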

Setup

Clone the repository and install the dependencies:

git clone https://github.com/weitianxin/DCCL.git
cd DCCL/
pip install -r requirements.txt

📁 Data Preparation

Each dataset can be obtained from its official source; for example, the VLCS dataset is available from its official repository.

To download all datasets automatically:

python download.py --data_dir data

🏃‍♂️ Running Experiments

Basic Usage

Navigate to the DCCL directory and run the command below; --algorithm defaults to DCCL, so it does not need to be passed explicitly:

cd DCCL/
python train_all.py DCCL_OH_0 --dataset OfficeHome --deterministic --trial_seed 0 --checkpoint_freq 100 --data_dir ../data

Multiple Seeds

python train_all.py DCCL_OH_0 --dataset OfficeHome --deterministic --trial_seed 0 --checkpoint_freq 100 --data_dir ../data
python train_all.py DCCL_OH_1 --dataset OfficeHome --deterministic --trial_seed 1 --checkpoint_freq 100 --data_dir ../data
python train_all.py DCCL_OH_2 --dataset OfficeHome --deterministic --trial_seed 2 --checkpoint_freq 100 --data_dir ../data

Different Configurations

Different Backbone Models:

# CLIP ViT-B/16
python train_all.py DCCL_OH_vit --dataset OfficeHome --deterministic --trial_seed 2 --checkpoint_freq 100 --data_dir ../data --model clip_vit-b16

# RegNet
python train_all.py DCCL_OH_reg --dataset OfficeHome --deterministic --trial_seed 2 --checkpoint_freq 100 --data_dir ../data --model regnet

Limited Labeled Data:

# 10% labeled data
python train_all.py DCCL_OH_res50_0.1 --dataset OfficeHome --deterministic --trial_seed 2 --checkpoint_freq 100 --data_dir ../data --label_ratio 0.1

Different Datasets:

# PACS
python train_all.py DCCL_PACS_0 --dataset PACS --deterministic --trial_seed 0 --checkpoint_freq 100 --data_dir ../data

# VLCS  
python train_all.py DCCL_VLCS_0 --dataset VLCS --deterministic --trial_seed 0 --checkpoint_freq 100 --data_dir ../data

# TerraIncognita
python train_all.py DCCL_TI_0 --dataset TerraIncognita --deterministic --trial_seed 0 --checkpoint_freq 100 --data_dir ../data

📊 Results

Training outputs are saved under DCCL/train_output/[DATASET]/[EXPERIMENT_NAME]/ and contain:

  • Training logs
  • Evaluation results

🙏 Acknowledgments

This codebase builds heavily upon the excellent SWAD framework. We gratefully acknowledge their foundational work in domain generalization research.

📖 Citation

If you find this work helpful, please cite:

@inproceedings{wei2025connecting,
  title={Connecting domains and contrasting samples: A ladder for domain generalization},
  author={Wei, Tianxin and Chen, Yifan and He, Xinrui and Bao, Wenxuan and He, Jingrui},
  booktitle={Proceedings of the 31st ACM SIGKDD Conference on Knowledge Discovery and Data Mining V. 1},
  pages={1563--1574},
  year={2025}
}
