Overview | Installation | Quick Start | Citation
Official implementation of UniGame, a self-adversarial post-training framework for Unified Multimodal Models (UMMs).

UniGame is the first self-adversarial post-training framework to improve the consistency between the understanding and generation pathways of a UMM. By treating the generation pathway as an active adversary, UniGame lets the model discover and correct its own inconsistencies.
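The core idea can be sketched in a few lines of PyTorch. This is an illustrative toy, not the official training code: the module shapes, the latent-space perturbation, and the step size `0.05` are all our assumptions. The "adversary" step finds a small latent perturbation that maximizes the disagreement between the two pathways; the model is then updated to stay consistent even at that worst-case point.

```python
# Toy sketch of self-adversarial consistency training (NOT the repo's code).
import torch
import torch.nn.functional as F

torch.manual_seed(0)

encoder = torch.nn.Linear(16, 8)         # stand-in for the shared backbone
understand_head = torch.nn.Linear(8, 4)  # understanding pathway (toy)
generate_head = torch.nn.Linear(8, 4)    # generation pathway (toy)

params = (list(encoder.parameters())
          + list(understand_head.parameters())
          + list(generate_head.parameters()))
opt = torch.optim.SGD(params, lr=0.1)

def consistency_loss(z):
    # Disagreement between the two pathways on the same latent.
    return F.mse_loss(understand_head(z), generate_head(z))

x = torch.randn(32, 16)
losses = []
for step in range(50):
    z = encoder(x)

    # Inner (adversarial) step: perturb a detached copy of the latent in the
    # direction that maximizes inconsistency -- "generation as adversary".
    delta = torch.zeros_like(z.detach(), requires_grad=True)
    adv = consistency_loss(z.detach() + delta)
    (delta_grad,) = torch.autograd.grad(adv, delta)
    z_adv = z + 0.05 * delta_grad.sign()  # worst-case latent (assumed step size)

    # Outer step: update the model to minimize inconsistency there.
    loss = consistency_loss(z_adv)
    opt.zero_grad()
    loss.backward()
    opt.step()
    losses.append(loss.item())

first_loss, final_loss = losses[0], losses[-1]
print(f"consistency loss: {first_loss:.4f} -> {final_loss:.4f}")
```

On this toy setup the consistency loss shrinks over the 50 steps, i.e. the model learns to agree with itself even under adversarial perturbations.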
- Python >= 3.8
- PyTorch >= 2.0
- CUDA >= 11.8 (recommended)
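If you already have an environment, you can sanity-check it against these requirements with a short snippet (our helper, not part of the repo):

```python
# Verify that the current Python and PyTorch satisfy the requirements above.
import sys
import torch

assert sys.version_info >= (3, 8), "Python >= 3.8 required"
torch_major = int(torch.__version__.split(".")[0])
assert torch_major >= 2, "PyTorch >= 2.0 required"
# CUDA is recommended but not strictly required:
print("torch", torch.__version__, "| CUDA available:", torch.cuda.is_available())
```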
```bash
# Clone the repository
git clone https://github.com/AIFrontierLab/UniGame.git
cd UniGame

# Create conda environment
conda create -n unigame python=3.11 -y
conda activate unigame

# Install dependencies
pip install -r requirements.txt
```

Download the VQAv2 dataset and update the path in `main.py`:
```python
LOCAL_VQAV2 = "/path/to/your/vqav2"
```

Single GPU:

```bash
python main.py
```

Multi-GPU (DDP):

```bash
torchrun --nproc_per_node=4 main.py
```

SLURM Cluster:

```bash
srun --gres=gpu:4 --cpus-per-task=16 torchrun --nproc_per_node=4 main.py
```

If you find this work useful, please cite:
```bibtex
@inproceedings{Su2025UniGameTA,
  title={UniGame: Turning a Unified Multimodal Model Into Its Own Adversary},
  author={Zhaolong Su and Wang Lu and Hao Chen and Sharon Li and Jindong Wang},
  year={2025},
  url={https://api.semanticscholar.org/CorpusID:283244819}
}
```

We thank Dr. Ziyue Xu from NVIDIA for his insightful discussions and valuable comments on this project. We also thank the authors of Janus-Pro and the other open-source projects that made this work possible.
This project is licensed under the MIT License - see the LICENSE file for details.
For questions or issues, please open an issue or contact:
- Zhaolong Su: zsu05@wm.edu





