Control-GIC

This is the Pytorch implementation of the paper "Once-for-All: Controllable Generative Image Compression with Dynamic Granularity Adaption". We design a unified generative compression model (Control-GIC) capable of variable bitrate adaption across a broad spectrum while preserving high-perceptual fidelity reconstruction. Control-GIC allows one model for variable bitrates and once compression on an entire dataset for constrained bitrate conditions.

✨ Model Architecture

🔓 Installation

git clone https://github.com/lianqi1008/Control-GIC.git
cd Control-GIC
conda create -n CGIC python=3.10.4
conda activate CGIC
pip install -r requirements.txt

🚀 Usage

The model weight can be downloaded from GoogleDrive and BaiduNetDisk.

Train

We use OpenImages-v6 dataset and randomly crop images to a uniform 256 × 256 resolution.

python main.py --config configs/config_train.yaml

Inference

We use Kodak and high-resolution DIV2K dataset to evaluate the model.

If you want a wide range of bpp (0.1 ~ 0.75), we recommend you start with the Granularity Ratio in [(0, 0.5, 0.5), (0, 0.8, 0.2), (0.1, 0.8, 0.1), (0.3, 0.6, 0.1), (0.5, 0.4, 0.1), (0.9, 0.1, 0)], where each element represents fine-grained, medium-grained, and coarse-grained ratio from left to right, and fine-tune the ratio values based on these ratios to get a finer bpp.

python inference.py -i input_dir -o output_dir -w

Note that when tuning the ratio to change the bpp, the fine-grained priority is higher than the medium-grained priority, which is higher than the coarse-grained priority. This ensures the best reconstruction quality.

💡 Special Capabilities

Fine control of bitrate

By fine-tuning the granularity ratio, the model is able to make fine adjustments to the bitrate (Evaluated on the Kodak dataset).

Granularity Ratio	Bpp	LPIPS
(0.301, 0.599, 0.1)	0.38925	0.03027
(0.302, 0.598, 0.1)	0.38983	0.03025
(0.303, 0.597, 0.1)	0.39050	0.03020
(0.304, 0.596, 0.1)	0.39108	0.03014

Constant bitrate

Once the granularity ratio is fixed, the bpp values of each image in the dataset are very close to each other, thus easily fulfilling the constrained bandwidth and bitrate conditions. For example, in one compression on the Kodak dataset, we set the ratio to (0.1, 0.8, 0.1), and obtained 24 images with bpp ranging from 0.262 to 0.277, with only slight variations.

Cite

@misc{li2024onceforall,
      title={Once-for-All: Controllable Generative Image Compression with Dynamic Granularity Adaption}, 
      author={Anqi Li and Yuxi Liu and Huihui Bai and Feng Li and Runmin Cong and Meng Wang and Yao Zhao},
      year={2024},
      eprint={2406.00758},
      archivePrefix={arXiv},
      primaryClass={eess.IV}
}

Thanks

The code references VQGAN, MoVQGAN, DQ-VAE and Huffman coding from Bhrigu Srivastava@bhrigu123. Thanks for these excellent works!

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Control-GIC

✨ Model Architecture

🔓 Installation

🚀 Usage

💡 Special Capabilities

Cite

Thanks

About

Releases

Packages

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 15 Commits
CGIC		CGIC
configs		configs
figs		figs
README.md		README.md
inference.py		inference.py
main.py		main.py
requirements.txt		requirements.txt

lianqi1008/Control-GIC

Folders and files

Latest commit

History

Repository files navigation

Control-GIC

✨ Model Architecture

🔓 Installation

🚀 Usage

💡 Special Capabilities

Cite

Thanks

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages