MegaStyle: Constructing Diverse and Scalable Style Dataset via Consistent Text-to-Image Style Mapping
MegaStyle is a novel and scalable data curation pipeline that is the first to exploit the consistent T2I style mapping ability of current large generative models to construct an intra-style-consistent, inter-style-diverse, and high-quality style dataset.
Your star is our fuel! We're revving up the engines with it! Check out our project page for more visual results!
- A more diverse and larger-scale style dataset.
MegaStyle-1.4M is a large-scale style dataset built through a scalable pipeline that leverages consistent text-to-image style mapping of Qwen-Image. It combines 170K curated style prompts with 400K content prompts to generate 1.4M high-quality images that share strong intra-style consistency while covering diverse fine-grained styles.
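The crossing of style prompts with content prompts can be sketched as follows. This is an illustrative toy, not MegaStyle's actual code: the prompt lists, the `make_t2i_prompts` helper, and the prompt template are all assumptions standing in for the 170K curated style prompts and 400K content prompts.

```python
# Illustrative sketch: crossing style prompts with content prompts so that
# images generated with the same style prompt share intra-style consistency.
from itertools import product

style_prompts = ["ukiyo-e woodblock print", "low-poly 3D render"]   # stands in for 170K curated styles
content_prompts = ["a cat on a windowsill", "a lighthouse at dusk"]  # stands in for 400K content prompts

def make_t2i_prompts(styles, contents):
    """Pair every style with every content prompt (hypothetical template)."""
    return [f"{content}, in the style of {style}"
            for style, content in product(styles, contents)]

prompts = make_t2i_prompts(style_prompts, content_prompts)
print(len(prompts))  # 2 styles x 2 contents = 4 prompts
```

At full scale, the same cross-product idea pairs each curated style with many content prompts, which is how a comparatively small set of style prompts yields 1.4M images.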

Trained on MegaStyle-1.4M, MegaStyle-FLUX and MegaStyle-Encoder are our models for generalizable style transfer and reliable style similarity measurement.
git clone git@github.com:Tencent/MegaStyle.git
cd ./MegaStyle
conda create -n megastyle python=3.10
conda activate megastyle
pip install diffsynth==1.1.8
- Download the pretrained models of SigLIP and FLUX.1-dev.
- Place the downloaded models under ./models/.
For image style transfer, we provide 50 reference style images from StyleBench in ./ref_styles:
python inference.py --ckpt_path models/megastyle_flux.safetensors --ref_path ./ref_styles
For computing style score:
python style_score.py --ckpt_path models/megastyle_encoder.pth --real_image_path <path/to/image.png> --fake_image_path <path/to/image.png>
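Under the hood, a style score of this kind reduces to comparing style embeddings of the two images. The sketch below illustrates the metric with cosine similarity on toy vectors; the vectors and the `style_score` helper are assumptions, and the real pipeline would substitute MegaStyle-Encoder features for them.

```python
# Minimal sketch of a style score as cosine similarity between style embeddings.
# The toy vectors below stand in for encoder outputs; this only illustrates the metric.
import numpy as np

def style_score(emb_a: np.ndarray, emb_b: np.ndarray) -> float:
    """Cosine similarity in [-1, 1]; higher means more similar styles."""
    a = emb_a / np.linalg.norm(emb_a)
    b = emb_b / np.linalg.norm(emb_b)
    return float(a @ b)

ref = np.array([0.6, 0.8, 0.0])  # toy embedding of the real (reference) image
gen = np.array([0.6, 0.8, 0.0])  # toy embedding of the generated image
print(round(style_score(ref, gen), 3))  # identical embeddings -> 1.0
```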
To train a style transfer model with paired supervision, please download our style dataset, MegaStyle-1.4M, and start training with:
bash FLUX.1-dev.sh # FLUX.1-dev-npu.sh for npu
All assets and code are released under the repository's license unless specified otherwise.
If this work is helpful for your research, please consider citing the following BibTeX entry.
@article{gao2026megastyle,
title={MegaStyle: Constructing Diverse and Scalable Style Dataset via Consistent Text-to-Image Style Mapping},
author={Gao, Junyao and Liu, Sibo and Li, Jiaxing and Sun, Yanan and Tu, Yuanpeng and Shen, Fei and Zhang, Weidong and Zhao, Cairong and Zhang, Jun},
journal={arXiv preprint arXiv:2604.08364},
year={2026}
}
The code is built upon DiffSynth-Studio.
