CF-Font

By Chi Wang, Min Zhou, Tiezheng Ge, Yuning Jiang, Hujun Bao, Weiwei Xu*

This repo is the official implementation of CF-Font: Content Fusion for Few-shot Font Generation (CVF:Paper link; arXiv:2303.14017) accepted by CVPR 2023.

Video demos for Style Interpolation

A poem demo

cffont_style_interpolation_part2_small.mp4

Comparison with DG-Font

cffont_style_interpolation_part1_small.mp4

Dependencies

Libarary

pytorch (>=1.0)
tqdm
numpy
opencv-python  
scipy
sklearn
matplotlib  
pillow  
tensorboardX
scikit-image
scikit-learn
pytorch-fid
lpips
pandas
kornia

DCN

please refer to https://github.com/chengdazhi/Deformable-Convolution-V2-PyTorch/tree/pytorch_1.0.0 to install the dependencies of deformable convolution.

Dataset

方正字库 provides free font download for non-commercial users.

How to run

prepare dataset

Put your font files to a folder and character file to charset

.
├── data
│   └── fonts
│       ├── Font_Seen240
│       │   ├── 000_xxxx.ttf
│       │   ├── 001_xxxx.ttf
│       │   ├── ...
│       │   └── 239_xxxx.ttf
│       └── Font_Unseen60
├── charset
│   ├── check_overlap.py
│   ├── GB2312_CN6763.txt # all characters used in TRAIN and TEST
│   ├── FS16.txt          # Few-Shot characters, should not be included in TESTxxx.txt for fairness (can be included in TRAINxxx.txt) 
│   ├── TEST5646.txt      # all characters used in TEST
│   └── TRAIN800.txt      # all characters used in TRAIN
└── ...

Generate dataset with the full standard Chinese character set (6763 in total) of GB/T 2312 :

sh scripts/01a_gen_date.sh

After that, your file tree should be:

.
├── data
│   ├── fonts
│   └── imgs
│       ├── Seen240_S80F50_FULL
│       │   ├── id_0
│       │   │   ├── 0000.png
│       │   │   ├── 0001.png
│       │   │   ├── ...
│       │   │   └── 6762.png
│       │   ├── id_1
│       │   ├── ...
│       │   └── id_239
│       └── Unseen60_S80F50_FULL
├── charset
└── ...

Get subsets with train, test and fewshot character txts.

sh scripts/01b_copy_subset.sh

After that, your file tree should be:

.
├── data
│   ├── fonts
│   └── imgs
│       ├── Seen240_S80F50_FS16
│       │   ├── id_0
│       │   │   ├── 0000.png
│       │   │   ├── 0001.png
│       │   │   ├── ...
│       │   │   └── 0015.png
│       │   ├── id_1
│       │   ├── ...
│       │   └── id_239
│       ├── Seen240_S80F50_FULL
│       ├── Seen240_S80F50_TEST5646
│       │   ├── id_0
│       │   │   ├── 0000.png
│       │   │   ├── 0001.png
│       │   │   ├── ...
│       │   │   └── 5645.png
│       │   ├── id_1
│       │   ├── ...
│       │   └── id_239
│       ├── Seen240_S80F50_TRAIN800
│       │   ├── id_0
│       │   │   ├── 0000.png
│       │   │   ├── 0001.png
│       │   │   ├── ...
│       │   │   └── 0799.png
│       │   ├── id_1
│       │   ├── ...
│       │   └── id_239
│       ├── Unseen60_S80F50_FS16
│       ├── Unseen60_S80F50_FULL
│       ├── Unseen60_S80F50_TEST5646
│       └── Unseen60_S80F50_TRAIN800
├── charset
└── ...

Train base network
```
# enable PC-WDL with the flag `--wdl` and PC-PKL with the flag `-pkl` 
sh scripts/02a_run_ddp.sh
```
Option: In order to evaluate the training of the network, we can use the script scripts/option_run_inf_dgfont.sh to inference.

Train CF-Font

Select basis. Basis fonts better contain a standard font, like song.
- Manually. If you want select manually, please put basis ids (one line, seperated with a space) to a txt file, like:
```
0 1 2 3 4 5 6 7 8 9
```
- By clustering.
```
# Content embeddings collection 
sh scripts/03a_get_content_embeddings.sh

# obtain basis ids through clustering
sh scripts/03b_cluster_get_cf_basis.sh
```
Get subsets with basis font ids.
```
sh scripts/03c_copy_basis_subset.sh
```
Get basis weight for content fusion:
```
sh scripts/03d_cal_cf_weights.sh
```

Train CF-Font

# make a folder for CF-Font training, and copy the pretrain model here.
sh scripts/03e_init_cf_env.sh

# train CF-Font
sh scripts/03f_run_ddp_cf.sh

Inference and evaluation:

# Inference (SII with the flag `--ft`)
sh scripts/04a_run_inf_cf.sh

# Evaluation
## get scores for each font
sh scripts/04b_get_scores.sh

## get mean scores (use `-j` to skip the unwanted fonts, like basis fonts)
sh scripts/04c_cal_mean_scores.sh

Citation

@InProceedings{Wang_2023_CVPR,
    author    = {Wang, Chi and Zhou, Min and Ge, Tiezheng and Jiang, Yuning and Bao, Hujun and Xu, Weiwei},
    title     = {CF-Font: Content Fusion for Few-Shot Font Generation},
    booktitle = {Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)},
    month     = {June},
    year      = {2023},
    pages     = {1858-1867}
}

Acknowledgements

We would like to thank Alimama (Alibaba Group) and State Key Lab of CAD&CG (Zhejiang University) for their support and advices in our project. Our code is based on DG-Font.

Contact

If you have any questions, please feel free to contact wangchi1995@zju.edu.cn.

Name		Name	Last commit message	Last commit date
Latest commit History 9 Commits
charset		charset
datasets		datasets
eval		eval
functions		functions
models		models
modules		modules
scripts		scripts
src		src
tools		tools
train		train
validation		validation
.gitignore		.gitignore
README.md		README.md
cal_cf_weight.py		cal_cf_weight.py
collect_content_embeddings.py		collect_content_embeddings.py
font2img.py		font2img.py
inf_with_style_ft.py		inf_with_style_ft.py
inf_with_style_ft_cf.py		inf_with_style_ft_cf.py
main.py		main.py
oss_client.py		oss_client.py
requirements.txt		requirements.txt

wangchi95/CF-Font

Folders and files

Latest commit

History

Repository files navigation

CF-Font

Video demos for Style Interpolation

Dependencies

Libarary

DCN

Dataset

How to run

Citation

Acknowledgements

Contact

About

Resources

Stars

Watchers

Forks

Languages