Skip to content

techmn/rfop

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

2 Commits
 
 

Repository files navigation

Rethinking Fusion and Orthogonal Projection for Face-Voice Association (FAME 2026)

The paper is available arxiv

Overview

Architecture

image

Installation

Please follow the instructions here to make the environment and install the libraries.

Training

Use following command to train the model

python main.py --save_dir ./model --lr 2e-5 --batch_size 1024 --max_num_epoch 50 --dim_embed 256 \
--train_path_face <path_to_train_face_features> \
--train_path_voice <path_to_train_voice_features> \
--test_path_face <path_to_test_face_features> \
--test_path_voice <path_to_test_voice_features>

Testing

Use following command to test the trained model

python test.py --ckpt <path to checkpoint.pth.tar> --dim_embed 256 

Download model weights from here

Acknowledgements

The codebase is inspired from the FOP repository. We thank them for releasing their valuable codebase.

Citation

About

Rethinking Fusion and Orthogonal Projection for Face-Voice Association

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published