Rethinking Fusion and Orthogonal Projection for Face-Voice Association (FAME 2026)

The paper is available arxiv

Overview

Architecture

Installation

Please follow the instructions here to make the environment and install the libraries.

Training

Use following command to train the model

python main.py --save_dir ./model --lr 2e-5 --batch_size 1024 --max_num_epoch 50 --dim_embed 256 \
--train_path_face <path_to_train_face_features> \
--train_path_voice <path_to_train_voice_features> \
--test_path_face <path_to_test_face_features> \
--test_path_voice <path_to_test_voice_features>

Testing

Use following command to test the trained model

python test.py --ckpt <path to checkpoint.pth.tar> --dim_embed 256

Download model weights from here

Acknowledgements

The codebase is inspired from the FOP repository. We thank them for releasing their valuable codebase.

Name		Name	Last commit message	Last commit date
Latest commit History 2 Commits
readme.md		readme.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Rethinking Fusion and Orthogonal Projection for Face-Voice Association (FAME 2026)

Overview

Architecture

Installation

Training

Testing

Acknowledgements

Citation

About

Uh oh!

Releases

Packages

techmn/rfop

Folders and files

Latest commit

History

Repository files navigation

Rethinking Fusion and Orthogonal Projection for Face-Voice Association (FAME 2026)

Overview

Architecture

Installation

Training

Testing

Acknowledgements

Citation

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Packages