Train

Code for EFT: Expert Fusion Transformer for Voice-Face Representation Learning,ICME,2023

Requirements

pytorch==1.8.1
wandb==0.12.10

Dataset

Download dataset.zip from GoogleDrive (4GB) and unzip it to the project root. The folder structure is shown below:

dataset/
├── evals
│   ├── test_matching_1N.pkl
│   ├── test_matching_g.pkl
│   ├── test_matching.pkl
│   ├── test_retrieval.pkl
│   ├── test_verification_gn.pkl
│   ├── test_verification_g.pkl
│   ├── test_verification_n.pkl
│   ├── test_verification.pkl
│   └── valid_verification.pkl
├── features
└── info
    ├── name2movies.pkl
    ├── name2tracks.pkl
    └── train_valid_test_names.pkl

The dataset/features folder contains vast number of small files. We suggest placing the project on an SSD disk to prevent IO bottlenecks.

Train

python main.py for the default setting in our paper

python main.py --face_features=f_2plus1D_512 for only use the R2+1D expert

python main.py --bc_mode=sbc_3.0 for change the SBC std threshold to 3.0

use wandb to view the training process:

Create .wb_config.json file in the project root, using the following content:
```
{
  "WB_KEY": "Your wandb auth key"
}
```
add --dryrun=False to the training command, for example: python main.py --dryrun=False

Model Checkpoint

You can get the final model checkpoint in eft_checkpoint.zip.

Name		Name	Last commit message	Last commit date
Latest commit History 2 Commits
loaders		loaders
models		models
utils		utils
.gitignore		.gitignore
Readme.md		Readme.md
train.py		train.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Requirements

Dataset

Train

Model Checkpoint

About

Releases

Packages

Languages

my-yy/eft-icme2023

Folders and files

Latest commit

History

Repository files navigation

Requirements

Dataset

Train

Model Checkpoint

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages