Project for Introduction to Auditory-visual Information System -- audio and video matching under noise interference

records

report: https://www.overleaf.com/3393918221kzwhkjdscmfh

record: https://1drv.ms/x/s!Agcs68x5s4XGhngeyctknqAYWlr5?e=swsuEa

usage

data preparation

Extract the .zip file from , and move each feature file into the folder named after its extractor (<extractor_name>), like:

Train
├── afeat
│   ├── vggish-quant
│   └── <other extractor name>
├── audio
├── vfeat
│   └── <other extractor name>
└── video

Then run tools/generate_random_dataset.py to split the Train dataset into Train_Part (90%) and Dev (10%). Note that the script contains an absolute path, which you need to change to your own.
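The 90/10 split can be pictured with a minimal sketch. This is not the code in tools/generate_random_dataset.py. The function names split_ids and list_sample_ids, the dev_ratio and seed parameters, and the idea of identifying each sample by the stem of its feature file are all assumptions made for illustration.

```python
import random
from pathlib import Path


def list_sample_ids(feature_dir):
    """Collect sample ids from the file stems in one feature folder.

    Assumption: every sample has exactly one file in this folder,
    e.g. Train/afeat/vggish-quant/<sample_id>.<ext>.
    """
    return sorted(p.stem for p in Path(feature_dir).iterdir() if p.is_file())


def split_ids(ids, dev_ratio=0.1, seed=0):
    """Shuffle ids reproducibly and return (train_part_ids, dev_ids)."""
    ids = sorted(ids)              # fixed starting order, independent of the file system
    rng = random.Random(seed)      # local generator, so the global random state is untouched
    rng.shuffle(ids)
    n_dev = int(round(len(ids) * dev_ratio))
    return ids[n_dev:], ids[:n_dev]
```

Under these assumptions, calling split_ids(list_sample_ids("<your absolute path>/Train/afeat/vggish-quant")) would give the ids for Train_Part and Dev. Applying the same id lists to both afeat and vfeat keeps each audio feature paired with its video feature.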

To record results with wandb, also create a log folder, as shown below.

After the split, the directory tree should look like this:

log
data
└── Dataset
    ├── Dev
    │   ├── afeat
    │   │   └── vggish-quant
    │   └── vfeat
    │       └── resnet-101
    ├── Test
    │   ├── Clean
    │   │   ├── afeat
    │   │   │   └── vggish-quant
    │   │   ├── audio
    │   │   ├── vfeat
    │   │   │   └── resnet-101
    │   │   └── video
    │   └── Noise
    │       ├── afeat
    │       │   └── vggish-quant
    │       ├── audio
    │       ├── vfeat
    │       │   └── resnet-101
    │       └── video
    ├── Train
    │   ├── afeat
    │   │   └── vggish-quant
    │   ├── audio
    │   ├── vfeat
    │   │   └── resnet-101
    │   └── video
    └── Train_Part
        ├── afeat
        │   └── vggish-quant
        └── vfeat
            └── resnet-101
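Before training, it can help to confirm that the layout above is in place. The following is a minimal sketch, not part of the repository. The names EXPECTED_DIRS, missing_dirs and ensure_log_dir are hypothetical, and the list only covers the feature folders shown in the tree (vggish-quant and resnet-101); adjust it if you use other extractors.

```python
from pathlib import Path

# Feature folders taken from the directory tree above, relative to data/Dataset.
EXPECTED_DIRS = [
    "Dev/afeat/vggish-quant",
    "Dev/vfeat/resnet-101",
    "Test/Clean/afeat/vggish-quant",
    "Test/Clean/vfeat/resnet-101",
    "Test/Noise/afeat/vggish-quant",
    "Test/Noise/vfeat/resnet-101",
    "Train/afeat/vggish-quant",
    "Train/vfeat/resnet-101",
    "Train_Part/afeat/vggish-quant",
    "Train_Part/vfeat/resnet-101",
]


def missing_dirs(dataset_root, expected=EXPECTED_DIRS):
    """Return the expected sub-directories that do not exist under dataset_root."""
    root = Path(dataset_root)
    return [d for d in expected if not (root / d).is_dir()]


def ensure_log_dir(project_root):
    """Create the log folder next to data/ if it is not there yet."""
    log_dir = Path(project_root) / "log"
    log_dir.mkdir(parents=True, exist_ok=True)
    return log_dir
```

For example, missing_dirs("data/Dataset") returns an empty list when every folder in the tree exists, and otherwise lists the relative paths that still need to be created or filled.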

matchnet training

Modify configs/train_config.yaml, then run python train.py.

matchnet testing

Modify configs/test_config.yaml, then run python test.py.

