BVQI (Zero-shot Blind Video Quality Index)


The official repository for BVQI, a robust zero-shot video quality index, and its efficient fine-tuned version. Accepted to ICME 2023 (Oral); extended version under review at TIP.

Key Features

  • Robustly predicts quality without training on any MOS scores.
  • Localized semantic quality prediction.
  • Can be fine-tuned robustly and efficiently given a small set of MOS-labelled videos.

May 2023 Updates (corresponding to the TIP-submitted extension):

  • Visualization of SAQI-Local local quality maps.
  • Efficient fine-tuning.
  • Speed optimization for TPQI (the temporal naturalness index).

Previous Version (corresponding to the ICME conference version):

  • Extract the three quality indexes with PyTorch.
  • Combine and evaluate.

Paper Links

ICME 2023: arXiv

Extension (under review at TIP): arXiv

Installation

Install OpenCLIP

To enable the local semantic affinity index, install OpenCLIP with the following patch (or equivalently), which makes the modified ResNet's attention pool return features for all spatial tokens instead of only the pooled one:

git clone https://github.com/mlfoundations/open_clip.git
cd open_clip
# Patch line 92 of the attention pool to return all spatial tokens, not just the pooled one
sed -i '92s/return x\[0\]/return x/' src/open_clip/modified_resnet.py
pip install -e .
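
To sanity-check the patch, you can inspect the output shape of the visual tower. A minimal sketch (the exact token layout is an assumption about OpenCLIP's modified ResNet):

# Minimal patch check: after the sed edit, the RN50 visual tower should return
# features for every attention-pool token, not a single pooled vector.
import torch
import open_clip

model, _, _ = open_clip.create_model_and_transforms("RN50", pretrained="openai")
model.eval()

with torch.no_grad():
    feats = model.visual(torch.randn(1, 3, 224, 224))

# Unpatched: (1, embed_dim). Patched: an extra token axis
# (exact layout may vary across OpenCLIP versions).
print(feats.shape)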

Install BVQI

Then you need to install the BVQI codebase.

cd ..
git clone https://github.com/VQAssessment/BVQI.git
cd BVQI
pip install -e .

Usage

Zero-shot Inference

Extract Semantic Affinity Quality Index (SAQI):

python semantic_affinity.py
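
Under the hood, SAQI measures how strongly frame features align with quality-describing text prompts. A minimal sketch of the idea (the prompt pair and frame sampling are illustrative, not the exact ones in semantic_affinity.py):

# Illustrative semantic-affinity sketch: softmax affinity of each frame to a
# positive/negative text-prompt pair, averaged over sampled frames.
import torch
import torch.nn.functional as F
import open_clip

model, _, preprocess = open_clip.create_model_and_transforms("RN50", pretrained="openai")
model.eval()
text = open_clip.tokenize(["a high quality photo", "a low quality photo"])  # illustrative prompts

frames = torch.randn(8, 3, 224, 224)  # stand-in for 8 preprocessed video frames

with torch.no_grad():
    tf = F.normalize(model.encode_text(text), dim=-1)   # (2, D)
    vf = model.visual(frames)
    if vf.dim() == 3:  # patched OpenCLIP returns all tokens; token 0 is the pooled one
        vf = vf[0]
    vf = F.normalize(vf, dim=-1)                        # (8, D)
    probs = (100.0 * vf @ tf.T).softmax(dim=-1)         # (8, 2) good/bad affinity per frame
    saqi = probs[:, 0].mean().item()                    # video-level index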

New! To use the local semantic affinity quality index, add -l to the command:

python semantic_affinity.py -l

The results improve as follows:

(SA index only)   KoNViD-1k                     CVD2014                       LIVE-VQC                      YouTube-UGC
SROCC             0.772 (global 0.760, +1.6%)   0.746 (global 0.740, +0.8%)   0.794 (global 0.784, +1.4%)   0.610 (global 0.585, +3.8%)
PLCC              0.772 (global 0.760, +1.6%)   0.768 (global 0.763, +0.7%)   0.803 (global 0.794, +1.1%)   0.616 (global 0.606, +1.4%)
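
The local variant computes the same affinity per spatial token instead of on the pooled feature, yielding a quality map. A rough sketch building on the snippet above (the token layout is an assumption about the patched pooler):

# Illustrative local-affinity sketch: per-token good/bad affinity reshaped into
# a spatial quality map (reuses `model`, `tf`, and `frames` from above).
with torch.no_grad():
    tokens = model.visual(frames)                  # (1 + H*W, N, D) after the patch (assumed)
    spatial = F.normalize(tokens[1:], dim=-1)      # drop the pooled token
    local = (100.0 * spatial @ tf.T).softmax(dim=-1)[..., 0]  # (H*W, N)
    h = w = int(local.shape[0] ** 0.5)             # RN50 at 224px: a 7x7 token grid
    quality_map = local.permute(1, 0).reshape(-1, h, w)  # (N, h, w) per-frame map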

Extract Spatial Naturalness Index:

python spatial_naturalness.py
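
For context, spatial naturalness indexes in this family are typically built on NIQE-style statistics of mean-subtracted, contrast-normalised (MSCN) coefficients. A generic sketch of that first step (not the exact code in spatial_naturalness.py):

# Generic MSCN computation, the standard first step of NIQE-style measures.
import torch
import torch.nn.functional as F

def mscn(gray: torch.Tensor, ksize: int = 7, sigma: float = 7 / 6, c: float = 1.0) -> torch.Tensor:
    # gray: (N, 1, H, W) luminance in [0, 255]
    half = ksize // 2
    xs = torch.arange(ksize, dtype=torch.float32) - half
    g = torch.exp(-xs ** 2 / (2 * sigma ** 2))
    g = (g / g.sum()).view(1, 1, 1, -1)

    def blur(t: torch.Tensor) -> torch.Tensor:  # separable Gaussian filter
        t = F.conv2d(F.pad(t, (half, half, 0, 0), mode="replicate"), g)
        return F.conv2d(F.pad(t, (0, 0, half, half), mode="replicate"), g.transpose(2, 3))

    mu = blur(gray)
    sig = torch.sqrt(torch.clamp(blur(gray * gray) - mu * mu, min=0.0))
    return (gray - mu) / (sig + c)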

Extract Temporal Naturalness Index:

python temporal_naturalness.py
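
TPQI scores temporal naturalness by how straight the trajectory of perceptual frame representations is over time. A toy sketch of discrete trajectory curvature (the perceptual-domain transforms, e.g. V1, are omitted here):

# Discrete curvature of a frame-feature trajectory: the angle between
# successive displacement vectors. Straighter trajectory = more natural.
import torch

def trajectory_curvature(feats: torch.Tensor) -> torch.Tensor:
    # feats: (T, D), one feature vector per frame
    v = feats[1:] - feats[:-1]                       # displacements, (T-1, D)
    v = v / (v.norm(dim=-1, keepdim=True) + 1e-8)
    cos = (v[1:] * v[:-1]).sum(-1).clamp(-1.0, 1.0)
    return torch.arccos(cos)                         # (T-2,) angles in radians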

Evaluate the Aggregated Results

See combine.ipynb
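
The general recipe is to rescale each index to a comparable range, average, and correlate with MOS. A hedged sketch of that recipe (the exact aggregation lives in combine.ipynb):

# Sigmoid-rescale each index after z-normalisation, then average.
import numpy as np
from scipy.stats import pearsonr, spearmanr

def sigmoid_rescale(x: np.ndarray) -> np.ndarray:
    z = (x - x.mean()) / (x.std() + 1e-8)
    return 1.0 / (1.0 + np.exp(-z))

def aggregate(sa: np.ndarray, sn: np.ndarray, tn: np.ndarray) -> np.ndarray:
    return np.mean([sigmoid_rescale(s) for s in (sa, sn, tn)], axis=0)

# score = aggregate(sa_index, sn_index, tn_index)
# srocc, _ = spearmanr(score, mos)
# plcc, _ = pearsonr(score, mos)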

New: visualize local quality maps with SAQI-Local

See Visualization.ipynb for details.
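
A minimal overlay sketch, assuming a per-frame map like quality_map from the local-affinity snippet above and an RGB frame as a NumPy array (frame_np is a hypothetical name):

# Upsample the token-grid map to frame resolution and overlay it.
import matplotlib.pyplot as plt
import torch.nn.functional as F

m = F.interpolate(quality_map[:1].unsqueeze(1), size=frame_np.shape[:2],
                  mode="bilinear", align_corners=False)[0, 0]
plt.imshow(frame_np)
plt.imshow(m.cpu().numpy(), alpha=0.5, cmap="jet")
plt.axis("off")
plt.show()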

New: fine-tune on a given set of videos

Fine-tuning without Implicit Prompt:

python prompt_tuning.py

Fine-tuning with Implicit Prompt:

python prompt_tuning.py -i

You can also add -cr to enable cross-dataset validation during fine-tuning.
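
Conceptually, prompt tuning keeps CLIP frozen and optimises the text-prompt side against the labelled MOS, typically with a correlation-style loss. A rough sketch of such a loss (the loop and saqi_with_prompts below are hypothetical, not the prompt_tuning.py internals):

# Differentiable PLCC-style loss: maximise linear correlation with MOS.
import torch

def plcc_loss(pred: torch.Tensor, mos: torch.Tensor) -> torch.Tensor:
    pred = (pred - pred.mean()) / (pred.std() + 1e-8)
    mos = (mos - mos.mean()) / (mos.std() + 1e-8)
    return 1.0 - (pred * mos).mean()

# Hypothetical loop: learn a residual on the prompt token embeddings.
# prompt_delta = torch.zeros(2, ctx_len, embed_dim, requires_grad=True)
# opt = torch.optim.AdamW([prompt_delta], lr=1e-3)
# for videos, mos in loader:
#     scores = saqi_with_prompts(videos, base_prompts + prompt_delta)  # hypothetical helper
#     loss = plcc_loss(scores, mos)
#     opt.zero_grad(); loss.backward(); opt.step()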

Citation

If you find this work useful, please cite the following papers:

@article{wu2023bvqiplus,
  title={Towards Robust Text-Prompted Semantic Criterion for In-the-Wild Video Quality Assessment},
  author={Wu, Haoning and Liao, Liang and Wang, Annan and Chen, Chaofeng and Hou, Jingwen and Sun, Wenxiu and Yan, Qiong and Lin, Weisi},
  journal={arXiv preprint arXiv:2304.14672},
  year={2023}
}

@inproceedings{wu2023bvqi,
  title={Exploring Opinion-Unaware Video Quality Assessment with Semantic Affinity Criterion},
  author={Wu, Haoning and Liao, Liang and Hou, Jingwen and Chen, Chaofeng and Zhang, Erli and Wang, Annan and Sun, Wenxiu and Yan, Qiong and Lin, Weisi},
  booktitle={IEEE International Conference on Multimedia and Expo (ICME)},
  year={2023}
}

Note: Possible Performance Drop When Using Only This Codebase

The code for the temporal naturalness index differs slightly from the original version (it computes only the V1 curvature), so you may observe a small performance drop. We will try to include the LGN curvature computation in future versions. In the meantime, we provide naturalnesses_matlab_results.pkl to help you reproduce our results with the MATLAB-obtained SN and TN indexes.
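
To use the pickle, load it and inspect its structure first (the layout is not documented here, so treat it as an assumption to verify):

# Load the MATLAB-computed SN/TN indexes shipped with the repo.
import pickle

with open("naturalnesses_matlab_results.pkl", "rb") as f:
    matlab_results = pickle.load(f)

print(type(matlab_results))  # inspect keys/fields before wiring into combine.ipynb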

Performance of this codebase (numbers in parentheses: original paper, computed with the MATLAB code):

        KoNViD-1k       CVD2014         LIVE-VQC
SROCC   0.758 (0.760)   0.683 (0.740)   0.772 (0.784)
PLCC    0.755 (0.760)   0.708 (0.763)   0.784 (0.794)

With a GPU, this codebase runs around 10x faster than the original MATLAB version.
