VMH (Video Misleading Headline Dataset)

This is a repository that contains VMH dataset and codes for baseline reproduction from EMNLP 2023 paper that can be found here!

Abstract

Polarization and the marketplace for impressions have conspired to make navigating information online difficult for users, and while there has been a significant effort to detect false or misleading text, multimodal datasets have received considerably less attention. To complement existing resources, we present multimodal Video Misleading Headline (VMH), a dataset that consists of videos and whether annotators believe the headline is representative of the video's contents. After collecting and annotating this dataset, we analyze multimodal baselines for detecting misleading headlines. Our annotation process also focuses on why annotators view a video as misleading, allowing us to better understand the interplay of annotators' background and the content of the videos.

Crowdsourcing Framework

The below framework is specifically devised to reduce the subjectivity of the misleading video headline detection task. The annotators are asked to encounter each schemes to finalize their label annotations on the misleadingness of the headline and rationale annotations that demonstrates the reason behind their decisions.

Label Annotation Scheme

Rationale Annotation Scheme

Codebase

The preprocessing code is based on raw data collected from MTurk crowdsourcing platform.

The code for benchmark models (VideoCLIP and VLM) are forked from https://github.com/facebookresearch/fairseq/blob/main/examples/MMPT/README.md Both models are video-text retrieval models, which are added a classification layer to be used a classifier that detects misleading video headlines. Code is available in ./scripts/MultimodalBaselines.

Name		Name	Last commit message	Last commit date
Latest commit History 41 Commits
data		data
scripts		scripts
.DS_Store		.DS_Store
README		README
README.md		README.md
label_diagram_final.pdf		label_diagram_final.pdf
rationale_diagram_final.pdf		rationale_diagram_final.pdf

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

data

data

scripts

scripts

.DS_Store

.DS_Store

README

README

README.md

README.md

label_diagram_final.pdf

label_diagram_final.pdf

rationale_diagram_final.pdf

rationale_diagram_final.pdf

Repository files navigation

VMH (Video Misleading Headline Dataset)

Abstract

Crowdsourcing Framework

Label Annotation Scheme

Rationale Annotation Scheme

Codebase

About

Releases

Packages

Languages

yysung/VMH

Folders and files

Latest commit

History

Repository files navigation

VMH (Video Misleading Headline Dataset)

Abstract

Crowdsourcing Framework

Label Annotation Scheme

Rationale Annotation Scheme

Codebase

About

Resources

Stars

Watchers

Forks

Languages