This repository contains the implementation of the methods in "Cross-Lingual Cross-Platform Rumor Verification Pivoting on Multimedia Content". It depends on:
- Python 2.7
- Pytorch
- scikit-learn
- Theano
- Keras (with Theano backend)
- Pandas
- ...
Three sub-datasets of our CCMR dataset are saved in the folder CCMR as three JSON files (each a list of JSON objects): "CCMR/CCMR_Twitter.txt", "CCMR/CCMR_Google.txt", and "CCMR/CCMR_Baidu.txt".
For CCMR Twitter, each tweet is saved as a JSON object with the keys "tweet_id", "content", "image_id", "event", and "timestamp". For CCMR Google and CCMR Baidu, each webpage is saved as a JSON object with the keys "url", "title", "image_id", and "event". The values of "image_id" are lists of image or video names from the VMU 2015 dataset. All of these image files and video URLs are available in "images.zip".
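For example, a sub-dataset can be loaded with the standard json module (a minimal sketch, assuming each file holds a single JSON array as described above):

```python
import json

# Load the Twitter sub-dataset: a list of JSON objects, one per tweet.
with open('CCMR/CCMR_Twitter.txt') as f:
    tweets = json.load(f)

for tweet in tweets[:3]:
    # "image_id" links the tweet to image/video names from VMU 2015.
    print('%s (%s): %s' % (tweet['tweet_id'], tweet['event'], tweet['image_id']))
```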
- To reproduce the experiment results, simply run main.py. Alternatively, the steps below rebuild the full pipeline from scratch:
- Download the parallel English and Mandarin sentences of news and microblogs from UM-Corpus and save them in a folder named 'UM_Corpus'.
- Run prepare_UM_Corpus.py to split and tokenize the data in UM-Corpus.
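The script defines the actual preprocessing; the sketch below only illustrates the kind of splitting and tokenization involved, assuming alternating English/Mandarin lines in the corpus files, nltk for English, and jieba for Mandarin (the file name is hypothetical):

```python
# -*- coding: utf-8 -*-
import io
import jieba                              # assumed Mandarin segmenter
from nltk.tokenize import word_tokenize   # assumed English tokenizer

with io.open('UM_Corpus/Bi-News.txt', encoding='utf-8') as f:
    lines = [line.strip() for line in f if line.strip()]

# Assume each sentence pair is stored as an English line followed by
# its Mandarin translation on the next line.
pairs = []
for i in range(0, len(lines) - 1, 2):
    en_tokens = word_tokenize(lines[i])        # word-level English tokens
    zh_tokens = list(jieba.cut(lines[i + 1]))  # segmented Mandarin words
    pairs.append((en_tokens, zh_tokens))
```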
- Run train_multilingual_embedding.py to train the multilingual sentence embedding.
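The embedding model itself is defined in the script. Purely as intuition for what a shared cross-lingual sentence space means, here is a toy alignment sketch (not the paper's method): given vectors for the parallel sentence pairs, learn a linear map from the Mandarin side into the English space by least squares.

```python
import numpy as np

# Stand-in data: random "sentence vectors" for 1000 parallel pairs.
rng = np.random.RandomState(0)
en_vecs = rng.randn(1000, 100)
zh_vecs = en_vecs.dot(rng.randn(100, 100)) + 0.1 * rng.randn(1000, 100)

# Solve min_W ||zh_vecs.dot(W) - en_vecs||_F^2 via the pseudo-inverse.
W = np.linalg.pinv(zh_vecs).dot(en_vecs)

def to_shared_space(zh_vec):
    # Project a Mandarin sentence vector into the shared (English) space.
    return zh_vec.dot(W)
```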
- Run prepare_FNC_split.py to tokenize, embed, and split the data from the Fake News Challenge.
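As a rough illustration of the splitting step, assuming the official FNC-1 release files (train_stances.csv and train_bodies.csv) under an 'fnc-1' folder; the script's actual split may differ:

```python
import pandas as pd
from sklearn.model_selection import train_test_split

# FNC-1 ships stances and article bodies in separate CSV files.
stances = pd.read_csv('fnc-1/train_stances.csv')  # Headline, Body ID, Stance
bodies = pd.read_csv('fnc-1/train_bodies.csv')    # Body ID, articleBody
data = stances.merge(bodies, on='Body ID')

# Hold out a validation portion, stratified by the stance label.
train, val = train_test_split(data, test_size=0.2,
                              stratify=data['Stance'], random_state=42)
```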
- Run train_agreement_classifier.py to train the agreement classifier.
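The classifier architecture lives in the script; a minimal stand-in, assuming sentence embeddings as input, is a logistic regression over standard sentence-pair features (concatenation, elementwise product, and difference):

```python
import numpy as np
from sklearn.linear_model import LogisticRegression

def pair_features(headline_vec, body_vec):
    # A common featurization for sentence-pair classification.
    return np.concatenate([headline_vec, body_vec,
                           headline_vec * body_vec,
                           headline_vec - body_vec])

# Dummy embeddings and labels, just to make the sketch runnable.
rng = np.random.RandomState(0)
h, b = rng.randn(200, 100), rng.randn(200, 100)
X = np.array([pair_features(h[i], b[i]) for i in range(200)])
y = rng.randint(0, 3, size=200)  # e.g. agree / disagree / discuss

clf = LogisticRegression(max_iter=1000).fit(X, y)
```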
- Run prepare_CCMR.py to tokenize the CCMR dataset.
- Run extract_clcp_feats.py to extract all the cross-lingual cross-platform features and data splits needed for the experiments; the available output files are saved in the folder CLCP.
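For intuition, one family of such features compares a tweet's embedding with the embeddings of Google/Baidu webpages that share its images; a hedged sketch (function names are illustrative, not the script's API):

```python
import numpy as np

def cosine(u, v):
    return float(np.dot(u, v) / (np.linalg.norm(u) * np.linalg.norm(v) + 1e-8))

def cross_platform_feats(tweet_vec, page_vecs):
    # page_vecs: embeddings of webpages retrieved via a shared "image_id",
    # already projected into the shared cross-lingual space.
    sims = [cosine(tweet_vec, p) for p in page_vecs]
    return {'max_sim': max(sims), 'mean_sim': sum(sims) / len(sims)}
```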
- Use main.py and the other scripts to reproduce everything from the paper.