
Understanding Visual Memes: an Empirical Analysis of Text Superimposed on Memes Shared on Twitter

This repository contains the code and instructions to reproduce the IWT meme classifier pipeline presented in the paper Understanding Visual Memes: an Empirical Analysis of Text Superimposed on Memes Shared on Twitter (ICWSM 2020). If you want to analyze IWT memes and use our code, please cite the paper! Thank you.


@article{du_understanding_2020,
	title = {Understanding {Visual} {Memes}: {An} {Empirical} {Analysis} of {Text} {Superimposed} on {Memes} {Shared} on {Twitter}},
	volume = {14},
	copyright = {Copyright (c) 2020 Association for the Advancement of Artificial Intelligence},
	issn = {2334-0770},
	shorttitle = {Understanding {Visual} {Memes}},
	url = {https://www.aaai.org/ojs/index.php/ICWSM/article/view/7287},
	language = {en},
	urldate = {2020-06-11},
	journal = {Proceedings of the International AAAI Conference on Web and Social Media},
	author = {Du, Yuhao and Masood, Muhammad Aamir and Joseph, Kenneth},
	month = may,
	year = {2020},
	pages = {153--164},
}

Contents

  1. Introduction to the two-step meme classification pipeline
  2. Step 1
  3. Step 2

Introduction to the two-step meme classification pipeline

Given a dataset of images shared on social media, we develop a two-step pipeline to identify image-with-text (IWT) memes.

  • Step 1 is to use the Optical Character Recognition (OCR) engine Tesseract [1] to filter out all images that do not contain text.
  • Step 2 is to use a supervised multimodal neural classifier to distinguish, amongst the remaining images, those that are IWT memes from those that are not.

The overall structure of our multimodal neural classifier is shown in the architecture diagram below. First, input images are fed into pretrained neural networks to extract visual and textual features. We then concatenate these two feature vectors into a single multimodal feature representation and use a final neural network to perform classification.

[Architecture diagram: neural_network_image]
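
As a concrete illustration, here is a minimal sketch of this architecture. It assumes PyTorch and that the visual and textual feature vectors have already been extracted by the pretrained networks; the feature dimensions and layer sizes are illustrative, not necessarily the ones used in the repository.

import torch
import torch.nn as nn

class MultimodalMemeClassifier(nn.Module):
    def __init__(self, visual_dim=2048, text_dim=300, hidden_dim=256):
        super().__init__()
        # Final classification head over the concatenated multimodal feature.
        self.classifier = nn.Sequential(
            nn.Linear(visual_dim + text_dim, hidden_dim),
            nn.ReLU(),
            nn.Linear(hidden_dim, 2),  # IWT meme vs. non-IWT image
        )

    def forward(self, visual_feat, text_feat):
        # Concatenate the visual and textual feature vectors into a single
        # multimodal representation, then classify.
        fused = torch.cat([visual_feat, text_feat], dim=1)
        return self.classifier(fused)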

Step 1

After cloning the repo, run the following commands from this directory to carry out step 1 of the pipeline.

sh install.sh
cd step1_filter
python filter.py --input_dir=[image_directory]

The install script will first download the GloVe embeddings and install a few required packages.
After that, running filter.py forwards the input images to Tesseract and filters out those without any superimposed text. The remaining images are moved into the '../data/Image_with_Text' directory. In addition, the text extracted by Tesseract is preprocessed with SpaCy and stored as a dictionary at '../data/name_text.pkl', whose keys are image names and whose values are the preprocessed texts extracted from them.
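
For reference, the following is a minimal sketch of what this step does, assuming the pytesseract and spaCy Python bindings and the English spaCy model; the actual filter.py may invoke Tesseract and preprocess text differently.

import os
import pickle
import shutil

import pytesseract
import spacy
from PIL import Image

nlp = spacy.load("en_core_web_sm")
input_dir = "images"                     # corresponds to --input_dir
out_dir = "../data/Image_with_Text"
os.makedirs(out_dir, exist_ok=True)

name_text = {}
for name in os.listdir(input_dir):
    path = os.path.join(input_dir, name)
    text = pytesseract.image_to_string(Image.open(path)).strip()
    if not text:
        continue                         # skip images without superimposed text
    shutil.move(path, os.path.join(out_dir, name))
    # Light spaCy preprocessing: lowercase, keep alphabetic tokens only.
    tokens = [t.text.lower() for t in nlp(text) if t.is_alpha]
    name_text[name] = " ".join(tokens)

with open("../data/name_text.pkl", "wb") as f:
    pickle.dump(name_text, f)            # keys: image names, values: preprocessed text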

Step 2

After finishing step 1, download the pretrained model at Link and put it in the '../data' directory. Then run the following commands from the current directory to carry out step 2 of the pipeline.

cd ../step2_MemeClassifier
python classification.py

The script forwards the filtered images to the multimodal neural IWT meme classification network. Identified IWT memes are placed in the '../data/IWTmeme' directory, and the remaining images are placed in the '../data/nonIWTmeme' directory.
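
The dispatch logic can be sketched roughly as follows; predict() here is a stand-in for the multimodal network and not the repository's actual API.

import os
import shutil

src_dir = "../data/Image_with_Text"
iwt_dir = "../data/IWTmeme"
non_dir = "../data/nonIWTmeme"
os.makedirs(iwt_dir, exist_ok=True)
os.makedirs(non_dir, exist_ok=True)

def predict(image_path):
    # Placeholder for the multimodal classifier: return 1 for IWT meme, 0 otherwise.
    raise NotImplementedError

for name in os.listdir(src_dir):
    path = os.path.join(src_dir, name)
    target_dir = iwt_dir if predict(path) == 1 else non_dir
    shutil.move(path, os.path.join(target_dir, name))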

Train your own model

To train your own IWT meme classifier, first prepare a dataset containing both IWT memes and non-IWT images, together with binary labels for each image. The labels have to be stored in a pickled dictionary whose keys are image names and whose values are the labels (0/1); a sketch of building such a file appears at the end of this section. After these steps, run the following command.

python train.py --input_dir=[meme_directory] --dict_dir=[meme_text dictionary pickle file directory] --dict_label=[meme_label dictionary pickle file directory] --output_dir=[model_directory]

The trained model will be placed in the output directory that you specified.
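
Here is a minimal sketch of building the label dictionary described above; the two input folders are hypothetical, and only the pickle format (image name mapped to a 0/1 label) comes from this README.

import os
import pickle

labels = {}
for name in os.listdir("my_dataset/iwt_memes"):        # hypothetical folder of IWT memes
    labels[name] = 1
for name in os.listdir("my_dataset/non_iwt_images"):   # hypothetical folder of non-IWT images
    labels[name] = 0

with open("meme_labels.pkl", "wb") as f:
    pickle.dump(labels, f)   # pass this file's path to --dict_label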

Contact

Feel free to contact yuhaodu@buffalo.edu if you encounter any problems.

Acknowledgements

This work was supported by a Google Cloud Platform Research Credit grant. We also gratefully acknowledge the support of NVIDIA Corporation with the donation of the GPU used for this research.

Reference

[1] Smith, R., Antonova, D., & Lee, D. (2009). Adapting the Tesseract open source OCR engine for multilingual OCR. MOCR '09.
