MediaEval 2023- NewsImages

This repository contains an implementation of the NewsImages task of MediaEval 2023, using CLIP (Contrastive Language-Image Pretraining). It aims to match news articles with the most appropriate header image.

The Mean Reciprocal Rank (MRR) is used to evaluate the rank of of each predicted image against the ground truth.

Paper

The paper describing the proposed method can be found here:

Optimizing Visual Pairings: A CLIP Framework for Precision News Image Rematching

Model

CLIP learns a shared embedding space for images and their corresponding textual descriptions, fostering a unified understanding of multimodal relationships. The contrastive learning framework enhances the model's robustness by maximizing the similarity of correct image-text pairs and minimizing the similarity of incorrect pairs. CLIP's capability for zero-shot learning helps in scenarios with diverse datasets, such as news articles with varying topics.

The model is composed of three main sections-the image encoder, text encoder, and a module for the projection of the embeddings.

Results

The results across datasets are summarized in the table below:

Metric	Baseline	GDELT-P1	GDELT-P2
matchIn100	100/1500	796/1500	886/1500
MeanReciprocalR100	0.00346	0.07839	0.09134
MeanReciprocalAt5	0.00333	0.10933	0.12600
MeanReciprocalAt10	0.00667	0.16867	0.20257
MeanReciprocalAt50	0.03333	0.40600	0.45533
MeanReciprocalAt100	0.06667	0.53067	0.59067

On the GDELT-P1 dataset, encompassing standard articles and their corresponding images, the model demonstrated a Mean Reciprocal Rank of 0.07839. In practical terms, this signifies that, on average, the model identifies the first correct match around the 13th position in the list of predictions for each query. Notably, its performance excelled on the GDELT-P2 dataset, primarily comprising images generated by machine-learning models. Here, the model identified the first correct match at around the 10th position.

Examples

The code returns the top 100 images for a given news heading, in order of decreasing relevance.

For example, the statement:

"Oxford researcher to watch Barbie as 'dessert' to Oppenheimer amid dual premiere"

yields the following results:

Authors

Citation

If you find our work useful in your research, please include the following citation:

@article{premnath2023optimizing,
  title={Optimizing Visual Pairings: A CLIP Framework for Precision News Image Rematching},
  author={Premnath, Pooja and Yenumulapalli, Venkatasai Ojus and Sivanaiah, Rajalakshmi and Suseelan, Angel Deborah},
  year={2023}
}

Acknowledgements

A. Lommatzsch, B. Kille, Ö. Özgöbek, M. Elahi, D.-T. Dang-Nguyen, News images in mediaeval 2023, in: Proceedings of the MediaEval Benchmarking Initiative 2023, CEU Workshop Proceedings,2024
M. M. Shariatnia, Simple CLIP, 2021. doi:10.5281/zenodo.6845731.

Name		Name	Last commit message	Last commit date
Latest commit History 16 Commits
.ipynb_checkpoints		.ipynb_checkpoints
GDELT-Dataset-2023-Part2		GDELT-Dataset-2023-Part2
GDELT-Dataset		GDELT-Dataset
RT_Images_Test		RT_Images_Test
Testing Dataset		Testing Dataset
Training Dataset		Training Dataset
CLIP Implementation.ipynb		CLIP Implementation.ipynb
CSV_to_JSON.py		CSV_to_JSON.py
MediaEval Data Preprocessing.ipynb		MediaEval Data Preprocessing.ipynb
README.md		README.md
Text_to_CSV.py		Text_to_CSV.py
clip_corrected.jpg		clip_corrected.jpg
sample_output.png		sample_output.png
training_json.json		training_json.json

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

MediaEval 2023- NewsImages

Paper

Model

Results

Examples

Authors

Citation

Acknowledgements

About

Releases

Packages

Contributors 2

Languages

pooja-premnath/MediaEval-2023-NewsImages

Folders and files

Latest commit

History

Repository files navigation

MediaEval 2023- NewsImages

Paper

Model

Results

Examples

Authors

Citation

Acknowledgements

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Languages

Packages