NLP-2-Assignment-Multimodal-NLP

This is a modified version of the original README.md. The original is kept in README_original.md.

The code pipeline has been streamlined. After following the installation steps below, running main.ipynb should be enough to replicate the results of the paper.

Installation

Here we outline the steps required to run main.ipynb. These steps were tested on a machine running Ubuntu 20.04 LTS with an RTX 2060 Super GPU.

Install Anaconda on the machine.
Create a virtual environment with Python 3.7.5 using conda, e.g. run conda create --name nlp2-multimodal-R-B python=3.7.5 in the terminal.
Activate the environment with conda activate nlp2-multimodal-R-B
Clone the repository into the desired location: git clone https://github.com/Noixas/Multimodal-NLP.git
To access the data, register for the Hateful Memes challenge.
- Download the data and extract the zip file into the folder 'dataset'.
The folder structure should look as follows:

.
├── Multimodal-NLP/
│   ├── dataset/
│   │   ├── img/
│   │   ├── own_features/
│   │   ├── train.jsonl
│   │   ├── dev_seen.jsonl
│   │   ├── dev_unseen.jsonl
│   │   ├── test_seen.jsonl
│   │   ├── test_unseen.jsonl
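
Before starting the notebook, it can help to confirm that the extracted data matches the layout above. The following is a minimal sketch, not part of the repository: the function name missing_dataset_entries is hypothetical, and the script assumes it is run from the repository root with the data extracted into 'dataset'.

```python
from pathlib import Path

# Entries taken from the folder structure listed above.
EXPECTED_DIRS = ["img", "own_features"]
EXPECTED_FILES = [
    "train.jsonl",
    "dev_seen.jsonl",
    "dev_unseen.jsonl",
    "test_seen.jsonl",
    "test_unseen.jsonl",
]


def missing_dataset_entries(dataset_dir):
    """Return the expected folders and files that are absent from dataset_dir."""
    root = Path(dataset_dir)
    # Folders are reported with a trailing slash so they are easy to tell apart.
    missing = [name + "/" for name in EXPECTED_DIRS if not (root / name).is_dir()]
    missing += [name for name in EXPECTED_FILES if not (root / name).is_file()]
    return missing


if __name__ == "__main__":
    # Assumption: the current working directory is the repository root.
    missing = missing_dataset_entries("dataset")
    if missing:
        print("Missing from dataset/:", ", ".join(missing))
    else:
        print("Dataset layout looks complete.")
```

An empty result only means the expected names exist. It does not verify the contents of the files.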

In the terminal, go to the path where the repository was cloned, e.g. /home/username/Documents/Multimodal-NLP/
Run jupyter notebook in the terminal to start a session and open the Jupyter file tree, which shows all the files in the current folder.
Click on main.ipynb and run all the cells in the notebook. It will install the required Python libraries and download the rest of the data that is needed.
- If you face problems running the installation part in the notebook, try running the commands directly in the terminal or open an issue in the repository.
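
If the installation cells fail, one thing worth ruling out is that Jupyter was started outside the conda environment created earlier. The sketch below is an illustration only: the helper name python_matches is hypothetical, and it compares just the major and minor version against the Python 3.7.5 environment described above.

```python
import sys

# Major and minor version of the Python 3.7.5 environment created with conda.
EXPECTED = (3, 7)


def python_matches(version_info=None, expected=EXPECTED):
    """Return True if the interpreter's major.minor version equals expected."""
    if version_info is None:
        version_info = sys.version_info
    return tuple(version_info[:2]) == tuple(expected)


if __name__ == "__main__":
    found = ".".join(str(part) for part in sys.version_info[:3])
    if python_matches():
        print("Python " + found + " matches the expected 3.7 environment.")
    else:
        print("Python " + found + " found; the conda environment may not be active.")
```

The same lines can be pasted into a notebook cell to check which interpreter the kernel is using.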

To train the model from the root folder of the repository, run the following in the terminal:

python -u train_uniter.py --config config/uniter-base.json --data_path ./dataset --model_path ./model_checkpoints --pretrained_model_file uniter-base.pt --feature_path ./dataset/own_features --lr 3e-5 --scheduler warmup_cosine --warmup_steps 500 --max_epoch 30 --batch_size 16 --patience 5 --gradient_accumulation 2 --model_save_name meme.pt --seed 43
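
When the command above needs to be repeated with different settings, for example several seeds, it can be assembled programmatically. This is a hedged sketch rather than a script shipped with the repository: build_train_command is a hypothetical helper, the flag values are copied from the command above, and the effective batch size comment assumes train_uniter.py accumulates gradients in the usual way.

```python
import shlex
import sys

# Flag values copied from the training command above, kept as strings
# so they are passed through exactly as written.
BASE_ARGS = {
    "config": "config/uniter-base.json",
    "data_path": "./dataset",
    "model_path": "./model_checkpoints",
    "pretrained_model_file": "uniter-base.pt",
    "feature_path": "./dataset/own_features",
    "lr": "3e-5",
    "scheduler": "warmup_cosine",
    "warmup_steps": "500",
    "max_epoch": "30",
    "batch_size": "16",
    "patience": "5",
    "gradient_accumulation": "2",
    "model_save_name": "meme.pt",
}


def build_train_command(seed=43, **overrides):
    """Return the training command as an argument list for subprocess.run."""
    args = dict(BASE_ARGS, seed=seed, **overrides)
    cmd = [sys.executable, "-u", "train_uniter.py"]
    for name, value in args.items():
        cmd += ["--" + name, str(value)]
    return cmd


if __name__ == "__main__":
    # Assumption: with standard gradient accumulation, each optimizer step
    # sees batch_size * gradient_accumulation = 16 * 2 = 32 samples.
    for seed in (43, 44, 45):
        cmd = build_train_command(seed=seed)
        # Print only; pass cmd to subprocess.run(cmd, check=True) to launch it.
        print(" ".join(shlex.quote(part) for part in cmd))
```

Building the command as a list avoids shell quoting issues, and any flag can be overridden by keyword, e.g. build_train_command(seed=44, batch_size=8).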
