Code for the ACL 2023 paper "Scene Graph as Pivoting: Inference-time Image-free Unsupervised Multimodal Machine Translation with Visual Scene Hallucination".
conda env create -f environments/full.yml
conda activate UMMT-VSH
pip install -e fairseq/
pip install -e taming-transformers/
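As an optional sanity check, the two editable installs can be verified from Python; note that the taming-transformers repository installs its code under the taming package name.

```bash
# Optional check that both editable installs are importable.
# taming-transformers exposes its code as the "taming" package.
python -c "import fairseq, taming; print(fairseq.__version__)"
```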
- MMT data
  - Multi30k
- NMT data with image source
  - WMT14 En→De, En→Fr
  - WMT16 En→Ro
  - WIT-images
- Binarize the translation data for fairseq (an illustrative fairseq-preprocess call is sketched below):
  bash scripts/multi30k/preproc.sh
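The exact preprocessing options live in scripts/multi30k/preproc.sh; purely as an illustration of the binarization step, a standard fairseq-preprocess call over BPE-encoded Multi30k splits would look like the following (file names, language pair, and options here are assumptions, not the script's actual contents):

```bash
# Illustrative only: binarize BPE-encoded En-De Multi30k splits for fairseq.
# Paths, the language pair, and --joined-dictionary are assumptions.
fairseq-preprocess \
  --source-lang en --target-lang de \
  --trainpref data/multi30k/train.bpe \
  --validpref data/multi30k/valid.bpe \
  --testpref data/multi30k/test.bpe \
  --destdir data-bin/multi30k.en-de \
  --joined-dictionary --workers 8
```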
- Download the Flickr30K and MS-COCO images, then create symbolic links:
  ln -s /xxx/flickr30k
  ln -s /xxx/mscoco
- Download the WIT translation data, with the parallel corpora organized for machine translation; the archive also includes tokenized and BPE-encoded sentences.
- For each translation task, download the images listed in [train|valid|test]_url.txt to the corresponding paths given in [train|valid|test]_img.txt; image filenames are the MD5 hashes of their URLs (a download sketch follows below).
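A minimal download sketch, assuming the *_url.txt and *_img.txt files are line-aligned (as the naming suggests) and that wget and md5sum are available:

```bash
# Fetch each image to the path listed in *_img.txt; the target file name is
# the MD5 hash of the image URL. Assumes the two files are line-aligned.
paste train_url.txt train_img.txt | while read -r url img; do
  mkdir -p "$(dirname "$img")"
  wget -q -O "$img" "$url"
done
# Optional spot check: the MD5 of a URL should match its stored file name.
printf '%s' "$(head -n1 train_url.txt)" | md5sum
```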
- Binarize the translation data for fairseq:
  bash scripts/wit/preproc.sh
- Parse the SG structures for all images and texts with the tools in SG-parsing/VSG and SG-parsing/LSG (hypothetical invocations are sketched below).
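The entry points of the SG-parsing tools are not shown in this README, so the commands below are hypothetical placeholders for how the visual (VSG) and language (LSG) scene-graph parsing passes might be invoked:

```bash
# Hypothetical placeholders: the actual script names and arguments are
# defined inside SG-parsing/VSG and SG-parsing/LSG.
python SG-parsing/VSG/parse_images.py --image-dir /xxx/flickr30k --output vsg/
python SG-parsing/LSG/parse_texts.py --input data/multi30k/train.en --output lsg/
```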
- Run the scripts/multi30k-train.sh script for Multi30k.
- Run the scripts/wmt-train.sh script for WMT.
- Run the scripts/test.sh script for evaluation (an illustrative generation call is sketched below).
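scripts/test.sh is the intended evaluation entry point; for reference, a bare fairseq-generate call over the binarized data would look roughly like the one below (checkpoint path, data directory, and decoding options are illustrative assumptions, not necessarily what the script uses):

```bash
# Illustrative generation call; all paths and options are assumptions.
fairseq-generate data-bin/multi30k.en-de \
  --path checkpoints/checkpoint_best.pt \
  --source-lang en --target-lang de \
  --gen-subset test --beam 5 --batch-size 64 --remove-bpe
```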