MMAdaptionPromptV2

An attempt to implement a multi-modal LLaMA-Adapter V2 that is compatible with more models.

Notes

Use this peft fork, which implements prompt-adaption-v2 with support for multi-modal fine-tuning.
So far this has only been trained with clip-large + llama-7b on 1xA100-40G.
Download the COCO zip files and put them into datasets/coco before trying this project.
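
As a quick sanity check before training, the sketch below lists the zip archives present in datasets/coco. It is only an illustration: the helper name find_coco_archives is hypothetical and not part of this repository, and the notes above do not say which COCO archives are required or whether they need to be extracted, so the check only confirms that at least one readable zip file is in place.

```python
from pathlib import Path
import zipfile


def find_coco_archives(root="datasets/coco"):
    """Return the readable .zip files found directly under `root`, sorted by name.

    Hypothetical helper: the directory comes from the notes above, but the
    expected archive names are not specified there, so none are assumed here.
    """
    root = Path(root)
    if not root.is_dir():
        return []
    # is_zipfile filters out truncated or mislabeled downloads
    return sorted(p for p in root.glob("*.zip") if zipfile.is_zipfile(p))


if __name__ == "__main__":
    archives = find_coco_archives()
    if archives:
        for path in archives:
            print(f"found {path.name}")
    else:
        print("no COCO zip files found in datasets/coco")
```

If the list comes back empty, the download is either missing or was placed in a different directory.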
