Check requirements.txt file for packages
** follow these steps to train and test DSV-LFS.**
1. Download the dataset from the following links.
- PASCAL-5i: PASCAL VOC 2012 + SBD
- COCO-20i: MSCOCO2014
Download SAM ViT-H pre-trained weights from the link.
Download llava-v1.5-7b model from the link.
Download clip-vit-large-patch14-336 model from the link.
deepspeed train.py \
--version="PATH_TO_llava-v1.5-7b" \
--dataset_dir='./dataset' \
--vision_pretrained="PATH_TO_SAM" \
--vision-tower="PATH to clip" \
--benchmark= "pascal" or "coco" \
--fold="0" \
--exp_name="name"\
--shot="1" \
If you find this project useful in your research, please consider citing:
@inproceedings{Karimi2025DSVLFS,
title = {DSV-LFS: Unifying LLM-Driven Semantic Cues with Visual Features for Robust Few-Shot Segmentation},
author = {Amin Karimi and Charalambos Poullis},
booktitle = {Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)},
year = {2025},
}
