Language-driven Grasp Detection

This is the repository of the CVPR 2024 paper "Language-driven Grasp Detection."

Table of contents

  1. Installation
  2. Datasets
  3. Training
  4. Testing

Installation

  • Create a virtual environment
$ conda create -n lgd python=3.9
$ conda activate lgd
  • Install PyTorch and the required packages
$ conda install pytorch==1.12.1 torchvision==0.13.1 torchaudio==0.12.1 cudatoolkit=11.3 -c pytorch
$ pip install -r requirements.txt
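
To quickly check that PyTorch and CUDA are set up as expected (an optional sanity check), you can print the installed version and whether a CUDA device is visible:

$ python -c "import torch; print(torch.__version__, torch.cuda.is_available())"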

Datasets

Our dataset can be accessed via this link.
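
The training and testing commands below refer to the dataset under data/. One possible layout, inferred only from the paths used in those commands (adjust it to match the downloaded archive), is:

data/
  grasp-anything/
  grasp-anything++/
    seen/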

Training

We use GR-ConvNet as our default deep network. To train GR-ConvNet on different datasets, you can use the following command:

$ python -m torch.distributed.launch --nproc_per_node=<num_gpus> --use_env -m train_network_diffusion --dataset grasp-anywhere --dataset-path data/grasp-anything++/ --add-file-path data/grasp-anything++/seen --description training_grasp_anything++_lgd --use-depth 0 --seen 1 --network lgd --epochs 1000

Furthermore, if you want to train the linguistic versions of other networks, use the following command:

$ python train_network.py --dataset grasp-anywhere --dataset-path data/grasp-anything/ --add-file-path data/grasp-anything++/seen --description <description> --use-depth 0 --seen 1 --network <network_name>

We also provide training for other baselines; you can use the following command:

$ python train_network.py --dataset <dataset> --dataset-path <dataset> --description <your_description> --use-depth 0 --network <baseline_name>

For instance, if you want to train GG-CNN on Grasp-Anything++, use the following command:

$ python train_network.py --dataset grasp-anywhere --dataset-path data/grasp-anything/ --add-file-path data/grasp-anything++/seen --description training_grasp_anything++_lggcnn --use-depth 0 --seen 1 --network lggcnn

Testing

For testing, you can apply similar commands to evaluate different baselines:

$ python -m torch.distributed.launch --nproc_per_node=1 --use_env -m evaluate_diffusion --dataset grasp-anywhere --dataset-path data/grasp-anything++/ --add-file-path data/grasp-anything++/seen  --iou-eval --seen 1 --use-depth 0 --network <path_to_pretrained_network>

or

$ python evaluate.py --dataset grasp-anywhere --dataset-path data/grasp-anything --add-file-path data/grasp-anything++/seen --iou-eval --seen 0 --use-depth 0 --network <path_to_pretrained_network>

Important note: <path_to_pretrained_network> is the path to the pretrained model obtained from the training procedure. Pretrained models are typically stored at logs/<timestamp>_<training_description>; select the desired one to evaluate. You do not have to specify the network architecture, as the codebase detects it automatically. Pretrained weights are available at this link.
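
For example, to evaluate a model trained with the grasp-anything++ command above (the timestamp and checkpoint file name are placeholders; use the ones produced by your own training run):

$ python evaluate.py --dataset grasp-anywhere --dataset-path data/grasp-anything --add-file-path data/grasp-anything++/seen --iou-eval --seen 1 --use-depth 0 --network logs/<timestamp>_training_grasp_anything++_lgd/<checkpoint_file>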

Acknowledgement

Our codebase is developed based on Vuong et al. If you find our codebase useful, please consider citing:

@InProceedings{Vuong_2024_CVPR,
    author    = {Vuong, An Dinh and Vu, Minh Nhat and Huang, Baoru and Nguyen, Nghia and Le, Hieu and Vo, Thieu and Nguyen, Anh},
    title     = {Language-driven Grasp Detection},
    booktitle = {Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)},
    month     = {June},
    year      = {2024},
    pages     = {17902-17912}
}
