KITE: Keypoint-Conditioned Policies for Semantic Manipulation

[Keypoint Training Repo]

Priya Sundaresan, Suneel Belkhale, Dorsa Sadigh, Jeannette Bohg

KITE is a framework for semantic manipulation using keypoints as a mechanism for grounding language instructions in a visual scene, and a library of keypoint-conditioned skills for execution.
This repo provides the code for training an (image, language) --> keypoint model
See our simulated semantic grasping demo for an example of how this model can be used for downstream semantic manipulation

git clone https://github.com/priyasundaresan/kite_keypoint_training.git

cd /path/to/kite_keypoint_training/docker

./docker_build.py

After this step, run docker images to confirm that the image has built. You should see the following:

REPOSITORY            TAG       IMAGE ID       CREATED       SIZE
lang-manip-training   latest    bf3a316e74c5   10 minutes ago   4.14GB

cd /path/to/kite_keypoint_training/docker
./docker_run.py

You should now be inside the Docker container. Run the following to train on the example semantic_grasping_dset dataset:

python train.py

python analysis.py

Name		Name	Last commit message	Last commit date
Latest commit History 5 Commits
data		data
docker		docker
src		src
.gitattributes		.gitattributes
README.md		README.md
analysis.py		analysis.py
config.py		config.py
train.py		train.py