KITE: Keypoint-Conditioned Policies for Semantic Manipulation

[Pointnet2 Architecture for Waypoint-Defined Primitives]

Priya Sundaresan, Suneel Belkhale, Dorsa Sadigh, Jeannette Bohg

[Project] [arXiv]

Description

KITE is a framework for semantic manipulation using keypoints as a mechanism for grounding language instructions in a visual scene, and a library of keypoint-conditioned skills for execution.
This repo provides the code for training a keypoint-conditioned skill policy from point cloud input (point cloud + keypoint --> waypoints).
See our simulated semantic grasping demo for an end-to-end example of KITE, and our keypoint training repo to train your own (image + language --> keypoint) model.

Installation

Create a conda environment from env.yml (we use torch=1.9.0+cu102).

Usage

We provide an example dataset of 20 demonstrations for opening different drawer cabinets (top/middle/bottom). Given an image, KITE's grounding module outputs a keypoint for the appropriate drawer handle, and we deproject this keypoint onto the 3D point cloud. The annotated point cloud is the input to a skill policy, which outputs the waypoints (gripper position/orientation) that the robot arm follows to grasp and open the cabinet handle.
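The deprojection and annotation step described above can be sketched as follows. This is a minimal illustration, not the repo's code: it assumes a pinhole camera model, and the function names, the intrinsics (`fx`, `fy`, `cx`, `cy`), the 2 cm mask radius, and the (N, 7) xyz + rgb + mask layout are all hypothetical choices made for the example.

```python
import numpy as np


def deproject_pixel(u, v, depth_m, fx, fy, cx, cy):
    """Back-project pixel (u, v) with depth in meters to a 3D camera-frame point.

    Assumes a pinhole camera with focal lengths (fx, fy) and principal point (cx, cy).
    """
    x = (u - cx) * depth_m / fx
    y = (v - cy) * depth_m / fy
    return np.array([x, y, depth_m], dtype=np.float64)


def keypoint_mask(points_xyz, keypoint_xyz, radius=0.02):
    """Return a 0/1 mask marking points within `radius` meters of the keypoint."""
    dists = np.linalg.norm(points_xyz - keypoint_xyz, axis=1)
    return (dists <= radius).astype(np.float32)


def annotate_cloud(points_xyz, colors_rgb, keypoint_xyz, radius=0.02):
    """Stack xyz, rgb, and the keypoint mask into one (N, 7) array (assumed layout)."""
    mask = keypoint_mask(points_xyz, keypoint_xyz, radius)
    return np.concatenate([points_xyz, colors_rgb, mask[:, None]], axis=1)


# Toy example: two points, keypoint at the image center with 1 m depth.
points = np.array([[0.0, 0.0, 1.0], [0.5, 0.5, 1.0]])
colors = np.zeros((2, 3))
keypoint = deproject_pixel(u=320, v=240, depth_m=1.0, fx=600.0, fy=600.0, cx=320.0, cy=240.0)
cloud = annotate_cloud(points, colors, keypoint)
```

In this toy case the keypoint lands on the first point, so only that point's mask entry is set. A real pipeline would read the depth value and intrinsics from the camera rather than hard-coding them.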

To train the model:

python train_start.py

To run inference and visualize predictions:

python inference.py

You should see input and output like the following: the input is a point cloud annotated with the deprojected keypoint, and the output is a heatmap of offsets in which blue marks the skill waypoint (orientation is not visualized).

The visualization shows the ground-truth input point cloud (xyz, color, and a mask for the deprojected keypoint) and the predicted waypoint (position/orientation).
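One way to read a waypoint position out of a per-point offset heatmap is sketched below. How this repo actually decodes its output is not stated here, so treat this as an assumption: the sketch supposes each point predicts a vector from itself to the waypoint, and averages the votes of the points with the smallest predicted offsets. The function name `decode_waypoint` and the `top_k` parameter are hypothetical.

```python
import numpy as np


def decode_waypoint(points_xyz, offsets_xyz, top_k=16):
    """Estimate a waypoint position from per-point offset predictions.

    Assumption: offsets_xyz[i] is the predicted vector from point i to the waypoint.
    Returns the estimated position and the per-point offset norms (the "heatmap").
    """
    votes = points_xyz + offsets_xyz              # each point's guess of the waypoint
    norms = np.linalg.norm(offsets_xyz, axis=1)   # small norm = point is near the waypoint
    k = min(top_k, len(points_xyz))
    nearest = np.argsort(norms)[:k]               # trust the points closest to the waypoint
    return votes[nearest].mean(axis=0), norms


# Toy example with perfect offsets: every vote equals the true waypoint.
rng = np.random.default_rng(0)
points = rng.uniform(-1.0, 1.0, size=(100, 3))
true_waypoint = np.array([0.1, 0.2, 0.3])
estimate, heatmap = decode_waypoint(points, true_waypoint - points)
```

Averaging only the lowest-offset points is a design choice for the sketch: nearby points tend to give less noisy votes than distant ones. Orientation is left out here, matching the visualization.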

For an example of training a skill parameterized by more than one waypoint, see train_start_end.py.

Datasets

Data should be organized in the data/ folder as follows:

data/dset_open/
├── test
└── train

where train is organized as follows:

train
├── 00000.npy
├── 00001.npy
├── 00002.npy
...
├── 00047.npy
├── 00048.npy
└── 00049.npy
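Before training, it can help to confirm that a dataset folder matches the layout above. The following is a small sketch using only the standard library. The helper names are hypothetical, and it assumes that test/ uses the same five-digit .npy naming as train/, which the layout above does not state explicitly.

```python
import re
from pathlib import Path

# File names such as 00000.npy: five digits followed by the .npy extension.
EPISODE_PATTERN = re.compile(r"^\d{5}\.npy$")


def list_episodes(split_dir):
    """Return the sorted .npy files in split_dir whose names look like 00000.npy."""
    split_dir = Path(split_dir)
    return sorted(
        p for p in split_dir.iterdir()
        if p.is_file() and EPISODE_PATTERN.match(p.name)
    )


def check_dataset(root, splits=("train", "test")):
    """Return {split: number of episode files}; raise if a split folder is missing."""
    root = Path(root)
    counts = {}
    for split in splits:
        split_dir = root / split
        if not split_dir.is_dir():
            raise FileNotFoundError(f"missing split folder: {split_dir}")
        counts[split] = len(list_episodes(split_dir))
    return counts
```

For example, `check_dataset("data/dset_open")` returns the number of demonstration files found in each split.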

Repository: priyasundaresan/pointnet2_primitives

Folders and files

Latest commit

History

Repository files navigation

KITE: Keypoint-Conditioned Policies for Semantic Manipulation

[Pointnet2 Architecture for Waypoint-Defined Primitives]

Description

Table of Contents

Installation

Usage

Datasets

About

Resources

Stars

Watchers

Forks

Languages