Skip to content
master
Switch branches/tags
Code

Latest commit

 

Git stats

Files

Permalink
Failed to load latest commit information.
Type
Name
Latest commit message
Commit time
 
 
 
 
 
 
 
 
 
 
 
 

Scaling Robot Supervision to Hundreds of Hours with RoboTurk: Robotic Manipulation Dataset through Human Reasoning and Dexterity

For more information, see the RoboTurk Project from Stanford University

Introduction

This library provides scripts to parse the HDF5 file and aligned with the videos in the video directory. We also provide scripts to process and split the dataset for video prediction SV2P along with the necessary changes needed in tensor2tensor to match the hyperparameters in the original paper.

Citation

If you find our work useful in your research, please consider citing:

@misc{m2019scaling,
    title={Scaling Robot Supervision to Hundreds of Hours with RoboTurk: Robotic Manipulation Dataset through Human Reasoning and Dexterity},
    author={Ajay Mandlekar and Jonathan Booher and Max Spero and Albert Tung and Anchit Gupta and Yuke Zhu and Animesh Garg and Silvio Savarese and Li Fei-Fei},
    year={2019},
    eprint={1911.04052},
    archivePrefix={arXiv},
    primaryClass={cs.RO}
}

Installation

Tensorflow and Tensor2Tensor. It is required that you have access to GPUs to train. We will also provide a requirements.txt file for installing a python virtual environment with the required dependencies.

The code is tested with Python 3.6.

Parsing the Dataset and Training Video Prediction

We have outlined a more detailed wiki page with associated code blocks here that can help you parse the dataset and documents the video prediction pipeline: https://github.com/StanfordVL/roboturk_real_dataset/wiki

Video Prediction Metrics

We show that video prediction metrics of our reproduced SV2P results on BAIR, SawyerLaundryLayout and SawyerTowerCreation. The orange line indicates the number of frames that were predicted on during training, matching the hyperparameters of the original SV2P paper. We have also included the raw values of the video prediction dataset under the results folder.

License

This dataset and codebase are released under the MIT License.

Acknowledgements

Thanks to Suraj Nair for assistance with the SV2P Tensor2Tensor code setup.

Change Log

About

No description, website, or topics provided.

Resources

License

Releases

No releases published

Packages

No packages published