Skip to content
Please note that GitHub no longer supports Internet Explorer.

We recommend upgrading to the latest Microsoft Edge, Google Chrome, or Firefox.

Learn more
Branch: master
Find file History
Alex Irpan Copybara-Service
Alex Irpan and Copybara-Service Support deferred decoding of images. Modify meta_learning reshaping t…
…o support dynamic shapes. Add random subsampling util.

PiperOrigin-RevId: 291050120
Latest commit 3ab8aa5 Jan 22, 2020

README.md

QT-Opt

This directory contains network architecture definitions for the Grasping critic architecture described in QT-Opt: Scalable Deep Reinforcement Learning for Vision-Based Robotic Manipulation.

Running the code

The following command trains the QT-Opt critic architecture for a few gradient steps with mock data (real data is not included in this repo). The learning obective resembles supervised learning, since Bellman targets in QT-Opt are computed in a separate process (not open-sourced).

git clone https://github.com/google/tensor2robot
# Optional: Create a virtualenv
python3 -m venv ~/venv
source ~/venv/bin/activate
pip install -r tensor2robot/requirements.txt
python -m tensor2robot.research.qtopt.t2r_models_test
You can’t perform that action at this time.