Panda Control RL Suite

Description

The goal of this project is to solve every task in the panda-gym library. The repo can also be used as a starting point for further research without having to set up the project from scratch.

  • The simulation is based on the PyBullet physics engine, wrapped in Gymnasium.

  • Environments are created with panda-gym.

  • Baselines are obtained with Stable-Baselines3 (a minimal sketch of this stack is shown below).
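
The snippet below is a minimal sketch of the stack described above, not code from this repo: importing panda_gym registers the Panda environments, Gymnasium builds PandaReach-v3, and a Stable-Baselines3 DDPG agent is trained on it. The policy name and step count are illustrative.

import gymnasium as gym
import panda_gym  # registers the Panda* environments on import
from stable_baselines3 import DDPG

# Create the goal-conditioned environment (simulated with PyBullet)
env = gym.make("PandaReach-v3")

# panda-gym observations are dictionaries (observation / achieved_goal /
# desired_goal), so Stable-Baselines3 needs the MultiInputPolicy
model = DDPG("MultiInputPolicy", env, verbose=1)
model.learn(total_timesteps=10_000)
env.close()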

How to run

First, install the dependencies (Python 3.8 is required):

# clone project   
git clone https://github.com/lambdavi/panda-control-rl.git

# install requirements   
cd panda-control-rl 
pip install -r requirements.txt

Available scripts

# run training script (example: training on PickAndPlace-v3 task)   
python train.py --env_id PandaPickAndPlace-v3 --algo ddpg

# run hyperparameter tuning (example: on PandaReach-v3 with SAC) 
python tuning.py --env_id PandaReach-v3 --algo sac

# evaluate your agent
python eval.py --env_id PandaReach-v3 --algo ddpg --path models/PandaReach_DDPG_50000_steps.zip

# just visualize the environment (random actions)
python visualize.py --env_id PandaReach-v3
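
For context, evaluation boils down to loading a Stable-Baselines3 checkpoint and rolling it out in the environment. The sketch below is an assumption about what eval.py roughly does, not an excerpt from it; the checkpoint path is the one used in the example command above.

import gymnasium as gym
import panda_gym  # registers the Panda* environments
from stable_baselines3 import DDPG

env = gym.make("PandaReach-v3", render_mode="human")
model = DDPG.load("models/PandaReach_DDPG_50000_steps.zip", env=env)

obs, _ = env.reset()
for _ in range(1000):
    # Act greedily with the trained policy
    action, _ = model.predict(obs, deterministic=True)
    obs, reward, terminated, truncated, _ = env.step(action)
    if terminated or truncated:
        obs, _ = env.reset()
env.close()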

Arguments for train.py (see the sketch after this list for how the defaults map onto the agent constructor):

  • --lr: Learning rate
    • Type: float
    • Default: 0.001
  • --gamma: Discount factor
    • Type: float
    • Default: 0.99
  • --buffer_size: Replay buffer size
    • Type: int
    • Default: 1,000,000
  • --batch_size: Batch size
    • Type: int
    • Default: 100
  • --tau: Soft update coefficient (tau)
    • Type: float
    • Default: 0.005
  • --learning_starts: Number of steps collected before learning starts
    • Type: int
    • Default: 100
  • --steps: Total number of training steps
    • Type: int
    • Default: 50,000
  • --env_id: Environment ID
    • Type: str
    • Default: "PandaReach-v3"
    • Choices: ["PandaReach-v3", "PandaReachDense-v3", "PandaPickAndPlace-v3", "PandaPickAndPlaceDense-v3"]
  • --algo: Algorithm used to solve the task
    • Type: str
    • Default: "ddpg"
    • Choices: ["ddpg", "sac", "dqn"]

Citation

@article{gallouedec2021pandagym,
    title        = {{panda-gym: Open-Source Goal-Conditioned Environments for Robotic Learning}},
    author       = {Gallou{\'e}dec, Quentin and Cazin, Nicolas and Dellandr{\'e}a, Emmanuel and Chen, Liming},
    year         = 2021,
    journal      = {4th Robot Learning Workshop: Self-Supervised and Lifelong Learning at NeurIPS},
}
