This repo hosts the code for the online and offline reinforcement learning experiments on the Simulated Industrial Manufacturing and Process Control Learning Environments (SMPL).
$ pip install -r requirements.txt
To run the online RL experiments, simply run the `online_experiments.sh` script; edit `online_experiments.yaml` for different configurations. Alternatively, you can go into the directory of a specific environment (e.g. `mabenv_experiments`) and execute its `online_experiments.sh` (which runs `online_experiments.py` for the online RL algorithms) there with that environment's own configurations, again editable in its `online_experiments.yaml`.
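Such a YAML configuration is typically loaded at the top of the training script. A minimal sketch of parsing it with PyYAML; the key names below are illustrative assumptions, not the repo's actual schema:

```python
import yaml  # PyYAML (third-party)

# Illustrative config text; the real online_experiments.yaml
# defines its own keys for environments, algorithms, etc.
config_text = """
env_name: mabenv
algorithms: [PPO, SAC]
training_steps: 100000
"""

config = yaml.safe_load(config_text)
print(config["env_name"])    # mabenv
print(config["algorithms"])  # ['PPO', 'SAC']
```

`yaml.safe_load` returns a plain dict, so the rest of the script can read settings with ordinary key lookups.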
After you have trained an online RL algorithm, you can run inference with `online_inference.py`. Set `env_name`, `model_names`, `best_checkpoint_paths`, and `config_dirs` accordingly so that the correct checkpoint(s) are loaded. You can also set the plot configurations to visualize how the trained algorithm actually performs. For more details, please consult the docstring in `online_inference.py` and this documentation.
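The inference settings are effectively parallel lists: each model name pairs with one checkpoint path and one config directory. A hedged sketch of such settings, with a sanity check before loading; the names and paths below are placeholders, not actual repo values:

```python
# Placeholder values; substitute your own environment, models, and paths.
env_name = "mabenv"
model_names = ["PPO", "SAC"]
best_checkpoint_paths = [
    "results/PPO/checkpoint_000100",
    "results/SAC/checkpoint_000100",
]
config_dirs = ["results/PPO", "results/SAC"]

# Each model needs exactly one checkpoint and one config directory,
# so the three lists must have the same length.
assert len(model_names) == len(best_checkpoint_paths) == len(config_dirs)
```

Checking the list lengths up front fails fast instead of loading the wrong checkpoint for a model.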
For the offline RL experiments, you first need to generate a dataset with the baseline algorithm using the `offline_data_generation.py` script located in `{env_name}_experiments`. After successfully generating the training, evaluation, and testing initial states and datasets, you can use `offlineRL_training.py` to train the offline RL algorithms. Don't forget that you can edit the configurations in `offline_experiments.yaml`.
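The data-generation step above produces separate training, evaluation, and testing splits of initial states. A minimal sketch of one way to make such a split; the number of states and the split ratios are illustrative assumptions, not the repo's actual values:

```python
import random

# Assume 100 candidate initial states; the real script samples them
# from the environment's state space.
initial_states = list(range(100))
random.seed(0)  # fix the shuffle so the split is reproducible
random.shuffle(initial_states)

# 80/10/10 split (ratios are an illustrative assumption)
train_states = initial_states[:80]
eval_states = initial_states[80:90]
test_states = initial_states[90:]
print(len(train_states), len(eval_states), len(test_states))  # 80 10 10
```

Keeping the three splits disjoint ensures the offline RL algorithms are evaluated and tested on initial states they never trained from.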
The `OFFLINE_BEST.yaml` in `{env_name}_experiments` specifies the location of your current offline RL experiments. For example, if you finished a Behavior Cloning experiment and put `"d3rlpy_logs/42"` in `OFFLINE_BEST.yaml`, then you should be able to locate the best checkpoint at `d3rlpy_logs/42/BC/best.pt`, which is the checkpoint used to perform inference with `offline_inference.py`. Again, you can set the plot configurations to analyze and visualize the results.
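Following the layout in the example above, the best-checkpoint path is just the configured log directory joined with the algorithm name and `best.pt`. A sketch of that lookup; the helper function below is hypothetical, not part of the repo:

```python
import os

def best_checkpoint(log_dir: str, algo: str) -> str:
    """Resolve the best-checkpoint path under a log directory,
    following the {log_dir}/{algo}/best.pt layout described above.
    (Hypothetical helper for illustration.)"""
    return os.path.join(log_dir, algo, "best.pt")

print(best_checkpoint("d3rlpy_logs/42", "BC"))
```

With the example values from `OFFLINE_BEST.yaml`, this resolves to `d3rlpy_logs/42/BC/best.pt`.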