Question about the configuration dictionary for the default high/low rewrd values for each envs #3

HYDesmondLiu · 2023-11-17T18:28:21Z

As inspecting through your codes, I found there is a function cal_return_to_go which requires a config dictionary for the high/low reward values for each env.

What is its purpose and what if in real-world problems we cannot ensure the high/low rewards of the environment?

The text was updated successfully, but these errors were encountered:

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Question about the configuration dictionary for the default high/low rewrd values for each envs #3

Question about the configuration dictionary for the default high/low rewrd values for each envs #3

HYDesmondLiu commented Nov 17, 2023

Question about the configuration dictionary for the default high/low rewrd values for each envs #3

Question about the configuration dictionary for the default high/low rewrd values for each envs #3

Comments

HYDesmondLiu commented Nov 17, 2023