
Custom model for Soft-Actor-Critic in [rllib] #13218

Closed
sahikagenc opened this issue Jan 5, 2021 · 4 comments
Labels: enhancement (Request for new feature and/or capability), P2 (Important issue, but not time-critical)

Comments

@sahikagenc commented:

Is it possible to provide a custom model to SAC from a configuration file, as is already possible via the `model` parameter? For example:

    # Model options for the Q-network(s).
    "Q_model": {
        "model": "SoftQ_V2_model_sac",
        # "fcnet_activation": "relu",
        # "fcnet_hiddens": [256, 256],
    },
    # Model options for the policy function.
    "policy_model": {
        "model": "Policy_V2_model_sac",
        # "fcnet_activation": "relu",
        # "fcnet_hiddens": [256, 256],
    },
@sahikagenc sahikagenc added the enhancement Request for new feature and/or capability label Jan 5, 2021
@sven1977 sven1977 added P2 Important issue, but not time-critical rllib labels Jan 13, 2021
@sven1977 sven1977 self-assigned this Jan 13, 2021
@sven1977 (Contributor) commented:

Great question, @sahikagenc. Let me try to make this work with the existing model-building APIs. ...

@sven1977 (Contributor) commented:

Btw, did you try simply sub-classing SACTFModel or SACTorchModel and implementing your own get_q_value, get_policy_output, etc. logic?

@sven1977 (Contributor) commented:

There is also a bug in SAC that prevents it from learning the "state-preprocessor" (e.g. when you use a CNN in front of the policy- and Q-nets). The problem is in SAC's compute_and_clip_gradients (tf) and optimizer_fn (torch): the optimizers are told to optimize only the policy- and Q-nets, never the pre-network.
Fixing this now ...
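The bug described above can be illustrated with a small sketch. This is a simplification (real RLlib code collects tf/torch variables from the model and passes them to optimizers); the variable names below are made up for illustration. The point is that a shared preprocessor's variables must be included in some optimizer's variable list, or they never receive gradient updates.

```python
# Illustration of the described bug: if the optimizers are built only from the
# policy- and Q-net variables, a shared state-preprocessor's (e.g. CNN's)
# variables are never updated. Variable names here are illustrative only.

policy_vars = ["policy/w1", "policy/w2"]
q_vars = ["q/w1", "q/w2"]
preprocessor_vars = ["cnn/conv1", "cnn/conv2"]  # shared CNN in front

# Buggy behavior: the CNN variables are missing from the trainable set.
buggy_trainable = policy_vars + q_vars

# Fixed behavior: the pre-network variables are optimized as well.
fixed_trainable = preprocessor_vars + policy_vars + q_vars

assert "cnn/conv1" not in buggy_trainable
assert "cnn/conv1" in fixed_trainable
```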

@sven1977 (Contributor) commented:

This is now fixed (via this PR: #13522). You can provide custom models to SAC via the following options:

1. Custom Q-model:

       config:
           Q_model:
               custom_model: [your registered custom Q-model class]

2. Custom policy-model:

       config:
           policy_model:
               custom_model: [your registered custom policy-model class]

3. Custom SAC model (as a whole):
   - sub-class SACTFModel or SACTorchModel
   - override the new build_policy_model and build_q_model methods there to return whatever custom model(s) you want.
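Option 3 follows the template-method pattern: the base model calls overridable hook methods to build its sub-networks. The sketch below is a self-contained mock of that pattern; the `build_policy_model`/`build_q_model` hook names mirror the ones the PR adds to SACTF|TorchModel, but the classes and string return values here are simplified stand-ins (real code would return nn.Module / tf.keras.Model instances).

```python
# Mock of the "sub-class and override the build hooks" pattern (option 3).
# In real RLlib code the base class would be SACTorchModel or SACTFModel and
# the hooks would return actual network objects; strings are used here only
# to keep the sketch self-contained.

class BaseSACModel:
    """Stand-in for SACTF|TorchModel: builds sub-nets via hook methods."""

    def __init__(self):
        self.policy_net = self.build_policy_model()
        self.q_net = self.build_q_model()

    def build_policy_model(self):
        return "default-policy-net"

    def build_q_model(self):
        return "default-q-net"

class MyCustomSACModel(BaseSACModel):
    """Override the hooks to plug in custom networks."""

    def build_policy_model(self):
        return "my-policy-net"  # e.g. a custom CNN-backed policy

    def build_q_model(self):
        return "my-q-net"       # e.g. a custom Q-network

model = MyCustomSACModel()
assert model.policy_net == "my-policy-net"
assert model.q_net == "my-q-net"
```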

Closing this issue now.
