
[rllib][tune] How to save and later use the agent/model #7983

Closed
Rockyyost opened this issue Apr 11, 2020 · 6 comments
Labels
question Just a question :)

Comments

@Rockyyost

What is your question?

I've successfully used Tune to train an RL model with checkpoints. I'd like to be able to save the model so that I can use it later for inference. For example, I will use an API to bring in data that is effectively my observations, and I'd like the saved model to give me the actions.

I've been reading through the documentation and I think I'm starting to build the intuition on how to do this, but I feel like I'm not quite there yet.

Any help will be greatly appreciated!

Rockyyost added the question label on Apr 11, 2020
@Carlz182

What I am using at the moment is a combination of the tune.run call and a customizable trainer function that specifies how often checkpoints are saved, implements a training curriculum, and so on.

from ray.rllib.agents.ppo import PPOTrainer  # RLlib PPO trainer (import path for older Ray versions)

def train_ppo(config, reporter):
    agent = PPOTrainer(config)
    agent.restore("/path/checkpoint_41/checkpoint-41")  # continue training from a checkpoint
    # training curriculum, start with phase 0
    phase = 0
    agent.workers.foreach_worker(
        lambda ev: ev.foreach_env(
            lambda env: env.set_phase(phase)))
    i = 0
    while True:
        result = agent.train()
        if reporter is not None:  # reporter is None when calling this function directly
            reporter(**result)
        if i % 10 == 0:  # save every 10th training iteration
            checkpoint_path = agent.save()
            print(checkpoint_path)
        i += 1
        # you can also change the curriculum here

You can use this function either directly by calling train_ppo(config, None) or inside a tune call:

from ray import tune

training_steps = 1000000
trials = tune.run(
    train_ppo,
    config=config,
    resources_per_trial={
        "cpu": 7,
        "gpu": 1,
        "extra_cpu": 0,
    },
    stop={
        "training_iteration": training_steps,
    },
    return_trials=True)

Note that when using Tune you will end up with two folders in your directory: one generated by Tune and the other by the trainer function. The latter contains your checkpoints. You can also wrap the training configuration in a Trainer class that implements train, setup, and save functions, but I have not tried that successfully.
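
For reference, here is a rough, untested sketch of that wrapping idea using Tune's Trainable class. The underscore-prefixed hook names are an assumption based on older Ray versions; verify them against the Ray version you are running.

from ray import tune
from ray.rllib.agents.ppo import PPOTrainer

class PPOTrainable(tune.Trainable):
    # minimal wrapper around a PPOTrainer; _setup/_train/_save/_restore
    # were the Trainable hooks in older Ray releases
    def _setup(self, config):
        self.agent = PPOTrainer(config)

    def _train(self):
        return self.agent.train()

    def _save(self, checkpoint_dir):
        return self.agent.save(checkpoint_dir)

    def _restore(self, checkpoint_path):
        self.agent.restore(checkpoint_path)

# usage sketch:
# tune.run(PPOTrainable, config=config, checkpoint_freq=10,
#          stop={"training_iteration": 1000})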

For inference you generate the agent again and load the checkpoints:

agent = PPOTrainer(ppo_config)
agent.restore(checkpoint_path)

Then you can get the action using the pre-trained model like this: agent.compute_action(observation)

Often you don't want to use the training environment for inference, because you want to avoid running a physics simulation. The easiest way is to create a fake environment with the same observation and action spaces and provide it to your agent via the config: ppo_config['env'] = FakeEnv. Then use the real-world data as observations.
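
A minimal sketch of such a fake environment (the spaces and shapes below are placeholders; match them to your real training environment, and real_world_observation is a made-up name for your API data):

import gym
import numpy as np
from gym import spaces

class FakeEnv(gym.Env):
    # dummy env whose only job is to expose the same spaces as the training env,
    # so RLlib can rebuild the policy for inference
    def __init__(self, env_config=None):
        self.observation_space = spaces.Box(-np.inf, np.inf, shape=(4,), dtype=np.float32)  # placeholder shape
        self.action_space = spaces.Discrete(2)  # placeholder

    def reset(self):
        return self.observation_space.sample()

    def step(self, action):
        return self.observation_space.sample(), 0.0, True, {}

# ppo_config["env"] = FakeEnv
# agent = PPOTrainer(ppo_config)
# agent.restore(checkpoint_path)
# action = agent.compute_action(real_world_observation)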

@Rockyyost
Author

Awesome! This advice worked out great for me.

Thank you!

@stefanbschneider
Member

stefanbschneider commented Jun 23, 2020

Any way to change the directory to which the agent is saved when calling .save()?

Update: OK, got it; just pass the path as an argument. The path seems to be relative to the experiment directory, which contains some hash code. Any idea how to get the absolute path?
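
For reference, a minimal sketch of the "pass the path as arg" part (the directory name is made up; passing an absolute path sidesteps the relative-path question):

# save() accepts a directory; using an absolute path avoids the
# "relative to the experiment directory" ambiguity mentioned above
checkpoint_path = agent.save("/absolute/path/to/my_checkpoints")
print(checkpoint_path)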

@stefanbschneider
Member

Is it possible to restore an agent and continue training it with tune.run() without any custom train_ppo function?

I'm looking for something like this (which doesn't work):

# restore the agent outside of tune.run()
agent = PPOTrainer(config)
agent.restore("/path/checkpoint_41/checkpoint-41")
# then continue training it; this breaks
tune.run(agent, config)

@Catypad

Catypad commented Aug 20, 2020

Hello @stefanbschneider! Can you explain how to change the directory where the agent is saved when you define the agent yourself and then train with agent.train()? Thank you!

@stefanbschneider
Member

By setting the local_dir argument in tune.run(): #9123 (comment)
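
A minimal sketch of that (local_dir redirects where Tune writes its experiment folders; the default is ~/ray_results):

from ray import tune

tune.run(
    "PPO",
    config=config,
    checkpoint_freq=10,
    local_dir="/absolute/path/to/results",
)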

But I think I already resolved my question regarding restoring a trained agent. Thanks anyways!
