We read every piece of feedback, and take your input very seriously.
To see all available qualifiers, see our documentation.
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
In train.py, your code is as follows:
for i, transition in enumerate(episode_cache): new_goals = generate_goals(i, episode_cache, args.HER_sample_num) for new_goal in new_goals: reward = calcu_reward(new_goal, state, action) state, action, new_state = gene_new_sas(new_goal, transition) ram.add(state, action, reward, new_state)
But I think it should be like that:
for i, transition in enumerate(episode_cache): new_goals = generate_goals(i, episode_cache, args.HER_sample_num) for new_goal in new_goals: state = transition[0] action = transition[1] reward = calcu_reward(new_goal, state, action) state, action, new_state = gene_new_sas(new_goal, transition) # 一个transition被换成了各种goals ram.add(state, action, reward, new_state)
Otherwise, this algorithm is not convergent. I have tried to train it.
The text was updated successfully, but these errors were encountered:
No branches or pull requests
In train.py, your code is as follows:
PART II hindsight replay
But I think it should be like that:
PART II hindsight replay
Otherwise, this algorithm is not convergent. I have tried to train it.
The text was updated successfully, but these errors were encountered: