When using experience replay, why don't you update Q_target?

```
# Recompute prediction value and label for replay buffer
if sample_primitive_action == 'push':
    trainer.predicted_value_log[sample_iteration] = [np.max(sample_push_predictions)]
    # trainer.label_value_log[sample_iteration] = [new_sample_label_value]
elif sample_primitive_action == 'grasp':
    trainer.predicted_value_log[sample_iteration] = [np.max(sample_grasp_predictions)]
    # trainer.label_value_log[sample_iteration] = [new_sample_label_value]
```
@andyzeng 

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

When using experience replay, why don't you update Q_target? #81

Metadata

Assignees

Labels

Projects

Milestone

Relationships

Development

When using experience replay, why don't you update Q_target? #81

Description

Metadata

Metadata

Assignees

Labels

Projects

Milestone

Relationships

Development

Issue actions