visual-pushing-grasping icon indicating copy to clipboard operation
visual-pushing-grasping copied to clipboard

When using experience replay, why don't you update Q_target?

Open Zixin-Tang opened this issue 4 years ago • 0 comments

# Recompute prediction value and label for replay buffer
if sample_primitive_action == 'push':
    trainer.predicted_value_log[sample_iteration] = [np.max(sample_push_predictions)]
    # trainer.label_value_log[sample_iteration] = [new_sample_label_value]
elif sample_primitive_action == 'grasp':
    trainer.predicted_value_log[sample_iteration] = [np.max(sample_grasp_predictions)]
    # trainer.label_value_log[sample_iteration] = [new_sample_label_value]

@andyzeng

Zixin-Tang avatar May 17 '21 14:05 Zixin-Tang