wangsd01
This part adds the divergence between the predicted trajectory and the sampled trajectory as an additional cost, i.e. (Kx + k - u).T * inverse_policy_variance_matrix * (Kx + k - u), where u is the sampled action...
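As a rough illustration of the expression above, here is a minimal NumPy sketch. The function name `policy_divergence_cost`, the argument names, and the example dimensions are hypothetical and not taken from the code under discussion; it only assumes that K is the feedback gain, k the offset, x the state, and u the sampled action.

```python
import numpy as np


def policy_divergence_cost(K, k, x, u, inv_policy_cov):
    """Return (Kx + k - u).T * inv_policy_cov * (Kx + k - u) as a scalar.

    All names here are illustrative assumptions, not the original code's API.
    """
    # Difference between the action predicted by the linear policy and the
    # action that was actually sampled.
    diff = K @ x + k - u
    # Quadratic form weighted by the inverse policy covariance.
    return float(diff @ inv_policy_cov @ diff)


# Hypothetical example: 3-dimensional state, 2-dimensional action.
K = np.array([[1.0, 0.0, 0.5],
              [0.0, 2.0, 0.0]])
k = np.array([0.1, -0.2])
x = np.array([1.0, 2.0, 3.0])
u = np.array([2.0, 3.0])
inv_policy_cov = np.diag([2.0, 0.5])

cost = policy_divergence_cost(K, k, x, u, inv_policy_cov)

# When the sampled action equals the policy mean, the extra cost vanishes.
zero_cost = policy_divergence_cost(K, k, x, K @ x + k, inv_policy_cov)
```

The cost is zero when the sampled action matches Kx + k and grows quadratically with the deviation, with low-variance action dimensions penalized more heavily.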