wangsd01

This part adds the divergence between the predicted trajectory and the sampled trajectory as an additional cost, i.e. (Kx + k - u).T * inverse_policy_variance_matrix * (Kx + k - u), where u is the sampled action...
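
To make the quadratic form above concrete, here is a minimal sketch in plain Python. It assumes K is a gain matrix, k an offset vector, x the state, u the sampled action, and that the inverse policy variance matrix is already available. The function name `policy_divergence_cost` and all numeric values are hypothetical and chosen only for illustration; they are not taken from the code being discussed.

```python
def policy_divergence_cost(K, k, x, u, inv_policy_cov):
    """Return (Kx + k - u)^T * inv_policy_cov * (Kx + k - u)."""
    # Action predicted by the linear policy: K x + k.
    predicted = [
        sum(K_ij * x_j for K_ij, x_j in zip(row, x)) + k_i
        for row, k_i in zip(K, k)
    ]
    # Difference between the predicted action and the sampled action.
    diff = [p - u_i for p, u_i in zip(predicted, u)]
    # Apply the inverse policy variance matrix to the difference.
    weighted = [
        sum(m_ij * d_j for m_ij, d_j in zip(row, diff))
        for row in inv_policy_cov
    ]
    # Inner product completes the quadratic form diff^T * M * diff.
    return sum(d_i * w_i for d_i, w_i in zip(diff, weighted))


# Hypothetical example with a 2-D state and a 2-D action.
K = [[1.0, 0.0], [0.0, 2.0]]
k = [0.5, -1.0]
x = [1.0, 1.0]
u = [1.0, 0.0]
inv_policy_cov = [[2.0, 0.0], [0.0, 4.0]]

cost = policy_divergence_cost(K, k, x, u, inv_policy_cov)
print(cost)  # 4.5
```

The cost is zero when the sampled action equals Kx + k and grows as the sample moves away from it, with directions of low policy variance penalized more heavily.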