rl-collision-avoidance icon indicating copy to clipboard operation
rl-collision-avoidance copied to clipboard

Query on stage1_1.pth

Open weihan08 opened this issue 5 years ago • 1 comments

Hello,

Could you please explain the difference between loading the policy of stage1_1.pth and no policy at all? Is the stage1_1.pth some form of supervised learning?

weihan08 avatar Jan 25 '21 05:01 weihan08

Hi, the model in the policy fold is what I have trained. If you want to train from the scratch, you may run the following to trian the stage 1:

rosrun stage_ros_add_pose_and_crash stageros -g worlds/stage1.world
mpiexec -np 24 python ppo_stage1.py

then modify this line to load the policy of stage 1 and train stage2 afterwards:

rosrun stage_ros_add_pose_and_crash stageros -g worlds/stage2.world
mpiexec -np 44 python ppo_stage2.py

Acmece avatar Jan 25 '21 07:01 Acmece