PufferLib icon indicating copy to clipboard operation
PufferLib copied to clipboard

Improve Boids Reward

Open PLAZMAMA opened this issue 10 months ago • 1 comments

Description

This PRs goal is to improve the reward calculation of the boids env and train a policy on it

Todo

  • [x] Improve reward calculations

    • [x] margin factor
    • [x] separation factor
    • [x] cohesion factor
    • [x] alignment factor
  • [x] Train policy on two factors

    • [x] Margin and seperation
    • [x] Cohesion and factor
  • [x] Train policy successfully on all factors

PLAZMAMA avatar Jun 05 '25 07:06 PLAZMAMA

Hi @jsuarez5341, sorry for being out for the last 3 months, I got a new job and stuff.

However, I was able to improve the policy and solve Boids successfully with all of the rewards. All that's left is to do a sweep which I don't have the ability to do right now.

P.S. When I get situated in my new job(couple weeks hopefully) I'll come back to PufferLib and contribute in my free time. I look forward to working on cool stuff together again soon!

PLAZMAMA avatar Oct 01 '25 13:10 PLAZMAMA