Improve Boids Reward by PLAZMAMA · Pull Request #246 · PufferAI/PufferLib

PLAZMAMA · 2025-06-05T07:02:06Z

Description

This PR's goal is to improve the reward calculation of the boids env and train a policy on it.

Todo

PLAZMAMA · 2025-10-01T13:01:03Z

Hi @jsuarez5341, sorry for being out for the last 3 months; I got a new job and stuff.

However, I was able to improve the policy and solve Boids successfully with all of the rewards.
All that's left is to do a sweep, which I don't have the ability to do right now.

P.S. When I get situated in my new job (a couple of weeks, hopefully), I'll come back to PufferLib and contribute in my free time.
I look forward to working on cool stuff together again soon!

PLAZMAMA · 2026-03-02T06:24:51Z

@jsuarez5341 I see that this PR has not been looked at (probably because I forgot to leave it as a draft). I think it will need to be rebased onto Puffer 4.0. If you would like me to do that, let me know. If Boids is no longer relevant and isn't worth working on anymore, please let me know as well.

PLAZMAMA · 2026-04-10T01:29:41Z

Closing this PR, as I just made a new PR migrating its improvements to 4.0.

PLAZMAMA marked this pull request as draft June 5, 2025 07:58

PLAZMAMA force-pushed the improve_boids branch from 3945f0f to 7a6cfd1 June 12, 2025 01:30

PLAZMAMA force-pushed the improve_boids branch from e0825a6 to 77976e7 July 7, 2025 04:38

PLAZMAMA added 27 commits July 27, 2025 16:32

changed policy and rename factors to match common names

0734d89

remove unused log fields

e94a4aa

remove unused variable

817dc14

remove unused commented code

e232d3e

remove unused boid_logs and fix logs calculation

4eb410a

fix overflow and zero report_interval

1d424d7

add above zero checks for num_boids and report_interval

7376334

remove unused commented flat_actions

ff483f6

simplify separation reward and test it

26bebef

test out only avoid factor

06878eb

remove unused avg_reward and change separation factor reward

35de375

fix factor names

9709a46

remove unused commented code

1c28c72

fix separation factor reward calculation

bf8c75f

remove unused commented params

23e2399

remove normalization from separation factor calculation

bb162fb

fix visual range

85d5891

remove positive margin rewards and remove commented code

435ac9e

add factors to env run with "boids.c"

463f60a

add debug margin lines and adjust reward normalization

00af443

only turn on margin turn factor and adjust total timesteps

fc4e722

change top/bottom margins

a413221

account for boid width and height in margin reward calculation

342d83c

increase max steps

01a84c0

remove debug margin lines

117e9b6

fix observations for margin factor

618cb0b

remove single agent params

7526366

PLAZMAMA added 11 commits July 27, 2025 16:32

update boids.c observations allocation

2d38d98

update observations and actions comments

971732b

remove commented parameters and update parameters to current best

bcecdd2

fix to "separation_factor" instead of "seperation_factor"

a9f1b98

update preset env parameters

9dc328f

condense controlled boid observation loop

4b339a3

remove use of protected range diff

e56d28f

change reward normalization number

cc8a397

update puffer resource path

0eda889

enable all factors

6158014

add euclidean distance to observations

e67ad51

PLAZMAMA force-pushed the improve_boids branch from 9839e16 to e67ad51 July 27, 2025 20:32

add euclidean distance to local build observations

0bf4d74

PLAZMAMA marked this pull request as ready for review March 2, 2026 06:22

PLAZMAMA mentioned this pull request Apr 10, 2026

Improve boids 4.0 #523

Draft

PLAZMAMA closed this Apr 10, 2026

PLAZMAMA wants to merge 39 commits into PufferAI:3.0 from PLAZMAMA:improve_boids