PufferLib issues

Fix vectorization.py for multiprocessing

1

Example of CleanRL PPO in Nethack

Hi! Is there an example script to train a baseline PPO agent using CleanRL on Nethack? Ty!

PufferLib customized with changes to support frame_stack=4 pokegym, updated counts_map, folder logic

Several changes, including a working counts_map, wandb reporting that can be made less frequent, some (debatable) speed-related changes, stats reporting moved to a different loop, config.yaml updated with the best settings, etc.

xinpw8

Cache the number of elements in the action space

3

You probably don't need to dispatch to numpy every time you call `split` to calculate the number of elements in the space. This PR caches the sizes (in a less than...

thatguy11325
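
The caching idea in the PR above can be illustrated with a minimal sketch. This is not the PR's code: `ActionSplitter`, its constructor argument `shapes`, and the attributes `sizes` and `offsets` are hypothetical names chosen for illustration. The assumption is that each sub-space has a fixed shape, so element counts can be computed once at construction instead of on every `split` call.

```python
import numpy as np


class ActionSplitter:
    """Hypothetical helper: computes per-space element counts once."""

    def __init__(self, shapes):
        # Number of elements in each sub-space, computed a single time.
        # np.prod(()) is 1.0, so a scalar space counts as one element.
        self.sizes = [int(np.prod(shape)) for shape in shapes]
        # Cut points for np.split, also cached at construction.
        self.offsets = np.cumsum(self.sizes)[:-1]

    def split(self, flat):
        # Reuses the cached offsets; no size computation per call.
        return np.split(flat, self.offsets)


splitter = ActionSplitter([(2,), (3,), ()])
parts = splitter.split(np.arange(6))
```

With the shapes above, `parts` holds three arrays of lengths 2, 3, and 1.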

Logits

Should hopefully be faster. Based on my [comparison of different categorical distribution sampling methods](https://gist.github.com/thatguy11325/4df3b4d39e9b707b5ee0e09a7489769c). [Wandb tests](https://wandb.ai/thatguy11325/pufferlib/groups/puf-0.7.0-baseline/workspace?workspace=user-thatguy11325)

thatguy11325

CleanRL PPO demo appears outdated

Attempts to access pufferlib.environments.atari.make_env

JoshuaPurtell

pip install error

7

On Windows WSL, `pip install pufferlib` fails with the following error: Using cached pufferlib-0.4.0.tar.gz (92 kB) Preparing metadata (setup.py) ... error error: subprocess-exited-with-error × python setup.py egg_info did not run successfully....

xiao-hua-sheng

pip install pufferlib gives ValueError: 'pufferlib/extensions.pyx' doesn't match any files

1

pip install pufferlib Collecting pufferlib Using cached pufferlib-0.4.5.tar.gz (94 kB) Installing build dependencies ... done Getting requirements to build wheel ... done Preparing metadata (pyproject.toml) ... done Collecting gym==0.23 (from...

AwesomeCap

'Rating' object is not subscriptable

Found an issue with the `OpenSkillRating` which causes Wandb logging to fail in the policy ranker. Problem is here https://github.com/PufferAI/PufferLib/blob/889f172cb27819f771681c91c9b51f8f1e132a17/pufferlib/policy_ranker.py#L90 ``` Exception has occurred: TypeError (note: full exception trace is...

trangml

Clean up seed and add test

1. Clean unused seed value in environment 2. Add test case for reset and step 3. Change session folder to sub directory Feedback welcome on the test case, was not...

Iron-Bound
