cat-state

Results 18 comments of cat-state

``` trlx/model/accelerate_base_model.py:6:1: F401 'torch.nn.functional as F' imported but unused trlx/model/accelerate_base_model.py:9:1: F401 'torch.utils.data.DataLoader' imported but unused trlx/model/accelerate_base_model.py:11:1: F401 'transformers.AutoConfig' imported but unused trlx/model/accelerate_base_model.py:13:1: F401 'trlx.data.BatchElement' imported but unused trlx/model/accelerate_base_model.py:13:1: F401 'trlx.data.RLElement'...

Should merge https://github.com/CarperAI/trlx/pull/24 first as most of the errors re in files touched by it

Addressed by Triton Inference Server client https://github.com/CarperAI/trlx/tree/add-hh-example

> I can get to it tomorrow or Monday. I'm wondering what the API should be to avoid modifying the model definitions? I think it would be like, instead of...

@albertsun1 > Hey! I'm new to contributing to trlx, would it be worth for me to give this a go for the ppo/ilql sentiment examples? Sure, although you might need...

So I see that WandB actually lists the commit hash used for a run. So if we could find/tag TRLX runs in wandb then each commit could be matched up...

Also ultimately depends on https://github.com/CarperAI/trlx/issues/14 for larger scale

https://github.com/CarperAI/trlx/pull/39

This would also tie in to MCTS in the future, although that would likely require more thought on how to do it efficiently