libowen424 comments

Results: 3 comments of libowen424

The configuration for Llama-7b on 4 RTX4090

I succeeded with the following configuration: ` set -x export PATH=$HOME/.local/bin/:$PATH ray job submit --address="http://127.0.0.1:8265" \ --runtime-env-json='{"working_dir": "/openrlhf", "pip": "/openrlhf/requirements.txt"}' \ -- python3 examples/train_ppo_ray.py \ --ref_num_nodes 1 \ --ref_num_gpus_per_node 1...

can't set model.eval

Actually, I find that after `self.model.zero_grad()`, `loss.backward()`, and `data_grad = data.grad.data`, `data_grad` is NaN. Your code optimizes the model, not the `data`, which is weird.
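For reference, here is a minimal sketch of taking the gradient with respect to the input rather than the model weights, and checking it for NaN. The model, input shapes, and labels are hypothetical stand-ins, since the code from the thread is not shown; this only illustrates the three calls quoted above and is not the original implementation.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

torch.manual_seed(0)

# Hypothetical stand-in model; the model discussed in the thread is not shown.
model = nn.Sequential(nn.Linear(4, 8), nn.ReLU(), nn.Linear(8, 3))
model.eval()  # eval mode changes layers such as dropout; it does not turn off autograd

# Hypothetical input batch and labels. The input must be a leaf tensor with
# requires_grad=True, otherwise data.grad stays None after backward().
data = torch.randn(2, 4, requires_grad=True)
target = torch.tensor([0, 2])

loss = F.cross_entropy(model(data), target)

model.zero_grad()   # clear any stale parameter gradients
loss.backward()     # fills data.grad as well as the parameter gradients

data_grad = data.grad.detach()  # gradient of the loss w.r.t. the input
has_nan = bool(torch.isnan(data_grad).any())
print(data_grad.shape, has_nan)
```

If `has_nan` comes out true on a real model, checking `torch.isnan(loss)` and the input itself first may help narrow down where the NaN first appears.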

can't set model.eval

> Thanks, I will try to find a way to solve this too. Perhaps it's actually due to a limitation of the framework-level implementation.