yechenzhi

Results 3 issues of yechenzhi

As previously discussed, we will support the parallel training of DPO. However, it seems that the content of the parallel training config file is almost the same as that of...

CLA Signed

When setting max_response_length >= 16k, I'm encountering OOM errors even with ulysses_sequence_parallel_size >= 2 when I set use_dynamic_bsz = False. ``` for epoch in range(self.config.ppo_epochs): for batch_idx, data in enumerate(dataloader):...