yechenzhi
Results: 3 issues of yechenzhi
As previously discussed, we will support the parallel training of DPO. However, it seems that the content of the parallel training config file is almost the same as that of...
CLA Signed
When setting max_response_length >= 16k, I'm encountering OOM errors even with ulysses_sequence_parallel_size >= 2 when I set use_dynamic_bsz = False.
```
for epoch in range(self.config.ppo_epochs):
    for batch_idx, data in enumerate(dataloader):
        ...
```