Open-Assistant
Open-Assistant copied to clipboard
Distributed Sampler
Make the current sampler work correctly for distributed training
- split the dataset per epoch per device
- fix small error that cased the fractions/sizes to be ignored