OpusTools
Add yield tuple write mode
Motivation. I want OpusRead.printPairs to work as a generator for a downstream task. Specifically, I intend to share OPUS as a Hugging Face dataset (see: DatasetBuilder._generate_examples in link).
Change. Added a yield_tuple write mode, which lets OpusRead.printPairs act as a generator that yields sentence pairs instead of printing or writing them.
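To illustrate the intended downstream use, here is a minimal sketch of how a generator of sentence pairs could feed a function shaped like DatasetBuilder._generate_examples, which yields (key, example) tuples. It does not call OpusTools or the datasets library: iter_pairs is a hypothetical stand-in for printPairs in yield_tuple mode, and the (source, target) tuple shape, the language codes, and the "translation" field layout are all assumptions made for this example.

```python
from typing import Dict, Iterable, Iterator, Tuple

Pair = Tuple[str, str]


def iter_pairs(pairs: Iterable[Pair]) -> Iterator[Pair]:
    """Hypothetical stand-in for OpusRead.printPairs in yield_tuple mode.

    Assumption: each yielded item is a (source, target) tuple of strings.
    """
    for src, tgt in pairs:
        yield src, tgt


def generate_examples(
    pairs: Iterable[Pair], src_lang: str = "en", tgt_lang: str = "fi"
) -> Iterator[Tuple[int, Dict[str, Dict[str, str]]]]:
    """Mimic the shape of DatasetBuilder._generate_examples.

    Yields (key, example) tuples, consuming the pairs lazily.
    """
    for idx, (src, tgt) in enumerate(pairs):
        yield idx, {"translation": {src_lang: src, tgt_lang: tgt}}


# Made-up sample data, used only to exercise the sketch.
sample = [("Hello", "Hei"), ("Thank you", "Kiitos")]
examples = list(generate_examples(iter_pairs(sample)))
```

Because both functions are generators, pairs are produced one at a time, so a large corpus does not have to be held in memory before the dataset is built.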
P.S. Thanks for this amazing package!