Jonas Pfeiffer

Results 3 issues of Jonas Pfeiffer

Hey, Thanks for this amazing repo! I have downloaded the preprocessed GQA dataset with the two different strategies: ``` wget https://biglmdiag.blob.core.windows.net/oscar/datasets/gqa.zip unzip gqa.zip -d ./ ``` and ``` path/to/azcopy copy...

For validation it might be useful to have the generator loop through the data in the same order so that analysis is easier.

if correct embedding dim is not set from start, it stores the default 300 dim after reloading.