Aly M. Kassem
Aly M. Kassem
@AchintyaX @hamzakhalem The problem is the ```torchaudio.load``` you will find that it returns 2d array like that ```(2,82585)``` but it should be like that ```(82585,)```. So the solution is to...
Yes, it's not splitting. To be sure try ```librosa.load``` instead of ```torch.audio```, it will output a single array that is equal [0]. > I have the same bug but the...
Hello @elmadany , the MARBERTv2 Model is deleted and released several times in hugging face hub models, but now I can't find it anymore. Will it be released soon?
@elmadany, can you please release the pre-trained model soon because there is a competition for offensive and hate speech detection in the Arabic language and all participants will benefit from...
Thanks, i appreciate your help.
same problem here with a longer sequence. @vblagoje @lvwerra
@adhitya-synth I used the same configuration as you mentioned and I found out that when the batch size is small it happens as you said but with a larger batch...
Thanks for the clarification. But, I am mentioning that based on his observations when the batch size is small what he mentioned happens, but when I increased the batch size...
To use these large models, you will need to parallelize them on multiple GPUs because they won't fit on a single GPU. I think they mentioned in the readme that...
In their paper, I think they use the T5(220M) and GPT-2(117M). But I think the same methodology is applied by different papers (eg, InstructGPT). So you can give it a...