Aly M. Kassem comments

Results 10 comments of


                                            Aly M. Kassem

Error in Preprocessing the data

@AchintyaX @hamzakhalem The problem is the ```torchaudio.load``` you will find that it returns 2d array like that ```(2,82585)``` but it should be like that ```(82585,)```. So the solution is to...

Error in Preprocessing the data

Yes, it's not splitting. To be sure try ```librosa.load``` instead of ```torch.audio```, it will output a single array that is equal [0]. > I have the same bug but the...

Can't find MARBERT-v2

Hello @elmadany , the MARBERTv2 Model is deleted and released several times in hugging face hub models, but now I can't find it anymore. Will it be released soon?

Can't find MARBERT-v2

@elmadany, can you please release the pre-trained model soon because there is a competition for offensive and hate speech detection in the Arabic language and all participants will benefit from...

Can't find MARBERT-v2

Thanks, i appreciate your help.

Reward either goes down or stays stagnant

same problem here with a longer sequence. @vblagoje @lvwerra

Reward either goes down or stays stagnant

@adhitya-synth I used the same configuration as you mentioned and I found out that when the batch size is small it happens as you said but with a larger batch...

Reward either goes down or stays stagnant

Thanks for the clarification. But, I am mentioning that based on his observations when the batch size is small what he mentioned happens, but when I increased the batch size...

Larger models like GPT-J and GPT-NeoX-20B

To use these large models, you will need to parallelize them on multiple GPUs because they won't fit on a single GPU. I think they mentioned in the readme that...

Larger models like GPT-J and GPT-NeoX-20B

In their paper, I think they use the T5(220M) and GPT-2(117M). But I think the same methodology is applied by different papers (eg, InstructGPT). So you can give it a...