guojinma

Results 7 comments of guojinma

Will you release the emore + Glin-Asia data, or show how to combine the two?

Actually, we've just run it on Ubuntu and have no feasible sugestions for your problem. But is your spyder 32-bit version? If so, it may be solved by reinstalling a...

> The currently expected speed on the 3090 with this model and quantization is roughly 8 tokens/second (10-11 on 4090) Your log looks good except for the thread count, I've...

Another weird thing is that I test the model on three different GPUs like 3090, A6000 and A100 (40G),all the three GPU shows just nearly the same speed. Comparing with...

> I think all 3 of those are probably within 10-15% the same raw cuda processing speed and chip generation, they differ mostly in memory and multi gpu capabilities. Right...