guojinma
guojinma
Will you release the emore + Glin-Asia data, or show how to combine the two?
骗了一波吆喝?
same problem too
Actually, we've just run it on Ubuntu and have no feasible sugestions for your problem. But is your spyder 32-bit version? If so, it may be solved by reinstalling a...
> The currently expected speed on the 3090 with this model and quantization is roughly 8 tokens/second (10-11 on 4090) Your log looks good except for the thread count, I've...
Another weird thing is that I test the model on three different GPUs like 3090, A6000 and A100 (40G),all the three GPU shows just nearly the same speed. Comparing with...
> I think all 3 of those are probably within 10-15% the same raw cuda processing speed and chip generation, they differ mostly in memory and multi gpu capabilities. Right...