Soumendu Kumar Ghosh issues

Results: 3 issues of Soumendu Kumar Ghosh

[BUG] Symbolic tracing not working for few models

**Describe the bug** I am working on quantization of a few timm models using Torch FX Graph Mode Quantization. Specifically, I am looking into post-training static quantization. For static models...

bug

Question on Performance Comparison using Different Cache Bit Precision

Testing the impact of KV cache quantization on the performance of the llama2 model shows a decrease in tokens/sec as the number of cache bits is reduced. However, the reduction in cache memory is...

Installation Issue

The following two package versions, as present in requirements.txt, are not found when using the pip install command.
```
torch==2.5.0.dev20240723+cu121
pytorch-triton==3.0.0+dedb7bdf33
```