Results: 1 issue by Hans Han
Lower memory requirements for 4-bit and 8-bit quantized models. https://huggingface.co/docs/transformers/main/main_classes/quantization
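To make the memory claim concrete, here is a rough back-of-the-envelope sketch of how weight storage scales with bit width. It is not taken from the linked documentation. The helper `weight_memory_gib` and the 7-billion-parameter count are hypothetical, chosen only for illustration. The estimate covers the weights alone and ignores quantization metadata, activations, and any layers kept in higher precision, so real usage will be somewhat higher.

```python
def weight_memory_gib(num_params: int, bits_per_param: int) -> float:
    """Approximate memory for model weights alone, in GiB.

    Each parameter takes bits_per_param / 8 bytes; 2**30 bytes is one GiB.
    """
    return num_params * bits_per_param / 8 / 2**30


# Hypothetical parameter count, used only as an example.
NUM_PARAMS = 7_000_000_000

# Compare 16-bit weights with 8-bit and 4-bit quantized weights.
for bits in (16, 8, 4):
    print(f"{bits:>2}-bit weights: {weight_memory_gib(NUM_PARAMS, bits):.2f} GiB")
```

Under these assumptions, 8-bit weights need about half the memory of 16-bit weights, and 4-bit weights about a quarter.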