Hans Han

Results 1 issues of Hans Han

Less memory requirement for 4bit and 8bit quantized models. https://huggingface.co/docs/transformers/main/main_classes/quantization