puja93
Results
1
issues of
puja93
HI, i've been trying to reload a 4-bit quantized llama checkpoint. For any mlp or attention layer, it expands to 5 new layers (bias, g_idx, qweight, qzeros, scales. Each with...
needs-more-information