NOLA
NOLA copied to clipboard
QNoLA (PTQ and QAT)
I wanted to check if the current codebase support quantization. I'm interested in reproducing the PTQ and QAT results in Table 4 (Quantization of coefficients). Could you possibly provide instructions on how to replicate this experiment?