James Liu
Results
3
comments of
James Liu
Am seeing the same discrepency - I tried both 0-shot and 5-shot for Winogrande on `meta-llama/llama-2-7b-chat-hf` and get similar results (66.63, 66.46).
Yeah this is interesting. I wonder if MMLU is contaminated
yes its primarily a decoding kernel. A.4 is based on loss evals, where the sparsity (both weight and activation) is simulated