James Liu comments

Repositories
Issues
Comments

Results 3 comments of


                                            James Liu

Winogrande Performance Discrepency

Am seeing the same discrepency - I tried both 0-shot and 5-shot for Winogrande on `meta-llama/llama-2-7b-chat-hf` and get similar results (66.63, 66.46).

it's strange for the MMLU result

Yeah this is interesting. I wonder if MMLU is contaminated

Incomplete implementation of SparseGEMV

yes its primarily a decoding kernel. A.4 is based on loss evals, where the sparsity (both weight and activation) is simulated