TEAL issues

Hi author, Thanks for publishing such high-quality training-free sparisty research work. I saw that you have plans to integrate sparse computing and speculative decoding, and can you share me when...

CarlHuangNuc

Reproduction about speedup

Hello, I really enjoy this paper and the code. I want to reproduce the part below(speedup) ![Image](https://github.com/user-attachments/assets/9e73446b-316f-4f5b-9c6f-79efc5451a5f) So I tested with 1 A100(with following explanation in README), and obtained the...

quaternior

it's strange for the MMLU result

1

Hello, I test the MMLU, using different sparsity from 0 to 90%。But the mmlu score forcus on 50%, which didn't show significant decline as the sparsity becoming large. Since MMLU...

milktea888

Main

Fix the typo in Readme

shaginhekvs

Incomplete implementation of SparseGEMV

3

https://github.com/FasterDecoding/TEAL/blob/fb7373c93ac3594817c9ee64d4e08b47430a1822/kernels/sparse_gemv.py#L271 Hi, I notice that the SparseGEMV kernel only manage the case when `batch_size=1 & seqlen=1`. Beyond that case, the kernel outputs wrong answer. Is it expected that this kernel...

kyang-06

TEAL
TEAL copied to clipboard

Metadata

I want to eval the model on the MMUL, CEVAL, HUMAN-EVAL.... Could you give me some help?

When will support Speculative decoding in Training free activation sparsity base code ?

Reproduction about speedup

it's strange for the MMLU result

Main

Incomplete implementation of SparseGEMV

← Metadata

Owner

Metadata

TEAL TEAL copied to clipboard

Metadata

I want to eval the model on the MMUL, CEVAL, HUMAN-EVAL.... Could you give me some help?

When will support Speculative decoding in Training free activation sparsity base code ?

Reproduction about speedup

it's strange for the MMLU result

Main

Incomplete implementation of SparseGEMV

← Metadata

Owner

Metadata

TEAL
TEAL copied to clipboard