KavioYu comments

Results: 29 comments by KavioYu

[Roadmap] vLLM Roadmap Q2 2024

I'm very interested in implementing tree attention for speculative decoding. @simon-mo

Is there scripts to calculate the overall acceptance rate?

> After obtaining the result file, you can run the _[eagle/evaluation/alpha.py](https://github.com/SafeAILab/EAGLE/blob/main/eagle/evaluation/alpha.py)_ file to get the acceptance rate. [eagle/evaluation/gen_ea_alpha_llama2chat.py](https://github.com/SafeAILab/EAGLE/blob/main/eagle/evaluation/gen_ea_alpha_llama2chat.py) couldn't be executed. It seems to be because the ea_model forward interface...

[Develop] Performance Improving Feature

> 1 and 3 are interesting to us. 2 has been implemented here
>
> https://github.com/sgl-project/sglang/blob/5ff25cdf5b1310e83d9e595142b39ae4d7b561e9/python/sglang/srt/server_args.py#L426-L430
>
> , although there is still room for improvement.
> Please join our...

Speculative decoding with lookahead

[WIP] Spec infer with EAGLE2
