DistServe: Disaggregating Prefill and Decoding for Goodput-optimized Large Language Model Serving

Open: pentium3 opened this issue 1 year ago • 1 comment

https://arxiv.org/pdf/2401.09670v1.pdf

https://www.usenix.org/conference/osdi24/presentation/zhong-yinmin

pentium3, Mar 21 '24 06:03