Junrong Lin

Results 13 comments of Junrong Lin

Hi @bluechanel , thx for PR. Since the current open-source solution is entirely based on fastchat. How about also refactor the `opensource model`?

Hi! @dandansamax @Wendong-Fan Do you have time to take a look at this PR and give me some feedback?

Emmm ok I see some inflexibilities of directly applying this method like it cannot apply to a single message but should be a chat history. During inference it is fine...

@vermouth1992 Hi, the test `dataset.yml` encounter ``` ../../../../.local/lib/python3.10/site-packages/torch/__init__.py:290: in from torch._C import * # noqa: F403 E ImportError: libcudnn.so.9: cannot open shared object file: No such file or directory ```...

> > Update: flashinfer-python 0.2.3 has the same issue and 0.2.5 has OOM every time > > Log flashinfer_python 0.2.5 > > Update: flashinfer-python 0.2.5 works with newest commit and...

@zhaochenyang20 @zhuzilin @yizhang2077 @fzyzcjy

> Great job. But lint it please 😂 Done

> File "/sglang-checkout/python/sglang/srt/managers/scheduler_output_processor_mixin.py", line 373, in process_batch_result_decode .buffer[ ^^^^^^ AttributeError: 'NoneType' object has no attribute 'buffer' > > CI failed @ocss884 Done