Alan May comments

Results: 13 comments by Alan May

impossible to load vicuna-13B-1.1-GPTQ-4bit-128g

As a temporary solution, you can [convert the GPTQ 4bit model locally](https://github.com/qwopqwop200/GPTQ-for-LLaMa/tree/cuda#llama). I will test compatibility with other models released by TheBloke.

Improve SSE User Experience

@VGEAREN I have made a similar modification before, but it has a problem that it is not compatible with the openai python sdk, because it will **send a ping event**...

[WIP] Fixe FSDP saving error

@merrymercy I can help with the test, since I had the same problem before. I will update with results later. --- Update: I tried this PR with 4×A100 (80G); training is OK, but it runs out of memory (OOM) when saving....

@merrymercy @zhisbug I tried several different settings using the FSDP API; all of them failed when saving the model. But based on [this comment](https://github.com/tatsu-lab/stanford_alpaca/issues/81#issuecomment-1494614864), I finally managed to save the model with **python3.10**...

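Question about the loss during the Chinese pre-training stage

Hi, I have a similar problem. May I ask what your initial loss value was? Mine starts decreasing from 8.0.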

FastChat - error on 4bit GPTQ

@zhisbug Hi, I made a new PR to address GPTQ-4bit. Can you take a look and give some advice? Thanks! #1209

MPT support

please🙏

Add Default Timeout to urllib.request.urlopen Calls to Prevent Potential Hanging

Same problem with sglang 0.2.13

Support `locals` for highlighting

> The discussion reads like semantic (or at least scope aware) highlighting should already work. However, I was not able to find any plugin that uses nvim-treesitter to implement something...

Autocomplete on vscode stops working

@Patrick-Erichsen I'm facing the same issue. The reason the completion was *a space character* is that the `stop` list contains "```", and the Qwen2.5-Coder-7B-Instruct completion result is ``` ```python\n # Uncomment...
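
The truncation described in this comment can be illustrated with a minimal sketch. It assumes the raw model output starts with a space followed by a code fence; the comment is cut off, so that exact shape is a guess. The helper `apply_stop_sequences` and the sample string are hypothetical and only mimic how a client typically cuts a completion at the first stop sequence; they are not the extension's actual code.

```python
from typing import List

FENCE = "`" * 3  # three backticks, built this way to keep the example readable


def apply_stop_sequences(text: str, stop: List[str]) -> str:
    """Cut `text` at the earliest occurrence of any stop sequence (hypothetical helper)."""
    cut = len(text)
    for seq in stop:
        if not seq:
            continue  # an empty stop string would match at index 0, so skip it
        idx = text.find(seq)
        if idx != -1:
            cut = min(cut, idx)
    return text[:cut]


# Assumed shape of the raw model output: a space, then a fenced Python block.
raw_completion = " " + FENCE + "python\n# Uncomment the next line\n" + FENCE

truncated = apply_stop_sequences(raw_completion, [FENCE])
print(repr(truncated))  # ' '
```

Under that assumption, everything from the first fence onward is discarded, so only the leading space survives as the completion. Removing the fence from the `stop` list, or stripping the fence from the output before applying stop sequences, would avoid this.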