ZeroYuJie issues

Results 5 issues of


                                            ZeroYuJie

Can Models with Different vocab_size be Merged?

Great job for this toolkit . I'm attempting to merge two models with differing `vocab_size`: `augmxnt/shisa-7b-v1` (base) and `teknium/OpenHermes-2.5-Mistral-7B`. The `augmxnt/shisa-7b-v1` model has an expanded `vocab_size`. However, after merging them...

panic:fatal error: concurrent map writes

I got error panic: `concurrent map writes` , BPE `TokenizeWithCache` func, Concurrent read and write operations on the map can lead to a panic. ``` func (b BPE) TokenizeWithCache(sequence string)...

Question about example_flask.py

I found an example regarding using Flask for API requests. I gave it a try, but when making concurrent requests, the generated responses from the inference appear as garbled text....

VLLM 运行输出会输出不同语言

我在prompt里规定限制了语言，在使用https://github.com/dachengai/vllm 运行会出现输出不同语言的情况，在Transformers 中不会出现这种情况

【Frontend】Add sampler_priority and repetition_penalty_range

To solve https://github.com/vllm-project/vllm/issues/8835, add sampler_priority to control the execution order of the samplers, and add repetition_penalty_range to control penalties sampler token range

frontend

needs-rebase

stale