goodglitch
Can this be fixed, please? I am using the GGUF format, so I can't just change tokenizer_config.json. I have also tried modifying text_generation.py, but it doesn't fix the problem. Btw, in...
> For API I had to manually insert in completions.py the fields: 'skip_special_tokens': False, 'custom_stopping_strings': '""'
>
> as the other side doesnt insert those fields.

I think the API...
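For anyone hitting the same thing: depending on the webui version, those fields can often be sent in the request body itself instead of patching completions.py. A minimal sketch, assuming the OpenAI-compatible API on the default 127.0.0.1:5000 and that extra generation parameters in the body are passed through:

```python
import requests

# Extra generation parameters alongside the standard completion fields.
payload = {
    "prompt": "Hello",
    "max_tokens": 200,
    "skip_special_tokens": False,
    "custom_stopping_strings": '""',
}
r = requests.post("http://127.0.0.1:5000/v1/completions", json=payload)
print(r.json()["choices"][0]["text"])
```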
> did you try "mode": "chat-instruct",

Thanks for the reply! The mode "chat-instruct" produced exactly the same results as "chat". However, just "instruct" has done the job)) Do you know...
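In case it helps someone else, a minimal sketch of setting the mode over the API; it assumes the webui's OpenAI-compatible endpoint on the default 127.0.0.1:5000:

```python
import requests

payload = {
    "messages": [{"role": "user", "content": "Hello"}],
    "mode": "instruct",  # "chat" and "chat-instruct" gave worse results here
    "max_tokens": 200,
}
r = requests.post("http://127.0.0.1:5000/v1/chat/completions", json=payload)
print(r.json()["choices"][0]["message"]["content"])
```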
For those who wonder, you can use "/v1/internal/model/load" to load a model, so I'm closing the issue.
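A minimal sketch of that call, again assuming the default 127.0.0.1:5000; the model name is a placeholder, and any loader-specific options would go in an extra "args" field (check your webui version for the exact schema):

```python
import requests

# "my-model-Q4_K_M.gguf" is a placeholder for whatever sits in your models folder.
payload = {"model_name": "my-model-Q4_K_M.gguf"}
r = requests.post("http://127.0.0.1:5000/v1/internal/model/load", json=payload)
print(r.status_code)  # 200 once the model has loaded
```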
I am new to this library and thought I just couldn't find this feature, but it looks like it is missing. So +1 for colored text input.
Thanks! I will try escape sequences.
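For reference, a minimal sketch of colored input with plain ANSI escape sequences; it is not tied to any particular library and assumes a terminal that understands ANSI codes:

```python
GREEN = "\033[32m"   # switch foreground color to green
RESET = "\033[0m"    # back to the terminal default

# The prompt and whatever the user types are rendered in green,
# then the color is reset before the program continues printing.
text = input(f"{GREEN}You: ")
print(RESET, end="")
print(f"Got: {text}")
```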
You can't change a token's size (and you wouldn't want to), but you can increase the maximum number of tokens the LLM generates in response to your prompt: -n N, --n-predict N: Set...
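For example (assuming a recent llama.cpp build where the binary is called llama-cli; older builds call it main, and model.gguf plus the prompt are placeholders):

```
./llama-cli -m model.gguf -p "Write a haiku about autumn." -n 512
```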
Really waiting for the version bump to solve the problems with Llama 3.1. I hope the Oobabooga team does it soon.