thot experiment
I'd been dealing with this same bug (I thought it was my own off-by-one; it was driving me up the wall), but it seems to be better now. I can repro the...
Just successfully launched; I changed nothing, just ran the script again on the off chance it would work. ¯\_(ツ)_/¯
FWIW I still occasionally get this. It was always intermittent and seems to be better now, but it does still happen from time to time. I just bumped to...
Same bug; does anyone know which commit this broke on? It was working maybe 2 or 3 days ago. I'll look into this. I believe going forward we should probably switch to...
So it looks like this API is something that's autogenerated by Gradio itself (sorry again for the naivete here; I really have no idea what's going on), and because of...
"Transformers bump" commit breaks gpt4-x-alpaca on an RTX 3090: the model loads but outputs gibberish
OK, so I'm trying to gather all the info I can about this gibberish issue, as it appears to persist for me regardless of tokenizer config, per this comment: [#1029](https://github.com/oobabooga/text-generation-webui/issues/1029#issuecomment-1502539767)...
I have the same issue as of a recent commit, and I am not using `--no-half-vae`. It happens on both a GV100 and a 1080 Ti, using torch 2 and xformers; I will try falling back...
FWIW I do not have `--opt-sdp-no-mem-attention` set explicitly, but perhaps it gets turned on implicitly by some other flag or configuration state? (I don't even see it listed in [the...
FWIW I'm able to run 3-bit 65B LLaMA on a single 32 GB GPU using AutoGPTQ, which is kinda neat, and it seems to be close to 65B q4 in...
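A back-of-envelope check (not from the thread; overhead for the KV cache, activations, and quantization metadata is ignored) shows why 3-bit weights squeeze 65B onto a 32 GB card while 4-bit is tight:

```python
def weight_size_gib(n_params: float, bits: int) -> float:
    """Raw packed weight size in GiB: params * bits, converted to bytes, then GiB."""
    return n_params * bits / 8 / 2**30

# 65B at 3-bit vs 4-bit quantization
print(f"3-bit 65B: {weight_size_gib(65e9, 3):.1f} GiB")  # ~22.7 GiB, fits on a 32 GB GPU
print(f"4-bit 65B: {weight_size_gib(65e9, 4):.1f} GiB")  # ~30.3 GiB, little headroom left
```

The ~7.5 GiB saved by dropping from 4-bit to 3-bit is what leaves room for the runtime overhead on a single 32 GB device.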
> @ortegaalfredo tried and got the same issue

I've been running into the same issue trying to run 65B on a heterogeneous system w/ a 1080 Ti 11 GB + GV100 32 GB...