fgdfgfthgr-fox
The issue seems to be resolved in the latest pull; at least I am no longer experiencing it. Likely fixed by commit de6a09d.
You should try with python 3.9.
> The q4_x files output from ggml are not compatible with llama.cpp?

It seems so currently.
Can you post your detailed system specs?
> Are you using the latest gradio version?
>
> ```
> pip install -r requirements.txt --upgrade
> ```

Updated to the latest gradio, still facing the same issue. My...
Updated to the latest webui version and reinstalled the dependencies again. Tried launching from both the terminal and PyCharm, still the same issue.
Updated to gradio 3.24.0, still no improvement. Maybe it's something to do with the web browser? I was using Firefox.
Seems fixed indeed!
I managed to make FlexGen work with the Galactica-1.3b model by changing opt_config.py, flex_opt.py and tokenizer_config.json. @oobabooga's webui can successfully load the model and generate text with it. VRAM use...
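The core of the change is registering Galactica's dimensions in FlexGen's model config table so it gets treated like an OPT variant. A minimal sketch of the idea only; the `ModelConfig` fields and the `get_config` helper here are illustrative, not FlexGen's actual API, and the dimensions should be verified against the Hugging Face facebook/galactica-1.3b config:

```python
from dataclasses import dataclass


@dataclass
class ModelConfig:
    """Illustrative stand-in for FlexGen's per-model config entry."""
    name: str
    hidden_size: int
    num_layers: int
    num_heads: int
    max_seq_len: int
    vocab_size: int


# Hypothetical registry mirroring the kind of table opt_config.py keeps for
# OPT variants. Dimensions below are assumptions to verify against the
# model's own config.json.
MODEL_CONFIGS = {
    "galactica-1.3b": ModelConfig(
        name="galactica-1.3b",
        hidden_size=2048,
        num_layers=24,
        num_heads=32,
        max_seq_len=2048,
        vocab_size=50000,
    ),
}


def get_config(name: str) -> ModelConfig:
    """Look up a registered model, in the spirit of FlexGen's get_opt_config."""
    if name not in MODEL_CONFIGS:
        raise ValueError(f"Unknown model: {name}")
    return MODEL_CONFIGS[name]
```

With an entry like this in place, the rest of the loader can size its weight buffers and attention layout from the returned fields instead of hard-coding OPT-only names.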
> @fgdfgfthgr-fox can you create a fork of https://github.com/FMInference/FlexGen with your changes?

@oobabooga https://github.com/fgdfgfthgr-fox/FlexGen---galactica-support Is this what you want?