aiweker comments

Results 5 comments of


                                            aiweker

the result of running is noise, why? please help

base_model_path content are from SG161222--RealVisXL_V3.0-11ee564ebf4bd96d90ed5d473cb8e7f2e6450bcf.tar ``` shell SG161222/ ├── model_index.json ├── scheduler ├── text_encoder ├── text_encoder_2 ├── tokenizer ├── tokenizer_2 ├── unet └── vae ```

the result of running is noise, why? please help

I found that my GPU do not support bfloat16， it was solved by changing bfloat16 to float16.

what differences Between the GitHub Open-Source Version and the Paper Implementation of DeepSeek-Chat-Lite

Thank you for your reply. I’m using the main branch. Has the **FlashInfer** feature already been integrated? If not, I’m looking forward to its integration. I’m even more looking forward...

[Feature Request]How to measure the generation throughput(token/s)?

Has there been any progress on this issue? thank you

[Feature Request]How to measure the generation throughput(token/s)?

> > Has there been any progress on this issue? thank you > > There is a big gap on kernel implementation comparing to SOTA like vLLM, SGLang, or Ollama....