Results 5 comments of aiweker

base_model_path content are from SG161222--RealVisXL_V3.0-11ee564ebf4bd96d90ed5d473cb8e7f2e6450bcf.tar ``` shell SG161222/ ├── model_index.json ├── scheduler ├── text_encoder ├── text_encoder_2 ├── tokenizer ├── tokenizer_2 ├── unet └── vae ```

I found that my GPU do not support bfloat16, it was solved by changing bfloat16 to float16.

Thank you for your reply. I’m using the main branch. Has the **FlashInfer** feature already been integrated? If not, I’m looking forward to its integration. I’m even more looking forward...

Has there been any progress on this issue? thank you

> > Has there been any progress on this issue? thank you > > There is a big gap on kernel implementation comparing to SOTA like vLLM, SGLang, or Ollama....