Fireblade2534

Results: 62 comments by Fireblade2534

When I use this command: `vllm serve Qwen/Qwen3-30B-A3B-FP8 --tensor-parallel-size 2 --enable-reasoning --reasoning-parser deepseek_r1 --host 0.0.0.0 --port 6060` I get this error: `[multiproc_executor.py:470] ValueError("type fp8e4nv not supported in this architecture....`
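
For context, that Triton error usually means the GPU's compute capability is too old for the FP8 kernels; as far as I know, `fp8e4nv` needs compute capability 8.9+ (Ada/Hopper), though that exact threshold is an assumption on my part, not something from the traceback. A quick sketch to check what the cards report:

```python
# Sketch: print each visible GPU's compute capability.
# Assumption: Triton's fp8e4nv dtype needs compute capability >= (8, 9)
# (Ada Lovelace / Hopper); older architectures raise "not supported in this architecture".
import torch

for i in range(torch.cuda.device_count()):
    major, minor = torch.cuda.get_device_capability(i)
    name = torch.cuda.get_device_name(i)
    ok = (major, minor) >= (8, 9)
    print(f"GPU {i}: {name} (sm_{major}{minor}) -> fp8e4nv {'likely supported' if ok else 'likely unsupported'}")
```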

@wangjia184 Why not use the api_key param so it is compatible with the OpenAI spec and API design in general?

@wangjia184 https://platform.openai.com/docs/api-reference/authentication @RBEmerson970 I think that having the option for authentication is a good idea as long as it can be disabled. Also, the implementation in this PR is not...

Agreed, authentication should be optional.
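
A minimal sketch of what optional, OpenAI-compatible auth could look like with a FastAPI dependency; the `API_KEY` env var and the route are my own illustration, not the implementation in the PR:

```python
# Sketch: optional Bearer-token auth, compatible with OpenAI clients that send
# "Authorization: Bearer <api_key>". If API_KEY is unset, auth is disabled.
# The env var name and route are illustrative, not taken from the PR.
import os
from fastapi import Depends, FastAPI, HTTPException, Request

app = FastAPI()
API_KEY = os.environ.get("API_KEY")  # leave unset to disable authentication

async def require_api_key(request: Request) -> None:
    # Auth is optional: if no key is configured, every request is allowed.
    if API_KEY is None:
        return
    auth = request.headers.get("Authorization", "")
    if auth != f"Bearer {API_KEY}":
        raise HTTPException(status_code=401, detail="Invalid or missing API key")

@app.get("/v1/models", dependencies=[Depends(require_api_key)])
async def list_models():
    return {"object": "list", "data": []}
```

Standard OpenAI clients already send the `api_key` they are constructed with as an `Authorization: Bearer ...` header, so nothing would need to change on the client side.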

How is this an issue? It achieves concurrency because every request is run in a different thread (this is default FastAPI behaviour). As far as I know this...
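
To illustrate the default behaviour I mean: FastAPI runs plain `def` endpoints in a threadpool, so a slow request does not block the others. A self-contained sketch (the route and sleep time are just for demonstration):

```python
# Sketch: FastAPI's default handling of sync endpoints.
# A plain "def" route is executed in a worker thread from a threadpool,
# so two simultaneous requests to /slow overlap instead of running one after the other.
import time
from fastapi import FastAPI

app = FastAPI()

@app.get("/slow")
def slow_endpoint():
    # Blocking work; because this is a sync def, FastAPI offloads it to a thread.
    time.sleep(5)
    return {"status": "done"}
```

Run it with uvicorn and fire two requests at the same time: both finish in roughly 5 seconds rather than 10.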

I also do not even have CUDA 12.4 installed.

@shivarajd Does #350 fix your issue? (You will have to clone the branch to test it, btw.)

@shivarajd The demo here: https://huggingface.co/spaces/Remsky/Kokoro-TTS-Zero is quite old. Do the voices sound muffled when you run the API locally? Also, as far as I know, Kokoro was not really trained...

@shivarajd Can you please give me some examples and test sentences?