mphilippnv comments

Results: 9 comments of mphilippnv

[Model] DeepSeek-V3 Enhancements

DeepSeek-V3 doesn't appear to support pipeline parallelism. I get this error when attempting to deploy to two 8x H100 nodes: ``` NotImplementedError: Pipeline parallelism is only supported for the...

[Model] DeepSeek-V3 Enhancements

> @july8023 It should work on 4090, generally the model takes about 600GB of memory, then you want about 100-300GB for KV cache so feel free to plan around that. @fsaudm...
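The sizing advice quoted above can be turned into a rough back-of-the-envelope check. This is only a sketch: the 600GB and 100-300GB figures come from the quoted comment, while the per-card capacities (24GB for a 4090, 80GB for an H100) and the helper name `gpus_needed` are assumptions added for illustration. It ignores activation memory, runtime overhead, and any constraint that the GPU count must divide evenly across tensor or pipeline parallel ranks.

```python
import math


def gpus_needed(weights_gb: float, kv_cache_gb: float, gpu_mem_gb: float) -> int:
    """Smallest GPU count whose combined memory covers weights plus KV cache."""
    return math.ceil((weights_gb + kv_cache_gb) / gpu_mem_gb)


# Figures quoted in the comment: about 600 GB of weights, 100-300 GB of KV cache.
WEIGHTS_GB = 600

for kv_gb in (100, 300):
    # Assumed capacities: 24 GB per RTX 4090, 80 GB per H100.
    n_4090 = gpus_needed(WEIGHTS_GB, kv_gb, 24)
    n_h100 = gpus_needed(WEIGHTS_GB, kv_gb, 80)
    print(f"KV cache {kv_gb} GB: {n_4090} x 4090 or {n_h100} x H100")
```

Under these assumptions, two 8x H100 nodes (16 cards) would cover both ends of the quoted KV cache range, but treat the result as a lower bound rather than a deployment plan.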

[Bugfix]: serialize config instances by value when using --trust-remote-code

Just a heads up, I am trying to incorporate this into my pipeline-parallel setup for DeepSeek-V2. I modified my Dockerfile to build vLLM from this branch. I...

[Bugfix]: serialize config instances by value when using --trust-remote-code

> > @justinthelaw @mphilippnv does this PR solve your problem?

@youkaichao I am still having trouble building my Docker container with the config changes. I have modified my local vllm...

[Bugfix]: serialize config instances by value when using --trust-remote-code

> @mphilippnv Have you had a chance to verify these changes in your environment?

No. I have too much work coming in. I haven't had time to really play with it.

[Bug] vllm deploy InternVL3_5-241B-A28B error

Also seeing this error when deploying on H100s using vLLM v0.10.2.

```
--port 8002 --model /config/models/model --tensor-parallel-size 8 --disable-log-requests --enable-chunked-prefill --enable-prefix-caching --max-model-len 32768 --served-model-name intern-vl-241b-a28b --trust-remote-code --enable-expert-parallel
```

Jetbrains Integration Codebase Context Bug with FTS Database

Jetbrains Integration Codebase Context Bug with FTS Database

[CON-261] Only first module detected in JetBrains editors