Vensen
### Description

This PR resolves a `KeyError` that occurs when attempting to serve a Gemma-3 model that has been quantized by the `vllm-project/llm-compressor` library. It makes the SigLIP vision model...
## Purpose

This PR fixes a potential out-of-bounds (OOB) memory access in the `gather_and_maybe_dequant_cache` CUDA kernel, as originally reported in Issue #27909. The bug was identified by static analysis. The...
Hi, Maintainer. This PR fixes the issue described in #1438. Problem: The preprocessing code for the BraTS nnU-Net example was using `np.where(image[i]
Fixes #3605

## Description

This PR addresses the issue of broken links detected by the `github-actions` bot. It fixes three incorrect URLs in the documentation:

- In `CONTRIBUTING.md`, removed an...