DwenGu
Results: 2 issues of DwenGu
Hi @dkurt, thanks for sharing! Does this op work with other arguments, for example F.grid_sample(img, grid, mode='bilinear', padding_mode='reflection', align_corners=True)? Will it be supported in iGPU in the...
We are using vLLM/FlashInfer to optimize LLM models. Low latency and throughput@latency are the two scenarios that customers care about most. W4A8 Grouped GEMM kernel perf is the key point for...
feature request