Vishal Agarwal

Results 3 issues of Vishal Agarwal

ran the following command to install TRT-LLM on windows ```sh pip install tensorrt_llm==0.11.0 --extra-index-url https://pypi.nvidia.com ``` Getting the following error. Followed instructions from [here](https://nvidia.github.io/TensorRT-LLM/installation/windows.html). ```sh Looking in indexes: https://pypi.org/simple, https://pypi.nvidia.com...

installation
stale

Add `-d` or `--n-depth` arg in llama-bench to run tests with prefilled KV cache context Relevant discussion https://github.com/ggml-org/llama.cpp/pull/12874 ### Sample output ```sh $ .\llama-bench.exe -d 0,512 | model | size...

examples