Le Tuan Thanh

Results 4 issues of Le Tuan Thanh

Could you tell me about the detail of the way you build the global graph of all relations from Wikidata?

I built the engines for T5 model with the following scripts for the latest version of TensorRT-LLM: ``` export MODEL_DIR="path_to_t5_model" # or "flan-t5-small" export MODEL_NAME="t5model" export MODEL_TYPE="t5" export INFERENCE_PRECISION="float16" export...

I try inference my T5 model with C++ runtime used Paged KV at the commit `b777bd64750abf30ca7eda48e8b6ba3c5174aafd`. Its result is normal when inference with single input text, but with multiple input...

triaged
Investigating

* I am following the example of BART LoRA at [here](https://github.com/NVIDIA/TensorRT-LLM/tree/main/examples/enc_dec#run-bart-with-lora). I cannot convert the engines without flag `--context_fmha disable`. I change `test_remove_padding = False` then running `run.py` raises the...