Laikh Tewari

Results: 9 comments by Laikh Tewari

@peri044 can you help fill in the instructions for the export flow?

@gs-olive would you ever want to set `construct_live=False` in the compile path? It sounds like this feature reduces device memory pressure between compilation and execution at the cost of added...

Looks like it, I have tensorrt==8.6.1.post1. Shouldn't that be installed automatically as a dependency when I `pip install torch_tensorrt`?

https://developer.nvidia.com/blog/cuda-pro-tip-the-fast-way-to-query-device-properties/ --- This check, which likely caused the perf issue, was added here: https://github.com/pytorch/TensorRT/blob/bf4474dc7816c184489d3985ce892315f5e0cc42/core/runtime/runtime.cpp#L81 The check invokes the constructor of the TensorRT wrapper object RTDevice::RTDevice (https://github.com/pytorch/TensorRT/blob/bf4474dc7816c184489d3985ce892315f5e0cc42/core/runtime/RTDevice.cpp#L16), and that constructor calls cudaGetDeviceProperties, which...
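For context, the linked blog post's point is that `cudaGetDeviceProperties` fills the entire `cudaDeviceProp` struct (which can be surprisingly slow per call), while `cudaDeviceGetAttribute` queries only the attributes you actually need. A minimal standalone sketch of the contrast (not the TensorRT runtime code itself):

```cpp
#include <cuda_runtime.h>
#include <cstdio>

int main() {
    int device = 0;

    // Slow path: fills the whole cudaDeviceProp struct, even fields you never read.
    cudaDeviceProp props;
    cudaGetDeviceProperties(&props, device);
    std::printf("SM %d.%d via cudaGetDeviceProperties\n", props.major, props.minor);

    // Fast path: query just the individual attributes that are needed.
    int major = 0, minor = 0;
    cudaDeviceGetAttribute(&major, cudaDevAttrComputeCapabilityMajor, device);
    cudaDeviceGetAttribute(&minor, cudaDevAttrComputeCapabilityMinor, device);
    std::printf("SM %d.%d via cudaDeviceGetAttribute\n", major, minor);
    return 0;
}
```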

Hi @matichon-vultureprime, we're discussing the best way to manage community highlights -- thanks for the PR and your patience!

Hi @stas00, thank you for raising this issue! TensorRT-LLM doesn't support Llama 3.2 (yet -- coming soon!), though I suspect from the code snippet shared, the question is about Llama...

Oops, copied the wrong username. Thanks @jinxiangshi!

Where is usage documented? I don't see any docs in the changed files list

Same issue observed running the dsv3 example `trtllm-bench` cmd on a **single node** H200. Command: `trtllm-bench --model deepseek-ai/DeepSeek-V3 --model_path /workspace/dsv0324/ throughput --backend pytorch --max_batch_size 2 --max_num_tokens 1160 --dataset /workspace/dataset.txt --tp 8 --ep...`