
Results 10 comments of tricky

> An update, since this issue has been open for a long time. My model learned to attend, kinda. It still has issues during inference and I've been playing around...

> Which GPU are you using? By the way, there are runtime warnings below; is the error related to those warnings? I cannot find this file in my system....

> The recently released quantized training of LightSeq layers only supports the A100 or GPUs with compute capability 8.0 (sm_80) or higher, because it uses real int8 GEMM instead of fake quantization. If you...
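
For reference, a quick way to check whether the current GPU meets that requirement (a minimal sketch using PyTorch, not LightSeq's own check):

```python
import torch

# Compute capability as (major, minor), e.g. (7, 0) for V100, (8, 0) for A100.
major, minor = torch.cuda.get_device_capability()

if major >= 8:
    print(f"sm_{major}{minor}: real int8 GEMM quant training should be supported")
else:
    print(f"sm_{major}{minor}: below sm_80, expect the int8 GEMM path to be unavailable")
```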

No, I copied several directories and built them separately.

Or how do different model repositories do parallel computing on one GPU?

> No, the Python backend should be the same. Does the 24.05-py3 container include the ONNX, TensorRT, and TorchScript backends?

> @tricky61 The `nvcr.io/nvidia/tritonserver:24.05-py3` container contains the ONNX, TRT and PyTorch backends. The `nvcr.io/nvidia/tritonserver:24.05-trtllm-python-py3` container only has the TRTLLM and Python backends.

OK. I am using `nvcr.io/nvidia/tritonserver:24.05-vllm-python-py3`. I also use tritonserver:23.11 and add...
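
If in doubt, one way to confirm which backends a given image actually ships is to list the backend directory inside the container (a sketch assuming the standard `/opt/tritonserver/backends` layout):

```python
import os

# Run inside the Triton container; each backend is a subdirectory here.
backends_dir = "/opt/tritonserver/backends"
print(sorted(os.listdir(backends_dir)))
# 24.05-py3 is expected to show e.g. onnxruntime, tensorrt, pytorch, python, ...
```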

> You can upload the `trtexec --verbose` build log.

Thanks for your reply. Since it is difficult to send files out from my company's computer, I will try to upload...

> As suggested by [Jokeren](https://github.com/Jokeren), storing the temporary values to global memory and then reloading them works on V100 with the latest Triton version.

Which version will work?...
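
For anyone hitting the same problem, the workaround pattern looks roughly like this (a hypothetical Triton kernel, not the original code from the issue): instead of keeping the temporary value live in registers, it is stored to a global scratch buffer and immediately reloaded.

```python
import torch
import triton
import triton.language as tl

@triton.jit
def spill_kernel(x_ptr, scratch_ptr, out_ptr, n_elements, BLOCK: tl.constexpr):
    pid = tl.program_id(axis=0)
    offsets = pid * BLOCK + tl.arange(0, BLOCK)
    mask = offsets < n_elements

    x = tl.load(x_ptr + offsets, mask=mask)
    tmp = x * 2.0  # the temporary value that would otherwise stay in registers

    # Workaround: round-trip the temporary through global memory, then reuse the reloaded copy.
    tl.store(scratch_ptr + offsets, tmp, mask=mask)
    tmp = tl.load(scratch_ptr + offsets, mask=mask)

    tl.store(out_ptr + offsets, tmp + 1.0, mask=mask)

x = torch.randn(4096, device="cuda")
scratch = torch.empty_like(x)
out = torch.empty_like(x)
grid = (triton.cdiv(x.numel(), 1024),)
spill_kernel[grid](x, scratch, out, x.numel(), BLOCK=1024)
```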