ToduBem
Hi @Railcalibur, the error occurred because the first layer was set to int8 precision. If you want to use fp16 input, please set the precision of the first layer...
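For reference, a minimal sketch of what this could look like with the TensorRT Python API (not from the original thread; `network` and `config` are assumed to come from the usual `trt.Builder` setup):

```python
import tensorrt as trt

def force_first_layer_fp16(network: trt.INetworkDefinition,
                           config: trt.IBuilderConfig) -> None:
    """Request fp16 for the first layer of a mixed-precision build."""
    config.set_flag(trt.BuilderFlag.FP16)
    # Make TensorRT honor the per-layer precision request below.
    config.set_flag(trt.BuilderFlag.OBEY_PRECISION_CONSTRAINTS)
    first = network.get_layer(0)
    first.precision = trt.float16          # run the first layer in fp16
    first.set_output_type(0, trt.float16)  # keep its output in fp16 as well
```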
Using cuDLA requires that all layers be supported by DLA, so we moved several unsupported layers into post-processing so that the pipeline does not use GPU resources at runtime. Compared to cuDLA...
Then it is likely a bandwidth-bound issue. The DLA and the GPU both consume the same resource: system DRAM. The more bandwidth-bound a workload is, the higher the chance that both DLA...
Which chip are you using? Is this a general TensorRT question? If so, could you open a new issue at https://github.com/NVIDIA/TensorRT/issues?
Closing since there has been no activity for several months, thanks!
Sorry for the late reply. I checked the source code of trtexec on the 8.4 branch; you can delete this [line](https://github.com/NVIDIA/TensorRT/blob/release/8.4/samples/common/sampleEngines.cpp#L951) and recompile trtexec. The error says "kPREFER_PRECISION_CONSTRAINTS cannot be set...
> Is the default tensor format for computation kDLA_HWC4?

The [notes in the cuDLA-samples README](https://github.com/NVIDIA-AI-IOT/cuDLA-samples/blob/main/README.md#notes) list all DLA-supported formats. kDLA_HWC4 is recommended if the first layer is a convolution layer in...
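As an illustrative sketch (not from the thread), requesting kDLA_HWC4 on a network input with the TensorRT Python API could look like this; note that kDLA_HWC4 is only valid for fp16/int8 tensors:

```python
import tensorrt as trt

def use_dla_hwc4_input(network: trt.INetworkDefinition) -> None:
    """Request the kDLA_HWC4 format on the first network input."""
    inp = network.get_input(0)
    # kDLA_HWC4 requires fp16 or int8 I/O, so set the tensor type first.
    inp.dtype = trt.float16
    # allowed_formats is a bitmask of TensorFormat values.
    inp.allowed_formats = 1 << int(trt.TensorFormat.DLA_HWC4)
```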
Closing since there has been no activity for several months, thanks all!
The only way we recommend measuring DLA task execution time is with Nsight Systems.
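For example, an illustrative invocation might look like the following (the engine path is a placeholder, and trace options for DLA vary by Nsight Systems version, so check `nsys profile --help` on your target):

```
nsys profile -t cuda,nvtx -o dla_profile \
    ./trtexec --loadEngine=model.engine --useDLACore=0
```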
Closing since there has been no activity for several months, thanks!