Maosheng Liao

https://blog.bobliao.xyz

Nvidia

Shanghai, China

Now I am working with talented teammates!

Results: 26 comments of Maosheng Liao

g2o_viewer

If anyone still has a problem with `g2o_viewer` under 14.04, I wrote a [blog post](https://blog.bobliao.fun/2019/04/30/g2o-in-ubuntu14-04/) on how to install the g2o components under Ubuntu 14.04. Have a look if you need it, and contact me if you have any questions.

how to convert onnx model to saved model

@winnietsang If you update this, you should document it.

how to convert onnx model to saved model

@winnietsang And I didn't see this. https://github.com/onnx/onnx-tensorflow/blob/main/doc/API.md#onnx_tfbackend_reptensorflowrepexport_graph Now I want to know how I can export a frozen graph. Do I need to convert it to a frozen graph using the TensorFlow API? Glad...

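A question for everyone: how is this press-time coefficient determined?

@hao0917 Fine-tuning it down to the hundredths place must have taken real effort :laughing: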

[BUG] Example 09_turing_tensorop_conv2dfprop does not work

I got the same error. It seems the assertion failed in `integer_subbyte`, which is weird.

[Feature]: Could you please publish some docs like cuda programming guide?

Thanks for your feedback. Much better now! Thanks!

[HIPIFY][feature] Support for `fp8` data types

Thanks for replying. Hope to see the fp8 support available.

[HIPIFY][feature] Support for `fp8` data types

> > Thanks for replying. Hope to see the fp8 support available.
>
> Hi @foreverlms,
>
> It is available now. Going to revise fp16 soon.

Thanks, this could...

Enable flashinfer for dsv2.

> @foreverlms may you paste the gsm8k result? (triton backend vs flashinfer backend)
>
> ```shell
> python3 benchmark/gsm8k/bench_sglang.py --num-shots 8 --num-questions 1319 --parallel 1319
> ```

Is this for...

Enable flashinfer for dsv2.

Got:

```
Accuracy: 0.453
Invalid: 0.005
Latency: 37.627 s
Output throughput: 4522.078 token/s
```

Compared with mla not enabled:

```
Accuracy: 0.659
Invalid: 0.002
Latency: 40.824 s
Output throughput: 4552.229...
```
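To make the gap between the two runs quoted above easier to read, here is a minimal sketch that computes the absolute and relative differences. The dictionary names and the `compare` helper are illustrative only and are not part of the sglang benchmark script. The second run's throughput is truncated in the comment, so throughput is left out rather than guessed.

```python
# Figures copied from the two gsm8k runs quoted above.
# Throughput is omitted: the second value is truncated in the source.
got = {"accuracy": 0.453, "invalid": 0.005, "latency_s": 37.627}
mla_not_enabled = {"accuracy": 0.659, "invalid": 0.002, "latency_s": 40.824}


def compare(new, old):
    """Return absolute and relative change of `new` against `old`, per metric."""
    return {
        key: {"abs": new[key] - old[key], "rel": (new[key] - old[key]) / old[key]}
        for key in old
    }


diff = compare(got, mla_not_enabled)
for metric, change in diff.items():
    print(f"{metric}: {change['abs']:+.3f} ({change['rel']:+.1%})")
```

On these numbers, accuracy is 0.206 lower and latency is 3.197 s lower than in the run with mla not enabled, so the accuracy loss is far larger than the latency gain.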
