ChaunceyChan comments

Results: 8 comments by ChaunceyChan

how to set `model_transaction_policy`

+1, have you solved the problem?

About hubert finetuning

Hi, were you able to reproduce the 10h result reported in the paper using the base_10h.yaml config?

Loss Nan Value

Is it normal to run into NaN when training in fp16?

Loss Nan Value

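> @xxchauncey You can use bf16. Its performance is slightly worse than fp16, but it rarely runs into NaN.

Thanks. I only recently started working on audio separation. A while ago I switched between several backbones, and every one of them hit NaN midway through training. On V100 cards the only fix was to switch back to 32-bit precision and continue training. I never ran into this with ASR or small NLP models, so I'm curious about it.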
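To make the fp16 vs. bf16 trade-off above concrete, here is a minimal standard-library sketch, not tied to any training framework. It rounds a value to each format: fp16 has a largest finite value of 65504, so a large activation overflows to infinity (and `inf - inf` then gives NaN), while bf16 keeps the float32 exponent range but carries fewer mantissa bits. The helper names `to_fp16` and `to_bf16` are made up for illustration, and `to_bf16` assumes a finite input that fits in float32.

```python
import math
import struct


def to_fp16(x: float) -> float:
    """Round x to IEEE 754 half precision; overflow becomes +/-inf, as on hardware."""
    try:
        return struct.unpack("<e", struct.pack("<e", x))[0]
    except OverflowError:
        # struct refuses to pack finite values that are too large for fp16.
        return math.copysign(math.inf, x)


def to_bf16(x: float) -> float:
    """Round x to bfloat16 (upper 16 bits of float32, round-to-nearest-even).

    Assumes x is finite and representable in float32.
    """
    bits = struct.unpack("<I", struct.pack("<f", x))[0]
    rounding = 0x7FFF + ((bits >> 16) & 1)  # round-to-nearest-even bias
    bits = (((bits + rounding) >> 16) << 16) & 0xFFFFFFFF
    return struct.unpack("<f", struct.pack("<I", bits))[0]


big = 70000.0    # a large intermediate value, e.g. an unnormalised activation
small = 1.001    # a value that needs fine resolution near 1.0

print("fp16:", to_fp16(big), to_fp16(small))
print("bf16:", to_bf16(big), to_bf16(small))

# Once an infinity appears, ordinary arithmetic produces NaN.
overflowed = to_fp16(big)
print("inf - inf =", overflowed - overflowed)
```

The fp16 line shows the range problem and the bf16 line shows the precision cost. V100 cards have no hardware bf16 support, which is consistent with falling back to 32-bit precision there.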

About the amount of data for model fine-tuning

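> Speaker verification models cannot be fine-tuned directly, because doing so would scramble the speaker IDs learned during the earlier training.

Would it be possible to fine-tune by adding new speaker IDs and expanding the model's final linear layer to match, similar to how the vocabulary is extended in NLP? Also, I noticed that among the CAM++ models, the "larger dataset" version saved the classifier weights, but the "extra-large dataset" version did not.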
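The layer-expansion idea in the question above can be sketched in plain Python, under stated assumptions: the classifier is a simple linear layer with one weight row and one bias per speaker ID, and the rows for existing speakers are kept while new rows are appended with small random values. `expand_classifier` and `logits` are hypothetical helpers, not functions from the repo, and this does not show whether the approach actually preserves verification accuracy.

```python
import random


def expand_classifier(weight, bias, num_new, init_scale=0.01, seed=0):
    """Return (weight, bias) with num_new extra speaker rows appended.

    weight: list of rows, one per speaker ID, each of length embedding_dim.
    bias:   list with one entry per speaker ID.
    Existing rows are copied unchanged so old speaker IDs keep their meaning.
    """
    rng = random.Random(seed)
    dim = len(weight[0])
    new_rows = [
        [rng.gauss(0.0, init_scale) for _ in range(dim)]
        for _ in range(num_new)
    ]
    new_weight = [row[:] for row in weight] + new_rows
    new_bias = list(bias) + [0.0] * num_new
    return new_weight, new_bias


def logits(weight, bias, embedding):
    """Compute one logit per speaker ID for a single embedding vector."""
    return [
        sum(w * x for w, x in zip(row, embedding)) + b
        for row, b in zip(weight, bias)
    ]


# Toy classifier: 3 known speakers, 4-dimensional embeddings.
old_w = [[0.1, 0.2, 0.3, 0.4], [0.5, -0.1, 0.0, 0.2], [-0.3, 0.3, 0.1, 0.0]]
old_b = [0.0, 0.1, -0.1]
emb = [1.0, 0.5, -0.5, 2.0]

new_w, new_b = expand_classifier(old_w, old_b, num_new=2)
old_scores = logits(old_w, old_b, emb)
new_scores = logits(new_w, new_b, emb)

print(len(old_scores), "->", len(new_scores))
print("old speaker logits unchanged:", new_scores[:3] == old_scores)
```

In a deep learning framework the same idea would mean copying the saved classifier weights into a larger output layer, which is only possible for checkpoints that actually store those weights.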

About the amount of data for model fine-tuning

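Hello, I noticed that the repo has been updated with incremental learning, but one thing puzzles me. The notes say that fine-tuning the model directly may make it unusable, yet in the table the metrics for direct fine-tuning are better than those for incremental learning, and even better than training from scratch. How should this be understood?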

[bug] Encountered an error in forwardAsync function: Assertion failed: mNextBlocks.empty()

@hypdeb could you please provide some details about how this bug is triggered? I'm using a V100 GPU, which, according to the update logs, is not supported by any version above v0.15.0.

Error malloc(): unaligned tcache chunk detected Always Occur after tensorrt server handling a certain amount requests

Ran into the same problem. Any updates?