dyyoungg
According to the paper, the first step should be to let the language model understand the semantics of speech, so a SpeechTokenizer is used to obtain a discretized token representation of the audio. Logically, the SpeechTokenizer codebook representations should be projected (a projection is needed because the hidden dims may not match) and added to the language model's embedding layer, with the speech tokens then predicted autoregressively. But I don't see any SpeechTokenizer-related code in the repo, and the tokenize part looks more like vocabulary expansion, training on the text transcripts of the speech, which I really can't understand.

```python
# line 230 ~ 250
text_column_name = "text" if "text" in column_names else column_names[0]

def tokenize_function(examples):
    output = tokenizer(examples[text_column_name])
    return output

tokenized_cache_file_names = {...
```
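For concreteness, here is a minimal sketch of the projection approach described above: look up discrete speech token ids in a frozen codebook, then apply a learned linear projection to match the LM hidden size. This is purely illustrative, not code from the repo; it uses NumPy in place of the actual training framework, and all sizes (`CODEBOOK_SIZE`, `CODEBOOK_DIM`, `LM_HIDDEN`) are placeholder assumptions.

```python
import numpy as np

# Placeholder sizes -- the real SpeechTokenizer codebook size and the LM
# hidden dim depend on the checkpoints actually used (assumptions).
CODEBOOK_SIZE = 1024   # number of discrete speech tokens
CODEBOOK_DIM = 768     # dim of each codebook vector
LM_HIDDEN = 2048       # language model hidden dim

rng = np.random.default_rng(0)
codebook = rng.normal(size=(CODEBOOK_SIZE, CODEBOOK_DIM))      # frozen codebook
W = rng.normal(size=(CODEBOOK_DIM, LM_HIDDEN)) * 0.02          # learned projection

def embed_speech_tokens(token_ids):
    # Look up the discrete speech tokens in the codebook, then project them
    # into the LM embedding space so they can sit alongside text embeddings.
    return codebook[token_ids] @ W

ids = rng.integers(0, CODEBOOK_SIZE, size=(2, 10))  # (batch, seq)
print(embed_speech_tokens(ids).shape)               # (2, 10, 2048)
```

By contrast, the vocabulary-expansion route seen in the quoted code skips the codebook lookup entirely and simply adds new token ids to the text tokenizer, learning their embeddings from scratch.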
I compared two experimental data setups. Setting 1: WenetSpeech (Chinese) only. Setting 2: WenetSpeech + GigaSpeech (about 1:1, Chinese + English). Interestingly, the loss under setting 1 doesn't decrease normally (blue...
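For clarity, a 1:1 mixture as in setting 2 typically means interleaving the two corpora so each step sees both languages at roughly equal frequency. A hypothetical sketch (the lists here are stand-ins, not the real datasets):

```python
# Placeholder samples standing in for WenetSpeech (zh) and GigaSpeech (en).
wenet = [f"zh_{i}" for i in range(4)]
giga = [f"en_{i}" for i in range(4)]

def mix_one_to_one(a, b):
    # Alternate examples from the two corpora, giving an exact 1:1 ratio
    # up to the length of the shorter corpus.
    for x, y in zip(a, b):
        yield x
        yield y

mixed = list(mix_one_to_one(wenet, giga))
print(mixed)  # ['zh_0', 'en_0', 'zh_1', 'en_1', 'zh_2', 'en_2', 'zh_3', 'en_3']
```

In practice a shuffled sampler with per-source weights is more common than strict alternation, but the effective ratio is the same.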
Hello, the download link seems to be unavailable: https://github.com/fishaudio/chinese-hubert-soft/releases/download/v1/chinese-hubert-soft-v1.ckpt. Could you please check or provide an updated link?
## Summary
This PR introduces a comprehensive performance overhaul of the multimodal resource allocation pipeline. It refactors both the `httpserver.manager` and the server (`CacheServer`) to replace sequential, "chatty" operations with...
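The summary is truncated, but eliminating "chatty" sequential operations usually means coalescing many per-item round trips into a single batched request. A toy sketch of that pattern, with a stand-in class (`FakeCacheServer`, `get_many`, and the simulated latency are all assumptions, not the PR's actual API):

```python
import time

class FakeCacheServer:
    """Stand-in for a cache server; names and latency are assumptions."""
    LATENCY = 0.001  # simulated cost of one network round trip

    def __init__(self):
        self.store = {f"res{i}": i for i in range(100)}

    def get(self, key):
        time.sleep(self.LATENCY)       # one round trip per key
        return self.store[key]

    def get_many(self, keys):
        time.sleep(self.LATENCY)       # one round trip for the whole batch
        return [self.store[k] for k in keys]

server = FakeCacheServer()
keys = [f"res{i}" for i in range(50)]

# "Chatty": one round trip per resource.
t0 = time.perf_counter()
sequential = [server.get(k) for k in keys]
chatty_time = time.perf_counter() - t0

# Batched: one round trip total, same results.
t0 = time.perf_counter()
batched = server.get_many(keys)
batched_time = time.perf_counter() - t0

print(f"chatty={chatty_time:.3f}s batched={batched_time:.3f}s")
```

The win scales with the number of items times the per-call latency, which is why this kind of refactor tends to dominate other micro-optimizations in RPC-heavy pipelines.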
Hello VILA team! First, thank you for open-sourcing this incredible family of Vision Language Models! The work on VILA and NVILA is truly impressive, and the focus on efficiency and...