Yinghan Huang comments

Results 11 comments of


                                            Yinghan Huang

Chinese Text2video retrieval support?

> > How do you set the ckpt_path? ![image](https://private-user-images.githubusercontent.com/48858574/379989179-622dcd0f-981a-414f-a2a9-a9bb8806e92e.png?jwt=eyJhbGciOiJIUzI1NiIsInR5cCI6IkpXVCJ9.eyJpc3MiOiJnaXRodWIuY29tIiwiYXVkIjoicmF3LmdpdGh1YnVzZXJjb250ZW50LmNvbSIsImtleSI6ImtleTUiLCJleHAiOjE3MzAyNTMwNDIsIm5iZiI6MTczMDI1Mjc0MiwicGF0aCI6Ii80ODg1ODU3NC8zNzk5ODkxNzktNjIyZGNkMGYtOTgxYS00MTRmLWEyYTktYTliYjg4MDZlOTJlLnBuZz9YLUFtei1BbGdvcml0aG09QVdTNC1ITUFDLVNIQTI1NiZYLUFtei1DcmVkZW50aWFsPUFLSUFWQ09EWUxTQTUzUFFLNFpBJTJGMjAyNDEwMzAlMkZ1cy1lYXN0LTElMkZzMyUyRmF3czRfcmVxdWVzdCZYLUFtei1EYXRlPTIwMjQxMDMwVDAxNDU0MlomWC1BbXotRXhwaXJlcz0zMDAmWC1BbXotU2lnbmF0dXJlPTBkNzdiMzdmOWEwNThjMTlkNTNhNDQxM2ViY2ZlMjVhNDg4NGJiNjRiNTEwNDYxZmIxMWZlMTUwMzI0NWY5OGUmWC1BbXotU2lnbmVkSGVhZGVycz1ob3N0In0.LWUN6rnrpWGx1o1bfbsxpWQC86FPKDlffE-iq1j-y90) , if you set the ckpt of InternVideo2_Stage2 to `vision_ckpt_path`, it shouldn't meet size mismatch of `text_proj.weight`. > > Thanks, I...

Qwen3omni-30B-A3B-Instruct Lora微调训练一直卡住

> 我用ds_z0_config可以训练，ds_z3_config和ds_z2_config都会卡住。感谢大佬！我试下

Qwen3omni-30B-A3B-Instruct Lora微调训练一直卡住

> 可以试试单节点, 我是qwen3-vl:30b的. 3张卡会一直卡住, 单卡试了下可以. 我试了下单卡96G的情况下就CUDA OOM了，数据集小一点确实可以跑

Qwen3omni-30B-A3B-Instruct Lora微调训练一直卡住

> 我用ds_z0_config可以训练，ds_z3_config和ds_z2_config都会卡住。神奇。。我这确实一样的现象，ds_z0_config可以训练，是因为目前对于qwen3omni 只支持数据并行吗

qwen3omni微调后用官方推理脚本报错

> 看起来是chattemplate.jinja的问题你和原来的模型的模版对一下diff看看确实，原版没有这个文件而是chat_template.json, 而且微调后（左边）少了很多

qwen3omni微调后用官方推理脚本报错

> 看起来是chattemplate.jinja的问题你和原来的模型的模版对一下diff看看大佬，我用原版的chat_template替换之后，遇到了另一个比较奇怪的报错，我在微调的时候和原版一样保持了bf16=true，但是推理时在运行到 inputs = inputs.to(model.device).to(model.dtype) 这一步会报错: Traceback (most recent call last): File "/cpfs01/users/yinghan.huang/Software/anaconda3/envs/qwen3omni-dense/lib/python3.10/runpy.py", line 196, in _run_module_as_main return _run_code(code, main_globals, None, File "/cpfs01/users/yinghan.huang/Software/anaconda3/envs/qwen3omni-dense/lib/python3.10/runpy.py", line 86, in _run_code exec(code,...

Yinghan Huang

Chinese Text2video retrieval support?

Qwen3omni-30B-A3B-Instruct Lora微调训练一直卡住

Qwen3omni-30B-A3B-Instruct Lora微调训练一直卡住

Qwen3omni-30B-A3B-Instruct Lora微调训练一直卡住

qwen3omni微调后用官方推理脚本报错

qwen3omni微调后用官方推理脚本报错

qwen3omni微调后用官方推理脚本报错

RumtimeError: Expected to have have finished reduction in the prior iteration before starting a new one

RumtimeError: Expected to have have finished reduction in the prior iteration before starting a new one

RumtimeError: Expected to have have finished reduction in the prior iteration before starting a new one