Qnancy
Qnancy
I meet the same problem
have you solved it?
> same question... do you get the access? No one has replied to me yet...
> It's okay~ 感谢回复!我已经使用之前说的情况训练得到了ckpt_latest.pth,但在inference的时候出现了'San francisco, california, usa. june 2019: a man cleaning a couch. \n\nSan francisco, california, usa. june 2019: a man cleaning a couch.‘类似的乱码,可能是什么情况呢
> @Qnancy 如果您使用的是vicuna的语言模型,请检查一下是否是下载的delta模型并已经完成了权重的转换。 转换的过程您可以参考https://github.com/OpenGVLab/Ask-Anything/blob/main/video_chat/README_CN.md#running-%E4%BD%BF%E7%94%A8%E6%96%B9%E6%B3%95 感谢回复~我使用的是mistrial,且在训练前使用videochat2_mistral_7b_stage3.pth进行inference是完全没有问题的,将checkpoint_path变成训练得到的ckpt_latest.pth就输出了乱码
[train.log](https://github.com/user-attachments/files/16964706/train.log) 这是全部的train.log,非常感谢您的帮助~~
> Hi~ 训练后只会保存`更新过的权重`,考虑到你是冻结训练的,加载权重的时候,需要先加载原始的`videochat2_mistral_7b_stage3.pth`,再加载你的`ckpt_latest.pth` 原来是这样,我去尝试一下您说的操作,非常感谢!
you can add these two lines: import sys sys.path.append('../Ask-Anything/video_chat2')[your path]