Wang Jiaqi

2 comments by Wang Jiaqi

Hello, I used the same code, conda env, hyper-parameters, and dataset, and only replaced the base model with LLAMA-2. It ran, but the loss did not converge, which was the problem...

I ran into the same issue with my custom dataset and the Qwen2.5-VL 7B model.