Wang Jiaqi
Results
2
comments of
Wang Jiaqi
hello, I used the same code, conda env, hyper-parameters and dataset, just replaced the base model with LLAMA-2. It did run, but the loss couldn't converge, which was the problem...
I met the same issues on my custom dataset and qwen2.5vl 7b model.