Guang Yang
> Hi both, this might be due to the inference difference between CodeT5+ 220M/770M and 2B/6B/16B models. The former models are pretrained from scratch while the latter group of models...
Hello, the link has expired again. Could you update it?
> hi [@shimmyshimmer](https://github.com/shimmyshimmer) , **Seems there is a bug when using lora finetune Qwen3, I can't load saved merged model correctly but random init** > > ==((====))== Unsloth 2025.4.1: Fast...
Maybe 'glm4.py' is missing from 'unsloth/models' when trying to fine-tune. I tried modifying 'qwen2.py' to adapt it into a 'glm4.py', but failed. So, could unsloth provide some examples to support...