Guang Yang

Results 2 issues of Guang Yang

I tried to fine tune the code generation tasks on specific domains using lora for the 220M, 770M, 2B, 6B models. When I kept the hyperparameters consistent (target_modules I set...

**What features would you like to see? Is it related to a problem or a new feature you'd like to see? Please describe.** Now, both transformers and VLLM support the...

feature request